Skip to content

Script for scraping expired domain names from Simply.com - Using BeautifulSoup and proxies. Saves data to MariaDB. Potential for threading in future updates. Ideal for SEO and domain investors.

License

Notifications You must be signed in to change notification settings

ha1fdan/DomainScraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Expired Domains Scraper for simply.com

This Python project scrapes a list of expired domains from a Danish hosting company named Simply.com using web scraping techniques. The domains listed on the webpage are those that have recently expired and are now available for registration. The script utilizes BeautifulSoup for parsing the HTML content and extracting domain names efficiently.


Features

  • Supports the use of proxies, including SOCKS proxies, for enhanced privacy and security during web scraping activities.
  • Inserts the extracted domain names into a MariaDB database for further analysis or processing.
  • Threading support is not currently implemented.

Get started

.env

DEBUG=False
URL="https://www.simply.com/en/ninja/"

# Proxy details
PROXY_HOST='proxy-nl.privateinternetaccess.com' #Default for PIA Proxy
PROXY_PORT='1080'                               #Default for PIA Proxy
PROXY_USERNAME=''                               #Your proxy username
PROXY_PASSWORD=''                               #Your proxy password

# MariaDB details
DB_HOST='localhost'                             #Database host
DB_USER='root'                                  #Database username
DB_PASSWORD=''                                  #Database password
DB_NAME=''                                      #Database name

Database table schema:

CREATE TABLE scraped_domains (
    id INT AUTO_INCREMENT PRIMARY KEY,
    domain VARCHAR(255),
    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
);

How to export database to csv? See here

About

Script for scraping expired domain names from Simply.com - Using BeautifulSoup and proxies. Saves data to MariaDB. Potential for threading in future updates. Ideal for SEO and domain investors.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages