aragog-crawler

Python web scraper for the wizardingworld.com website

Features

Scrapes the content of each of J.K. Rowling's original writings from https://www.wizardingworld.com/writing-by-jk-rowling

Install the dependencies: python3 -m pip install -r requirements.txt
Then call extract_content() from crawler.py. It returns a list of dictionaries, each containing the 'title' and 'text' of one writing (there were 93 as of November 2022).
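
The returned list can be handled like any other list of dictionaries. Below is a minimal sketch of two helpers for inspecting and saving it. The helper names (summarize, save_writings) and the placeholder data are illustrative and not part of crawler.py, and the commented-out call assumes extract_content() takes no arguments and that crawler.py is importable from the working directory.

```python
import json
from pathlib import Path


def summarize(writings):
    """Return (title, word_count) pairs for a list of {'title', 'text'} dicts."""
    return [(w["title"], len(w["text"].split())) for w in writings]


def save_writings(writings, path):
    """Write the writings to a UTF-8 JSON file and return how many were saved."""
    payload = json.dumps(writings, ensure_ascii=False, indent=2)
    Path(path).write_text(payload, encoding="utf-8")
    return len(writings)


# In the repository, the data would come from the scraper instead
# (assumption: extract_content() needs no arguments):
#   from crawler import extract_content
#   writings = extract_content()

# Placeholder data with the same shape, so the helpers can be tried offline.
sample = [
    {"title": "Example title A", "text": "First placeholder body."},
    {"title": "Example title B", "text": "Second placeholder body, slightly longer."},
]

for title, words in summarize(sample):
    print(f"{title}: {words} words")
```

Saving the result once with save_writings(writings, "writings.json") avoids re-scraping the site every time the data is needed.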

I don't own any of the content scraped. This project is for educational purposes only.
