Skip to content

Latest commit

 

History

History
22 lines (15 loc) · 373 Bytes

README.md

File metadata and controls

22 lines (15 loc) · 373 Bytes

Guardian comments scraper

Scrapes comments from guardian articles and outputs them to JSON or CSV formats.

Requires python and BeautifulSoup (pip install BeautifulSoup)

Usage

python scrape.py <url>

With format specified (defaults to json)

python scrape.py <url> -o csv

With a list of links (one per line)

python scrape.py <file.txt>