@scrapy

Scrapy project

An open source and collaborative framework for extracting the data you need from websites in a fast, simple, yet extensible way.

Pinned

  1. scrapy Public

    Scrapy, a fast high-level web crawling & scraping framework for Python.

    Python, 58.1k stars, 11k forks

  2. scrapyd Public

    A service daemon to run Scrapy spiders

    Python, 3.1k stars, 574 forks

  3. parsel Public

    Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

    Python, 1.3k stars, 152 forks

  4. w3lib Public

    Python library of web-related functions

    Python, 412 stars, 108 forks

  5. protego Public

    A pure-Python robots.txt parser with support for modern conventions.

    DIGITAL Command Language, 70 stars, 28 forks

  6. itemadapter Public

    Common interface for data container classes

    Python, 68 stars, 12 forks

Repositories

Showing 10 of 29 repositories
  • itemadapter Public

    Common interface for data container classes

    Python, 68 stars, BSD-3-Clause, 12 forks, 10 issues, 2 pull requests, updated Aug 26, 2025
  • scrapy Public

    Scrapy, a fast high-level web crawling & scraping framework for Python.

    Python, 58,081 stars, BSD-3-Clause, 11,030 forks, 470 issues (20 issues need help), 217 pull requests, updated Aug 21, 2025
  • scrapy-lint Public

    A linter for Scrapy projects.

    Python, 20 stars, MIT, 4 forks, 40 issues (2 issues need help), 0 pull requests, updated Aug 16, 2025
  • scrapyd Public

    A service daemon to run Scrapy spiders

    Python, 3,059 stars, BSD-3-Clause, 574 forks, 6 issues, 0 pull requests, updated Aug 12, 2025
  • scrapyd-client Public

    Command line client for Scrapyd server

    Python, 777 stars, BSD-3-Clause, 146 forks, 5 issues, 0 pull requests, updated Aug 12, 2025
  • w3lib Public

    Python library of web-related functions

    Python, 412 stars, BSD-3-Clause, 108 forks, 11 issues (1 issue needs help), 5 pull requests, updated Aug 7, 2025
  • itemloaders Public

    Library to populate items using XPath and CSS with a convenient API

    Python, 47 stars, BSD-3-Clause, 16 forks, 18 issues (1 issue needs help), 4 pull requests, updated Jul 27, 2025
  • parsel Public

    Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

    Python, 1,265 stars, BSD-3-Clause, 152 forks, 34 issues (1 issue needs help), 12 pull requests, updated Jul 27, 2025
  • protego Public

    A pure-Python robots.txt parser with support for modern conventions.

    DIGITAL Command Language, 70 stars, BSD-3-Clause, 28 forks, 7 issues (3 issues need help), 0 pull requests, updated Jul 26, 2025
  • cssselect Public

    CSS Selectors for Python

    Python, 303 stars, 61 forks, 18 issues, 5 pull requests, updated Jul 26, 2025