is there alternative project like this project? #976

socialpercon · 2021-09-19T02:23:47Z

pyspider version:
Operating system:
Start up command:

Expected behavior

Actual behavior

How to reproduce

is there alternative project like this project?

i don't understand why this project no longer maintainece. i think alternative project more powerful... but i don't know...

Chaffy-0 · 2021-12-06T23:40:30Z

Scrapy

JermellB · 2021-12-19T03:34:55Z

This project isn't maintained any more because their javascript rendering capability is done by phantomjs which is no longer maintained.

Like @Chaffy-0 said, Scrapy is likely the best option if you wanted to do a spider like this.

These days, elasticsearch comes paired with one if you were doing something simple and didn't need to collect and process your own data from the wild.

Most places I've done stuff @ will use things like selenium + chrome or firefox, paired with beautiful soup for the rendered html parsing. Then you could keep track of where you'd spider with simple things like a bloom filter implemented on top of redis or something.

But yeah, Scrapy if you don't feel like getting too dirty.

milahu · 2022-04-18T07:27:21Z

some active python web scraper projects
https://github.com/Gerapy/Gerapy
https://github.com/howie6879/ruia

roniemartinez · 2022-06-02T16:10:14Z

Just in case people will be interested in my project 🙇 : https://github.com/roniemartinez/dude

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

is there alternative project like this project? #976

is there alternative project like this project? #976

socialpercon commented Sep 19, 2021

Chaffy-0 commented Dec 6, 2021

JermellB commented Dec 19, 2021 •

edited

Loading

milahu commented Apr 18, 2022

roniemartinez commented Jun 2, 2022

is there alternative project like this project? #976

is there alternative project like this project? #976

Comments

socialpercon commented Sep 19, 2021

Expected behavior

Actual behavior

How to reproduce

Chaffy-0 commented Dec 6, 2021

JermellB commented Dec 19, 2021 • edited Loading

milahu commented Apr 18, 2022

roniemartinez commented Jun 2, 2022

JermellB commented Dec 19, 2021 •

edited

Loading