Benefits of using scrapy over requests/selenium

IceSea@lemmy.world · 1 year ago

Benefits of using scrapy over requests/selenium

Wats0ns@programming.dev · 1 year ago

The huge feature of scrapy is it’s pipelining system: you scrape a page, pass it to the filtering part, then to the deduplication part, then to the DB and so on

Hugely useful when you’re scraping and extraction data, I reckon if you’re only extracting raw pages then it’s less useful I guess

qwertyasdef@programming.dev · 1 year ago

Oh shit that sounds useful. I just did a project where I implemented a custom stream class to chain together calls to requests and beautifulsoup.