Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Would you still recommend Scrapy if the task wasn't specifically crawling?


Nope. It is very specifically tailored to crawling. If you just need something distributed why not check out RQ [0], Gearman [1] or Celery [2]? RQ and Celery are python specific.

[0] : http://python-rq.org/docs/

[1] : http://gearman.org/

[2] : http://docs.celeryproject.org


I once used it to automate the, well, scraping of statistics from an affiliate network account. So you can do pretty specific stuff, as long as it involves HTTP/HTTPS requests.


depends on the task. For example they have a decent file/image downloading middleware.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: