Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Frontera | 1,244 | 12 | 3 | 7 months ago | 22 | April 05, 2019 | 98 | bsd-3-clause | Python | |
A scalable frontier for web crawlers | ||||||||||
Scrapy Cluster | 1,137 | 18 | 2 | 6 months ago | 15 | December 23, 2020 | 17 | mit | Python | |
This Scrapy project uses Redis and Kafka to create a distributed, on-demand scraping cluster. | ||||||||||
Netdiscovery | 557 | 3 years ago | apache-2.0 | Java | ||||||
NetDiscovery is a general-purpose crawler framework/middleware built on frameworks such as Vert.x and RxJava 2. | ||||||||||
Scrapy_demo | 150 | a year ago | 2 | Python | ||||||
All kinds of Scrapy demos | ||||||||||
Dodder | 71 | 3 years ago | 2 | mit | Java | |||||
A distributed DHT crawler that sniffs torrents from BitTorrent network | ||||||||||
Dig Etl Engine | 65 | 5 years ago | 58 | mit | ||||||
Download DIG to run on your laptop or server. | ||||||||||
Scrapy Kafka | 63 | 6 years ago | 1 | August 14, 2015 | 1 | apache-2.0 | Python | |||
Kafka-based components for Scrapy | ||||||||||
Scrapy Distributed | 40 | a year ago | 8 | February 20, 2021 | 10 | Python | ||||
A series of distributed components for Scrapy. Including RabbitMQ-based components, Kafka-based components, and RedisBloom-based components for Scrapy. | ||||||||||
Scrapy Kafka Redis | 35 | 3 years ago | 5 | July 24, 2018 | apache-2.0 | Python | ||||
Distributed crawling/scraping, Kafka And Redis based components for Scrapy | ||||||||||
Pomp Craigslist Example | 33 | 6 years ago | 2 | HTML | ||||||
Extract data from Craigslist.org using Python 3 and the pomp framework |