Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Cyborg | 300 | 6 years ago | 2 | Python | ||||||
Python web scraping framework | ||||||||||
Sasila | 264 | 5 years ago | 17 | November 02, 2017 | 1 | apache-2.0 | Python | |||
一个灵活、友好的爬虫框架 | ||||||||||
Scrapyz | 188 | 8 years ago | 6 | July 27, 2015 | Python | |||||
"Scrape Easy" - an extension of the Scrapy framework. | ||||||||||
Ayakashi | 177 | 2 | 10 months ago | 40 | June 29, 2023 | 8 | other | TypeScript | ||
:zap: Ayakashi.io - The next generation web scraping framework | ||||||||||
Scrapy S3pipeline | 66 | 1 | 2 years ago | 8 | January 31, 2021 | 1 | mit | Python | ||
Scrapy pipeline to store chunked items into Amazon S3 or Google Cloud Storage bucket. | ||||||||||
Dotnetcrawler | 63 | 4 years ago | C# | |||||||
DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c | ||||||||||
Scrapy Distributed | 40 | a year ago | 8 | February 20, 2021 | 10 | Python | ||||
A series of distributed components for Scrapy. Including RabbitMQ-based components, Kafka-based components, and RedisBloom-based components for Scrapy. | ||||||||||
Phoenix_pipeline | 39 | 7 years ago | 12 | mit | Python | |||||
Turning news into events since 2014. | ||||||||||
Nano Pipe | 31 | 2 | 4 years ago | 7 | May 27, 2020 | JavaScript | ||||
A tiny library (<450 bytes gzipped) to create chainable functions/pipelines including support for async generators. | ||||||||||
Scrapy Crawl Asp | 16 | 8 years ago | 1 | mit | Python | |||||
heavy-duty scraping framework for crawling ASP.net pages |