Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
Scrapy	49,918	4,185	445	5 months ago	96	September 18, 2023	692	bsd-3-clause	Python
Scrapy, a fast high-level web crawling & scraping framework for Python.
Huginn	42,091	69	52	11 days ago	8	September 22, 2017	698	mit	Ruby
Create agents that monitor and act on your behalf. Your agents are standing by!
Crawlee	12,871		42	2 days ago	747	December 10, 2023	96	apache-2.0	TypeScript
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
Awesome Web Scraping	6,060			7 months ago			1	other	Makefile
List of libraries, tools and APIs for web scraping and data processing.
Awesome Crawler	5,859			7 months ago			27	mit
A collection of awesome web crawler,spider in different languages
Autoscraper	5,159		1	a year ago	16	July 17, 2022	9	mit	Python
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Douyin_tiktok_download_api	4,844			7 months ago	21	September 23, 2023	60	mit	Python
🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具，支持API调用，在线批量解析及下载。
Rod	4,505		140	5 months ago	406	November 06, 2023	106	mit	Go
A Devtools driver for web automation and scraping
Node Osmosis	4,083	218	58	a year ago	27	March 01, 2019	117		JavaScript
Web scraper for NodeJS
Browser Fingerprinting	3,353			a year ago			7		JavaScript
Analysis of Bot Protection systems with available countermeasures 🚿. How to defeat anti-bot system 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web?

Alternatives To Scraper Tutorial

Select To Compare

Scrapy ⭐ 49,918

Scrapy, a fast high-level web crawling & scraping framework for Python.

dependent packages 445total releases 96most recent commit 5 months ago

Huginn ⭐ 42,091

Create agents that monitor and act on your behalf. Your agents are standing by!

dependent packages 52total releases 8most recent commit 11 days ago

Crawlee ⭐ 12,871

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

dependent packages 42total releases 747most recent commit 2 days ago

Awesome Web Scraping ⭐ 6,060

List of libraries, tools and APIs for web scraping and data processing.

most recent commit 7 months ago

Awesome Crawler ⭐ 5,859

A collection of awesome web crawler,spider in different languages

most recent commit 7 months ago

Autoscraper ⭐ 5,159

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

dependent packages 1total releases 16most recent commit a year ago

Douyin_tiktok_download_api ⭐ 4,844

🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、T

total releases 21most recent commit 7 months ago

Rod ⭐ 4,505

A Devtools driver for web automation and scraping

dependent packages 140total releases 406most recent commit 5 months ago

Node Osmosis ⭐ 4,083

Web scraper for NodeJS

dependent packages 58total releases 27most recent commit a year ago

Browser Fingerprinting ⭐ 3,353

Analysis of Bot Protection systems with available countermeasures 🚿. How to defeat anti-bot system 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web?

most recent commit a year ago

Suggest An Alternative To scraper-tutorial

Alternative Project Comparisons

Scraper Tutorial vs Scrapy

Scraper Tutorial vs Huginn

Scraper Tutorial vs Crawlee

Scraper Tutorial vs Awesome Web Scraping

Scraper Tutorial vs Awesome Crawler

Scraper Tutorial vs Autoscraper

Scraper Tutorial vs Douyin_tiktok_download_api

Scraper Tutorial vs Rod

Scraper Tutorial vs Node Osmosis

Scraper Tutorial vs Browser Fingerprinting

Popular Scraper Projects

Cheerio ⭐ 27,702

The fast, flexible, and elegant library for parsing and manipulating HTML and XML.

dependent packages 20,519total releases 70latest release June 26, 2022most recent commit 3 months ago

Lux ⭐ 24,752

👾 Fast and simple video download library and CLI tool written in Go

dependent packages 8total releases 40latest release November 06, 2023most recent commit 3 months ago

Colly ⭐ 22,516

Elegant Scraper and Crawler Framework for Golang

dependent packages 328total releases 22latest release March 08, 2022most recent commit 14 days ago

Easyspider ⭐ 20,149

A visual no-code/code-free web crawler/spider易采集：一个可视化浏览器自动化测试/数据采集/爬虫软件，可以无代码图形化

most recent commit 3 months ago

Newspaper ⭐ 13,147

News, full-text, and article metadata extraction in Python 3. Advanced docs:

dependent packages 97total releases 18latest release September 28, 2018most recent commit 9 months ago

Popular Web Crawler Projects

Changedetection.io ⭐ 13,943

The best and simplest free open source website change detection, website watcher, restock monitor and notification service. Restock Monitor, change detection. Designed for simplicity - Simply monitor which websites had a text change for free. Free Open source web page change detection, Website defacement monitoring, Price change notification

most recent commit 5 months ago

Crawlab ⭐ 10,521

Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台，支持任何语言和框架

total releases 1latest release March 03, 2019most recent commit 6 months ago

Spider Flow ⭐ 8,075

新一代爬虫平台，以图形化方式定义爬虫流程，不写代码即可完成爬虫。

most recent commit a year ago

Katana ⭐ 7,995

A next-generation crawling and spidering framework.

dependent packages 1total releases 8latest release September 14, 2023most recent commit 5 months ago

Ani Cli ⭐ 5,724

A cli tool to browse and play anime

most recent commit 5 months ago

Popular Data Processing Categories

Get A Weekly Email With Trending Projects For These Categories

No Spam. Unsubscribe easily at any time.

Javascript

Tutorials

Scraper

Web Crawler

Privacy | About | Terms | Follow Us On Twitter

Downloads, Dependent Repos, Dependent Packages, Total Releases, Latest Releases data powered by Libraries.io.