Crawler

Web crawler based on Puppeteer
Alternatives To Crawler
Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language
Crawlee | 12,106 | 422 days ago | 747 | December 10, 2023 | 96 | apache-2.0 | TypeScript
Crawlee: a web scraping and browser automation library for Node.js for building reliable crawlers in JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP, in both headful and headless mode, with proxy rotation.
Browser Fingerprinting | 3,353 | a year ago | 7 | JavaScript
Analysis of Bot Protection systems with available countermeasures 🚿. How to defeat anti-bot system 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web?
Thal | 2,268 | 3 years ago | mit | JavaScript
Getting started with Puppeteer and Chrome Headless for Web Scraping
Phpscraper | 486 | 5 months ago | 34 | June 18, 2023 | 20 | gpl-3.0 | PHP
A universal web-util for PHP.
Browsertrix Crawler | 470 | 3 months ago | 91 | agpl-3.0 | JavaScript
Run a high-fidelity browser-based crawler in a single Docker container
Zimit | 209 | 3 months ago | 31 | gpl-3.0 | Python
Make a ZIM file from any Web site and surf offline!
Aws Pdf Textract Pipeline | 148 | 3 months ago | 5 | mit | TypeScript
Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS Textract. Built with AWS CDK + TypeScript.
Gpt4v Scraper | 126 | 3 months ago | JavaScript
AI agent that can SEE 👁️, control, navigate, & do stuff for you on your browser.
Actor Scraper | 93 | 12 | a year ago | 12 | May 28, 2019 | 13 | apache-2.0 | JavaScript
House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.
Browser Pool | 77 | 7 | a year ago | 82 | June 20, 2022 | 8 | TypeScript
A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.