Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Cheerio | 27,590 | 112,614 | 20,519 | 12 days ago | 70 | June 26, 2022 | 34 | mit | TypeScript | |
The fast, flexible, and elegant library for parsing and manipulating HTML and XML. | ||||||||||
Jsoup | 10,463 | 21,589 | 1,763 | 3 months ago | 44 | November 27, 2023 | 87 | mit | Java | |
jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety. | ||||||||||
Node Osmosis | 4,083 | 218 | 58 | 8 months ago | 27 | March 01, 2019 | 117 | JavaScript | ||
Web scraper for NodeJS | ||||||||||
Gdom | 1,180 | 3 | 4 years ago | 3 | November 20, 2017 | 6 | other | Python | ||
DOM Traversing and Scraping using GraphQL | ||||||||||
Skrape.it | 714 | 3 | 3 months ago | 14 | July 19, 2022 | 25 | mit | Kotlin | ||
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion. | ||||||||||
Secret Agent | 575 | 32 | a year ago | 33 | May 02, 2021 | 43 | mit | TypeScript | ||
The web scraper that's nearly impossible to block - now called @ulixee/hero | ||||||||||
Scalpel | 315 | 7 | 3 months ago | 21 | December 08, 2023 | 9 | apache-2.0 | Haskell | ||
A high level web scraping library for Haskell. | ||||||||||
Scraply | 114 | 2 years ago | 1 | July 07, 2022 | apache-2.0 | Go | ||||
Scraply a simple dom scraper to fetch information from any html based website | ||||||||||
Scraper | 49 | 2 | 3 | 3 months ago | 5 | February 01, 2021 | 1 | mit | Go | |
A dual interface Go module for building simple web scrapers | ||||||||||
Fulldom Server | 31 | 2 | 3 years ago | 3 | October 17, 2016 | 13 | agpl-3.0 | JavaScript | ||
Proxy-like server that will show you the DOM of a page after JS runs |