Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Cheerio | 27,702 | 112,614 | 20,519 | 24 days ago | 70 | June 26, 2022 | 34 | mit | TypeScript | |
The fast, flexible, and elegant library for parsing and manipulating HTML and XML. | ||||||||||
Jsoup | 10,463 | 21,589 | 1,763 | 4 months ago | 44 | November 27, 2023 | 87 | mit | Java | |
jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety. | ||||||||||
Node Osmosis | 4,083 | 218 | 58 | 9 months ago | 27 | March 01, 2019 | 117 | JavaScript | ||
Web scraper for NodeJS | ||||||||||
Scraper | 1,639 | 108 | 371 | 4 months ago | 27 | October 29, 2023 | 9 | isc | Rust | |
HTML parsing and querying with CSS selectors | ||||||||||
Parsel | 1,010 | 1,468 | 152 | 7 months ago | 22 | April 18, 2023 | 36 | bsd-3-clause | Python | |
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors | ||||||||||
Web Scraper Chrome Extension | 980 | 6 years ago | 132 | lgpl-3.0 | JavaScript | |||||
Web data extraction tool implemented as chrome extension | ||||||||||
Amazon Scraper Python | 766 | 4 years ago | 12 | January 11, 2019 | 10 | mit | Python | |||
Non-official client to get some info about products sold on Amazon | ||||||||||
Pdfquery | 693 | 26 | 5 | a year ago | 18 | March 27, 2016 | 25 | mit | Python | |
A fast and friendly PDF scraping library. | ||||||||||
Surgeon | 593 | 4 | 5 | 4 years ago | 64 | June 05, 2020 | 15 | other | JavaScript | |
Declarative DOM extraction expression evaluator. 👨⚕️ | ||||||||||
Scrapple | 452 | 1 | 5 years ago | 10 | September 24, 2016 | 4 | mit | Python | ||
A framework for creating semi-automatic web content extractors |