Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Jsoup | 10,463 | 21,589 | 1,763 | 4 months ago | 44 | November 27, 2023 | 87 | mit | Java | |
jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety. | ||||||||||
Parsel | 1,010 | 1,468 | 152 | 8 months ago | 22 | April 18, 2023 | 36 | bsd-3-clause | Python | |
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors | ||||||||||
Xidel | 611 | 6 months ago | 18 | gpl-3.0 | Pascal | |||||
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents. | ||||||||||
Hquery.php | 345 | 1 | 4 | 4 months ago | 23 | July 19, 2019 | 15 | mit | PHP | |
An extremely fast web scraper that parses megabytes of invalid HTML in the blink of an eye. PHP 5.3+, no dependencies. | ||||||||||
Xquery | 155 | 92 | 6 years ago | 2 | May 15, 2018 | mit | Go | |||
Extract data or evaluate value from HTML/XML documents using XPath | ||||||||||
Scraperboard | 94 | | | | 9 years ago | | November 26, 2023 | | | Go |
Golang library to easily scrape websites based on simple XML declarations | ||||||||||
Tatooine | 78 | 3 | 1 | a year ago | 84 | March 27, 2023 | 8 | other | TypeScript | |
A powerful scraper for JavaScript Developers. | ||||||||||
Scraper Fourone Jobs | 43 | | | | 5 years ago | | | | gpl-2.0 | Python |
An anti-scraping cracker for extracting application information from one of Taiwan's job recruiting websites. | ||||||||||
Ronin Web | 40 | 3 | 5 months ago | 16 | April 04, 2023 | 8 | gpl-3.0 | Ruby | ||
ronin-web is a collection of useful web helper methods and commands. | ||||||||||
Xpath Selector | 28 | 9 | 2 | 8 years ago | 6 | December 08, 2014 | 1 | HTML | ||
Library implementing easy XPath queries. Very useful for HTML and XML web scraping. |