Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Requests Html | 13,100 | a year ago | 6 | July 26, 2022 | 198 | mit | Python | |||
Pythonic HTML Parsing for Humans™ | ||||||||||
Crawler User Agents | 1,045 | 5 | 8 | 3 months ago | 118 | November 20, 2023 | 7 | mit | Python | |
Syntactic patterns of HTTP user-agents used by bots / robots / crawlers / scrapers / spiders. pull-request welcome :star: | ||||||||||
Scrapyrt | 793 | 5 | 6 months ago | 7 | September 20, 2023 | 31 | bsd-3-clause | Python | ||
HTTP API for Scrapy spiders | ||||||||||
Xidel | 611 | 5 months ago | 18 | gpl-3.0 | Pascal | |||||
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents. | ||||||||||
Httplang | 508 | 6 years ago | 2 | mit | Python | |||||
A scripting langauge to do HTTP routines. | ||||||||||
Hrequests | 327 | 6 months ago | 14 | September 10, 2023 | 3 | apache-2.0 | Python | |||
🚀 Web scraping for humans | ||||||||||
Sasila | 264 | 5 years ago | 17 | November 02, 2017 | 1 | apache-2.0 | Python | |||
一个灵活、友好的爬虫框架 | ||||||||||
Proxy Scraper | 260 | a year ago | 12 | Python | ||||||
scrape proxies from more than 5 different sources and check which ones are still alive | ||||||||||
Scrapelib | 195 | 126 | 12 | 4 months ago | 44 | December 15, 2023 | 4 | bsd-2-clause | Python | |
⛏ a library for scraping unreliable pages | ||||||||||
Tokio | 144 | 1 | 2 | 2 years ago | 3 | May 14, 2018 | 3 | mit | JavaScript | |
Web scraping made simple. |