Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Newspaper | 13,147 | 222 | 97 | 6 months ago | 18 | September 28, 2018 | 498 | mit | Python | |
News, full-text, and article metadata extraction in Python 3. Advanced docs: | ||||||||||
News Please | 1,821 | 6 | 4 | 4 months ago | 121 | August 30, 2023 | 17 | apache-2.0 | Python | |
news-please - an integrated web crawler and information extractor for news that just works | ||||||||||
Article Extractor | 1,297 | 12 | 10 | 3 months ago | 156 | December 01, 2022 | 4 | mit | JavaScript | |
Extract the main article from a given URL with Node.js | ||||||||||
Hacker News Digest | 620 | 3 months ago | 9 | lgpl-3.0 | Python | |||||
Let ChatGPT Summarize Hacker News for You | ||||||||||
Awesome Scrapy | 450 | a year ago | 2 | |||||||
A curated list of awesome packages, articles, and other cool resources from the Scrapy community. | ||||||||||
Html2article | 425 | 1 | 7 years ago | 5 | July 11, 2013 | 6 | other | C# | ||
HTML web page main-content extraction | ||||||||||
Node Readability | 302 | 10 | 4 | 6 years ago | 67 | August 01, 2018 | 9 | JavaScript | ||
Scrape or crawl articles from any site automatically. Make any web page readable, whether it is in Chinese or English. | ||||||||||
Koreanewscrawler | 182 | 1 | 2 years ago | 10 | March 27, 2022 | 9 | mit | Python | ||
A news crawler built for collecting large volumes of news data. | ||||||||||
Selenium Crawler | 119 | 11 years ago | 1 | mit | Python | |||||
Sometimes sites make crawling hard. Selenium-crawler uses selenium automation to fix that. | ||||||||||
Strumentalia Seealsology | 76 | 5 months ago | 7 | other | JavaScript | |||||
Scrapes "See also" sections to a custom level of depth