Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Livetv_mining | 190 | | | | 7 years ago | | | | apache-2.0 | Python |
Data collection from live-streaming websites | ||||||||||
Jason The Miner | 40 | 3 | 3 years ago | 13 | June 01, 2020 | 4 | mit | JavaScript | ||
⛏ A versatile Web scraper for Node.js | ||||||||||
Pttminer | 28 | 4 years ago | other | R | ||||||
Parallel Searching and Crawling Data from PTT 🚀 | ||||||||||
Real_time_social_media_mining | 24 | | | | 6 months ago | | | 21 | mit | HTML |
DevOps pipeline for Real Time Social/Web Mining | ||||||||||
Structominer | 17 | | | | 10 years ago | 1 | April 19, 2014 | 7 | mit | Python |
Data scraping for a more civilized age | ||||||||||
Pdf Miner | 13 | | | | 7 years ago | | | 2 | | Python |
Python-based crawler that mines PDFs from websites and extracts useful features for data extraction | ||||||||||
Ducrawler | 12 | | | | 2 years ago | | | 1 | | HTML |
An automatic crawler to mine images from Google and Bing Image search (part of SketchyScene at ECCV 2018) | ||||||||||
Linkrev | 11 | | | | 12 years ago | | | | | Scala |
Athena | 10 | | | | 8 years ago | 7 | October 05, 2015 | | | Clojure |
A small but powerful library for mining data from web pages and HTML documents. Athena provides an easy to use DSL for crawling HTML pages and extracting information from them. | ||||||||||
Webdirfuzz | 7 | | | | 7 years ago | | | | apache-2.0 | Python |
Web Dir Fuzz tool for vulnerability mining |