Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Lagoujob | 250 | 5 years ago | apache-2.0 | Python | ||||||
Job data mining repo for lagou.com | ||||||||||
Awesome Web Scraper | 214 | 8 months ago | 41 | mit | ||||||
A collection of awesome web scaper, crawler. | ||||||||||
Strong Web Crawler | 204 | 4 years ago | C# | |||||||
基于C#.NET+PhantomJS+Sellenium的高级网络爬虫程序。可执行Javascript代码、触发各类事件、操纵页面Dom结构。 | ||||||||||
Splashr | 88 | 4 years ago | 13 | other | R | |||||
:sweat_drops: Tools to Work with the 'Splash' JavaScript Rendering Service in R | ||||||||||
Tspider | 71 | 7 years ago | 1 | Python | ||||||
Yet Another Web Spider | ||||||||||
Siteshooter | 58 | 1 | 5 years ago | 73 | May 15, 2019 | 8 | mpl-2.0 | JavaScript | ||
:camera: Automate full website screenshots and PDF generation with multiple viewport support. | ||||||||||
Grell | 36 | 3 years ago | 21 | February 17, 2021 | 2 | mit | Ruby | |||
Web crawler with a Ruby API | ||||||||||
Pycreeper | 27 | 7 years ago | Python | |||||||
一个用来快速提取网页内容的信息采集(爬虫)框架, 实现了对网页的动态加载与控制。 | ||||||||||
Klepto | 21 | 11 years ago | 44 | July 18, 2013 | 1 | mit | Ruby | |||
A mean little DSL'd poltergeist (capybara) based web crawler that stuffs data into your Rails app. | ||||||||||
Php Simple Web Scraper | 11 | 3 years ago | 11 | April 04, 2020 | mit | PHP | ||||
A PHP application which runs on Heroku and dumps web site outputs including JavaScript generated contents. |