Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Lianjia Beike Spider | 2,464 | | | | a year ago | | | 13 | | Python |
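A housing price scraper for Lianjia and Beike that collects housing price data (residential communities, second-hand homes, rentals, new homes) for 21 major Chinese cities, including Beijing, Shanghai, Guangzhou, and Shenzhen. Stable, reliable, and fast! Supports CSV, MySQL, MongoDB, Excel, and JSON storage, supports Python 2 and 3, charts the data, and is richly commented. Star to show support. For learning and reference only; do not use for commercial purposes, at your own risk. | ||||||||||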
Web Scraping | 281 | | | | 5 months ago | | | | gpl-3.0 | Python |
More than 50 web scraping examples using Requests, Scrapy, Selenium, LXML, and BeautifulSoup | ||||||||||
Python Automation Scripts | 264 | | | | 3 years ago | | | | gpl-3.0 | Python |
Simple yet powerful automation stuff. | ||||||||||
Sinew | 254 | 3 | 1 | 10 months ago | 14 | July 09, 2021 | mit | Ruby | ||
A Ruby DSL for structured web crawling, with a robust caching system. | ||||||||||
Site Audit Seo | 151 | 1 | 6 months ago | 30 | June 23, 2021 | 11 | JavaScript | |||
Web service and CLI tool for SEO site audit: crawl site, lighthouse all pages, view public reports in browser. Also output to console, json, csv, xlsx, Google Drive. | ||||||||||
Zhihu Spider | 128 | | | | 5 years ago | | | 4 | mit | Python |
A multithreaded Python crawler that retrieves information from Zhihu user profile pages. | ||||||||||
Ipproxy | 113 | | | | 7 years ago | | | 1 | | Python |
A proxy IP extraction tool | ||||||||||
Crawler | 64 | 9 | 9 years ago | 14 | February 08, 2014 | 8 | apache-2.0 | Java | ||
Simple Java web crawler | ||||||||||
Ncovr | 56 | | | | 4 years ago | | | 4 | gpl-3.0 | R |
Scrapy Idealista | 45 | | | | 4 years ago | | | 1 | gpl-2.0 | Python |
Scraping data from the real estate site www.idealista.com |