Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Strumentalia Seealsology | 76 | 6 months ago | 7 | other | JavaScript | |||||
see also section scraping on custom levels of depth | ||||||||||
Wikipedia Crawler | 57 | 9 years ago | 1 | mit | Python | |||||
This is a program to crawl entire 'Wikipedia' and extract & store information from the pages as required. | ||||||||||
Tech Seo Crawler | 54 | 2 years ago | 16 | mit | Python | |||||
Build a small, 3 domain internet using Github pages and Wikipedia and construct a crawler to crawl, render, and index. | ||||||||||
Wikireverse | 39 | 6 years ago | 2 | mit | Java | |||||
Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles. | ||||||||||
Wikiracer | 32 | 7 years ago | mit | Go | ||||||
Finds the shortest path between two Wikipedia articles, using only Wikipedia links. | ||||||||||
Word2vec Flask Api | 26 | 7 years ago | mit | Python | ||||||
Flask API for Word2vec | ||||||||||
Wikipedia Crawler | 25 | 3 years ago | gpl-3.0 | Python | ||||||
Extracts plain-text from Wikipedia articles, ideal to perform linguistic analysis | ||||||||||
Crawling For Nomore404 | 19 | a year ago | 8 | Python | ||||||
Similarweb | 10 | 6 years ago | 2 | Python | ||||||
similarweb crawler | ||||||||||
Wikipedia Title Dataset | 10 | 7 years ago | Python | |||||||
Dataset used for Learning Character-level Compositionality with Visual Features (ACL2017) |