Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Wikihadoop | 84 | 11 years ago | 5 | Java | ||||||
Stream-based InputFormat for processing the compressed XML dumps of Wikipedia with Hadoop | ||||||||||
Wikiparse | 47 | 9 years ago | 5 | Clojure | ||||||
Parse wikipedia dumps and index (some) page data to elasticsearch | ||||||||||
Jwiki | 23 | 2 years ago | Java | |||||||
Java tool to get wikipedia data | ||||||||||
Wikipedia Ngrams | 12 | 11 years ago | Java | |||||||
Code to split/parse Wikipedia XML dump | ||||||||||
Hadoop_ctakes | 9 | 10 years ago | apache-2.0 | Java | ||||||
Hadoop integration code for working with with Apache cTAKES | ||||||||||
Tf Idf Hadoop Mapreduce | 8 | 10 years ago | 2 | Java | ||||||
Project from the CTU Big Data course which purpose was to compute tf-idf values for the czech wikipedia | ||||||||||
Weird Tree Plot | 5 | 7 years ago | 9 | gpl-3.0 | Java | |||||
A plotter for really weird tree graphs. |