Project Name | Stars | Most Recent Commit | Open Issues | License | Language | Description |
---|---|---|---|---|---|---|
Pansori | 74 | 5 years ago | | MIT | Python | Tools for ASR Corpus Generation from Online Video |
Word2vec_torch | 60 | 9 years ago | 1 | | Lua | Word2Vec implementation in Torch |
Twitter Corpus | 46 | 6 years ago | 5 | Other | Python | Collects all tweets from the sample Public stream using Twitter's streaming API, and saves them to a file for later use as a corpus. |
Streamcorpus | 33 | 8 years ago | | | Scala | Common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text |
Activitystreams Test Documents | 8 | 8 years ago | 2 | Other | JavaScript | Collection of documents for testing activity streams parsers |
Shami Corpus | 5 | 6 years ago | | Apache-2.0 | Java | Shami Dialect Corpus (SDC) |