Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Blacklab | 97 | 4 | 1 | 3 months ago | 22 | October 06, 2022 | 79 | apache-2.0 | Java | |
Linguistic search for large annotated text corpora, based on Apache Lucene | ||||||||||
Opennlp Models | 43 | 12 years ago | apache-2.0 | Perl | ||||||
A project for code to create models from existing corpora and distribute models. | ||||||||||
Hfututils | 17 | 2 years ago | 2 | Java | ||||||
这是一个工具程序集合,方便我们平时对数据进行预处理。针对文本处理的内容较多。包括分词(集成了张华平分词、结巴分词)、文件处理增强(如读取文本到Map中,保存文本到Map)和语料模型(把文档转换成矩阵,就算单词数量等) | ||||||||||
Gitdox | 14 | 3 months ago | 28 | apache-2.0 | JavaScript | |||||
Repository for GitDOX, a GitHub Data-storage Online XML editor | ||||||||||
Sven | 10 | 10 years ago | 2 | JavaScript | ||||||
sven django project |