Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Autophrase | 978 | 2 years ago | 3 | November 19, 2020 | 6 | apache-2.0 | C++ | |||
AutoPhrase: Automated Phrase Mining from Massive Text Corpora | ||||||||||
Jiayan | 232 | 2 years ago | 3 | September 16, 2019 | 7 | mit | Python | |||
甲言,专注于古代汉语(古汉语/古文/文言文/文言)处理的NLP工具包,支持文言词库构建、分词、词性标注、断句和标点。Jiayan, the 1st NLP toolkit designed for Classical Chinese, supports lexicon construction, tokenizing, POS tagging, sentence segmentation and punctuation. | ||||||||||
Sudachidict | 212 | 26 | 3 months ago | 24 | December 14, 2023 | 9 | apache-2.0 | Python | ||
A lexicon for Sudachi | ||||||||||
Open Gram | 59 | 8 years ago | 2 | Python | ||||||
an open solution for collecting n-gram Chinese lexicon and n-gram statistics | ||||||||||
Eleve | 12 | 3 years ago | 12 | October 25, 2020 | 1 | lgpl-3.0 | Python | |||
Extraction de LExique par Variation d'Entropie - Lexicon extraction based on the variation of entropy | ||||||||||
Open Gram | 9 | 14 years ago | Python | |||||||
collect lexicon and build n-gram dataset for NLP in Chinese | ||||||||||
Myanmar Collation Stats | 7 | 3 years ago | Java | |||||||
Myanmar lexicon analyzer - Sorting and Segmentation | ||||||||||
Morphagram | 6 | 2 years ago | 1 | Python | ||||||
A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars | ||||||||||
Mini Segmenter | 6 | 9 years ago | Python | |||||||
Lightweight lexicon/dictionary based Chinese text segmenter |