Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Chinese Names Corpus | 3,719 | 5 months ago | 7 | apache-2.0 | ||||||
中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。 | ||||||||||
Khcoder | 295 | 4 months ago | 10 | gpl-2.0 | Perl | |||||
KH Coder: for Quantitative Content Analysis or Text Mining | ||||||||||
Fasttextjapanesetutorial | 174 | 8 years ago | mit | Python | ||||||
Tutorial to train fastText with Japanese corpus | ||||||||||
Kanji Frequency | 116 | 4 months ago | 1 | cc-by-4.0 | Astro | |||||
Kanji usage frequency data collected from various sources | ||||||||||
Toiro | 110 | 9 months ago | 8 | July 31, 2023 | 1 | apache-2.0 | Python | |||
A comparison tool of Japanese tokenizers | ||||||||||
Chive | 105 | 2 years ago | apache-2.0 | |||||||
Japanese word embedding with Sudachi and NWJC 🌿 | ||||||||||
Jlm | 99 | 5 years ago | mit | Python | ||||||
A fast LSTM Language Model for large vocabulary language like Japanese and Chinese | ||||||||||
Ja.text8 | 74 | 7 years ago | Python | |||||||
Japanese text8 corpus for word embedding. | ||||||||||
Jrte Corpus | 73 | a year ago | other | Python | ||||||
Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020) | ||||||||||
Kwdlc | 71 | 5 months ago | 12 | Python | ||||||
Kyoto University Web Document Leads Corpus |