Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Gpt2 Chinese | 7,249 | 4 months ago | 105 | mit | Python | |||||
Chinese version of GPT2 training code, using BERT tokenizer. | ||||||||||
Ckip Transformers | 439 | a year ago | 1 | gpl-3.0 | Python | |||||
CKIP Transformers | ||||||||||
Simple | 411 | 5 months ago | 10 | mit | C++ | |||||
支持中文和拼音的 SQLite fts5 全文搜索扩展 | A SQLite3 fts5 tokenizer which supports Chinese and PinYin | ||||||||||
Segmentit | 208 | 1 | 6 | a year ago | 17 | December 22, 2019 | 6 | mit | JavaScript | |
任何 JS 环境可用的中文分词包,fork from leizongmin/node-segment | ||||||||||
Sqlitesubstringsearch | 76 | 8 years ago | C | |||||||
An open source tokenizer which supports fast substring search with sqlite FTS (full text search) | ||||||||||
Cang Jie | 65 | 6 | 6 months ago | 20 | November 04, 2023 | mit | Rust | |||
Chinese tokenizer for tantivy, based on jieba-rs | ||||||||||
Ud Kanbun | 59 | 1 | 2 | 3 months ago | 249 | September 25, 2023 | mit | Python | ||
Tokenizer POS-tagger and Dependency-parser for Classical Chinese | ||||||||||
Rasa_chinese | 46 | 3 years ago | 1 | apache-2.0 | Python | |||||
rasa_chinese 专门针对中文语言的 rasa 组件扩展包,提供了许多针对中文语言的组件 | ||||||||||
Chinese Tokenizer | 39 | 3 | 7 | 4 years ago | 11 | June 05, 2019 | 2 | mit | JavaScript | |
Tokenizes Chinese texts into words. | ||||||||||
Pnlp | 25 | 1 | 1 | 3 months ago | 38 | December 25, 2022 | apache-2.0 | Python | ||
NLP预/后处理工具。 |