Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Elasticsearch Analysis Jieba | 296 | 7 years ago | 10 | Java | ||||||
The plugin includes the `jieba` analyzer, `jieba` tokenizer, and `jieba` token filter, and offers two modes: `index`, used when indexing a document, and `search`, used when running a search query. | ||||||||||
Microtokenizer | 119 | 3 | 1 | 3 years ago | 53 | September 28, 2021 | mit | Python | ||
A micro tokenizer for Chinese: a tiny word segmentation engine with comprehensive algorithm coverage | ||||||||||
Cang Jie | 65 | 6 | 8 months ago | 20 | November 04, 2023 | mit | Rust | |||
Chinese tokenizer for tantivy, based on jieba-rs | ||||||||||
Sphinx Jieba | 18 | 7 years ago | 4 | gpl-2.0 | C++ | |||||
Sphinx search engine with the jieba tokenizer | ||||||||||
Chinese_tokenizer_benchmark | 10 | 6 years ago | 1 | Python | ||||||
Benchmark of Chinese word segmentation (tokenizer) software | ||||||||||
Cppjieba Py | 5 | 7 years ago | mit | Python | ||||||
Python extension for cppjieba |