Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Segmentit | 208 | 1 | 6 | a year ago | 17 | December 22, 2019 | 6 | mit | JavaScript | |
A Chinese word segmentation package usable in any JS environment, forked from leizongmin/node-segment | ||||||||||
Microtokenizer | 119 | 3 | 1 | 3 years ago | 53 | September 28, 2021 | mit | Python | ||
A micro tokenizer for Chinese: a tiny word segmentation engine with a comprehensive set of algorithms | ||||||||||
Pnlp | 25 | 1 | 1 | 3 months ago | 38 | December 25, 2022 | apache-2.0 | Python | ||
NLP pre- and post-processing tools. | ||||||||||
Chinesebert | 18 | 5 years ago | 3 | Python | ||||||
A Chinese BERT model specifically for question answering | ||||||||||
Berserker | 16 | 5 years ago | 3 | mit | Python | |||||
Berserker - BERt chineSE woRd toKenizER | ||||||||||
Plane | 11 | 3 | 2 years ago | 20 | January 20, 2021 | 1 | mit | Python | ||
A text processing tool that covers tag (HTML, URL, email) extraction and removal, punctuation normalization, simple segmentation, and more. |