Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Nlp_chinese_corpus | 8,344 | | | | 10 months ago | | | 20 | mit | |
Large Scale Chinese Corpus for NLP | | | | | | | | | | |
Asrt_speechrecognition | 7,253 | | | | 2 months ago | 1 | October 23, 2020 | 101 | gpl-3.0 | Python |
A Deep-Learning-Based Chinese Speech Recognition System | | | | | | | | | | |
Pycorrector | 4,928 | 1 | 2 months ago | 30 | November 07, 2023 | 27 | apache-2.0 | Python | ||
pycorrector is a toolkit for text error correction. It applies models such as Kenlm, T5, MacBERT, ChatGLM3, and LLaMA to error-correction scenarios and works out of the box. | | | | | | | | | | |
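Chinese Names Corpus | 3,719 | | | | 3 months ago | | | 7 | apache-2.0 | |
Chinese personal-name corpus and name generator. Covers Chinese full names, surnames, given names, forms of address, Japanese names, transliterated names, and English names. Can be used for Chinese word segmentation and person-name entity recognition. | | | | | | | | | | |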
Clue | 3,345 | | | | 10 months ago | | | 73 | | Python |
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard | | | | | | | | | | |
Uer Py | 2,802 | | | | 4 months ago | | | 132 | apache-2.0 | Python |
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo | | | | | | | | | | |
Cluedatasetsearch | 2,778 | | | | a year ago | | | 6 | | Python |
Search all Chinese NLP datasets, plus commonly used English NLP datasets | | | | | | | | | | |
Weibo_terminater | 2,265 | | | | 4 years ago | | | 9 | | Python |
Final Weibo crawler: scrape anything from Weibo, including comments, post contents, and followers. The Terminator | | | | | | | | | | |
Gpt2 Ml | 1,674 | | | | 10 months ago | | | 22 | apache-2.0 | Python |
GPT2 for Multiple Languages, including pretrained models; includes a 1.5-billion-parameter Chinese pretrained model | | | | | | | | | | |
Rasa_nlu_chi | 1,466 | | | | 5 months ago | | | 79 | apache-2.0 | Python |
Turn Chinese natural language into structured data (Chinese natural language understanding) | | | | | | | | | | |