Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
D2l Zh | 55,505 | 1 | 1 | a month ago | 51 | August 18, 2023 | 65 | apache-2.0 | Python | |
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。 | ||||||||||
Chinese Bert Wwm | 8,600 | 9 months ago | 3 | apache-2.0 | Python | |||||
Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型) | ||||||||||
Qwen | 8,482 | 3 months ago | 139 | apache-2.0 | Python | |||||
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. | ||||||||||
Nlp_chinese_corpus | 8,344 | a year ago | 20 | mit | ||||||
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP | ||||||||||
Text_classification | 7,628 | 7 months ago | 45 | mit | Python | |||||
all kinds of text classification models and more with deep learning | ||||||||||
Gpt2 Chinese | 7,249 | 4 months ago | 105 | mit | Python | |||||
Chinese version of GPT2 training code, using BERT tokenizer. | ||||||||||
Ansj_seg | 6,390 | 402 | 17 | 5 months ago | 10 | February 15, 2018 | 50 | apache-2.0 | Java | |
ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典 | ||||||||||
Baichuan 7b | 5,493 | 7 months ago | 80 | apache-2.0 | Python | |||||
A large-scale 7B pretraining language model developed by BaiChuan-Inc. | ||||||||||
Awesome Chinese Llm | 5,477 | 3 months ago | ||||||||
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。 | ||||||||||
Huatuo Llama Med Chinese | 3,776 | 6 months ago | 14 | apache-2.0 | Python | |||||
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调 |