Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Nlp_chinese_corpus | 8,344 | a year ago | 20 | mit | ||||||
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP | ||||||||||
Chinesenlp | 1,329 | 3 years ago | 3 | HTML | ||||||
Datasets, SOTA results of every fields of Chinese NLP | ||||||||||
Csl | 518 | 10 months ago | 3 | Python | ||||||
[COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset 中文科学文献数据集 | ||||||||||
Chinese Nlp Corpus | 378 | 3 years ago | 1 | Python | ||||||
Collections of Chinese NLP corpus | ||||||||||
Gossiping Chinese Corpus | 136 | 3 years ago | apache-2.0 | Jupyter Notebook | ||||||
PTT 八卦版問答中文語料 | ||||||||||
Taisu | 129 | 6 months ago | 4 | other | Python | |||||
TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集) | ||||||||||
Chinese Sentence Pair Modeling | 54 | 2 years ago | 1 | apache-2.0 | Jupyter Notebook | |||||
Use deep models including BiLSTM, ABCNN, ESIM, RE2, BERT, etc. and evaluate on 5 Chinese NLP datasets: LCQMC, BQ Corpus, ChineseSTS, OCNLI, CMNLI | ||||||||||
Douban Dushu Dataset | 36 | 5 years ago | 1 | |||||||
A dataset contains 37 million douban dushu comments | ||||||||||
Cnn Question Classification Keras | 29 | 2 years ago | 12 | Python | ||||||
Chinese Question Classifier (Keras Implementation) on BQuLD | ||||||||||
Corpus_dataset_for_chinese_nlp | 18 | 5 years ago | mit | |||||||
中文 NLP 语料库数据集 |