Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Nlp_chinese_corpus | 8,344 | | | | a year ago | | | 20 | mit | |
Large Scale Chinese Corpus for NLP | ||||||||||
Chinesenlp | 1,329 | | | | 3 years ago | | | 3 | | HTML |
Datasets and SOTA results for every field of Chinese NLP | ||||||||||
Insuranceqa Corpus Zh | 989 | | | | 6 months ago | 11 | November 15, 2023 | 9 | other | Python |
Insurance industry corpus, chatbot | ||||||||||
Thoughtsource | 680 | | | | 9 months ago | | | 12 | mit | Jupyter Notebook |
A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/ | ||||||||||
Mac Network | 445 | | | | 3 years ago | | | 9 | apache-2.0 | Python |
Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018) | ||||||||||
Dialogstudio | 356 | | | | 7 months ago | | | | apache-2.0 | Python |
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI | ||||||||||
Cmrc2018 | 313 | | | | 2 years ago | | | 4 | cc-by-sa-4.0 | Python |
A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018) | ||||||||||
Medquad | 275 | | | | 7 months ago | | | 4 | other | |
Medical Question Answering Dataset of 47,457 QA pairs created from 12 NIH websites | ||||||||||
Triviaqa | 227 | | | | 6 months ago | | | 2 | apache-2.0 | Python |
Code for the TriviaQA reading comprehension dataset | ||||||||||
Ott Qa | 141 | | | | 4 months ago | | | 3 | mit | Python |
Code and Data for ICLR 2021 Paper "Open Question Answering over Tables and Text" | ||||||||||