Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Awesome_chinese_medical_nlp | 1,411 | 2 months ago | ||||||||
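A curated collection of public Chinese medical NLP resources: terminology sets, corpora, word embeddings, pretrained models, knowledge graphs, named entity recognition, QA, information extraction, models, papers, etc. | ||||||||||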
Chinese Nlp Corpus | 378 | 2 years ago | 1 | Python | ||||||
Collections of Chinese NLP corpus | ||||||||||
Chineseblue | 212 | 2 years ago | 3 | apache-2.0 | Python | |||||
Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE) | ||||||||||
Medical Books | 131 | 4 years ago | 1 | TeX | ||||||
Open-source Chinese medical books written in LaTeX. | ||||||||||
Covid Dialogue | 110 | 2 years ago | 4 | Python | ||||||
Find Chinese Medical Words | 53 | 2 years ago | 1 | mit | Python | |||||
New word discovery, unsupervised lexicon generation, medical lexicon generation, and out-of-vocabulary word detection. | ||||||||||
Cmedqa2 | 51 | 4 years ago | gpl-3.0 | |||||||
This is the updated version of the dataset for Chinese community medical question answering. | ||||||||||
Cmedqa | 36 | 3 years ago | 3 | |||||||
This is the dataset for Chinese community medical question answering. | ||||||||||
Amttl | 23 | 5 years ago | mit | Python | ||||||
Code & Data for our COLING 2018 paper "Adaptive Multi-Task Transfer Learning for Chinese Word Segmentation in Medical Text" | ||||||||||
Corona Tracker | 20 | 3 years ago | 3 | mit | Python | |||||
Coronavirus disease 2019 (COVID-19) statistical data from the Chinese medical community (dxy.cn), converted to JSON format |
This is the dataset for Chinese community medical question answering. The dataset is version 1.0 and is available for non-commercial research. We will update and expand it from time to time. To protect privacy, the data is anonymized and contains no personal information.
The latest version of cMedQA is v2.0 (cMedQA2).
DataSet | #Ques | #Ans | Ave. #words per Question | Ave. #words per Answer | Ave. #characters per Question | Ave. #characters per Answer |
---|---|---|---|---|---|---|
Train | 50,000 | 94,134 | 97 | 169 | 120 | 212 |
Dev | 2,000 | 3,774 | 94 | 172 | 117 | 216 |
Test | 2,000 | 3,835 | 96 | 168 | 119 | 211 |
Total | 54,000 | 101,743 | 96 | 169 | 119 | 212 |
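As a rough illustration of how the figures in this table fit together, the sketch below checks that the per-split counts add up to the Total row and shows one way the average character lengths could be computed. The function name `split_stats` and the toy question/answer strings are hypothetical and are not taken from cMedQA; the actual file layout of the dataset is not described here, so the example works on plain in-memory lists. Word-level averages are left out because they depend on a Chinese word segmenter.

```python
from statistics import mean

# Question/answer counts copied from the table above.
splits = {
    "Train": (50_000, 94_134),
    "Dev": (2_000, 3_774),
    "Test": (2_000, 3_835),
}
total_questions = sum(q for q, _ in splits.values())
total_answers = sum(a for _, a in splits.values())


def split_stats(questions, answers):
    """Return counts and average character lengths for one split.

    Hypothetical helper: `questions` and `answers` are lists of strings.
    """
    return {
        "num_questions": len(questions),
        "num_answers": len(answers),
        "avg_chars_per_question": mean(len(q) for q in questions),
        "avg_chars_per_answer": mean(len(a) for a in answers),
    }


# Toy, made-up records for illustration only (not real cMedQA data).
toy_questions = ["头痛怎么办", "感冒发烧吃什么药"]
toy_answers = ["多休息", "建议多喝水", "请及时就医检查"]
stats = split_stats(toy_questions, toy_answers)

print(total_questions, total_answers)
print(stats)
```

Character counts here use Python's `len` on each string, which counts Unicode code points; this is only an assumption about how the reported averages were measured.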
Paper: Chinese Medical Question Answer Matching Using End-to-End Character-Level Multi-Scale CNNs
Please cite our paper when you use the dataset.
@article{zhang2017chinese,
title={Chinese Medical Question Answer Matching Using End-to-End Character-Level Multi-Scale CNNs},
author={Zhang, Sheng and Zhang, Xin and Wang, Hui and Cheng, Jiajun and Li, Pei and Ding, Zhaoyun},
journal={Applied Sciences},
volume={7},
number={8},
pages={767},
year={2017},
publisher={Multidisciplinary Digital Publishing Institute}
}