Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Huatuo Llama Med Chinese | 3,776 | a month ago | 14 | apache-2.0 | Python | |||||
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调 | ||||||||||
Awesome_chinese_medical_nlp | 1,411 | 10 months ago | ||||||||
中文医学NLP公开资源整理:术语集/语料库/词向量/预训练模型/知识图谱/命名实体识别/QA/信息抽取/模型/论文/etc | ||||||||||
Med Chatglm | 462 | 6 months ago | 16 | apache-2.0 | Python | |||||
Repo for Chinese Medical ChatGLM 基于中文医学知识的ChatGLM指令微调 | ||||||||||
Chinese Nlp Corpus | 378 | 3 years ago | 1 | Python | ||||||
Collections of Chinese NLP corpus | ||||||||||
Chineseblue | 212 | 2 years ago | 3 | apache-2.0 | Python | |||||
Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE) | ||||||||||
Medical Books | 131 | 5 years ago | 1 | TeX | ||||||
Open sourece medical books in LaTeX. LaTeX写的中文开源医学书籍 | ||||||||||
Covid Dialogue | 110 | 2 years ago | 4 | Python | ||||||
Find Chinese Medical Words | 53 | 3 years ago | 1 | mit | Python | |||||
发现新词 无监督词库生成 医学词库生成 发现未登录词 | ||||||||||
Cmedqa2 | 51 | 5 years ago | gpl-3.0 | |||||||
This is updated version of the dataset for Chinese community medical question answering. | ||||||||||
Cmedqa | 36 | 4 years ago | 3 | |||||||
This is the dataset for Chinese community medical question answering. |
/(Instruction-tuning) LLaMAAlpaca-ChineseBloom
ChatGPT API
[2023/08/05] CCL 2023 Demo TrackPoster
[2023/07/19] Bloom
[2023/05/12] """"
[2023/04/28] Alpaca
[2023/04/24] LLaMA
[2023/03/31] LLaMA
python3.9+
pip install -r requirements.txt
LoRA
LoRAHugging Face
LoRA
**lora-folder-name**/
- adapter_config.json # LoRA
- adapter_model.bin # LoRA
ChatGLM: ChatGLM-6B-Med
./data/infer.json
infer
#
bash ./scripts/infer.sh
#
#
bash ./scripts/infer-literature-single.sh
#
bash ./scripts/infer-literature-multi.sh
infer.shbase_modelloralora_weightsinstruct_dir
python infer.py \
--base_model 'BASE_MODEL_PATH' \
--lora_weights 'LORA_WEIGHTS_PATH' \
--use_lora True \
--instruct_dir 'INFER_DATA_PATH' \
--prompt_template 'TEMPLATE_PATH'
__
&Bloom | LLaMA&Alpaca |
---|---|
templates/bloom_deploy.json |
templates/med_template.json templates/literature_template.json
|
./scripts/test.sh
:
{"": "", "": ["", ""], "": ["", "", ""], "": ["", ""], "": [""]}
GPT3.5Prompt
""
: "
2023GPT3.5./data_literature/liver_cancer.json
1k``
""16
./data/llama_data.json
finetune
bash ./scripts/finetune.sh
LLaMAA100-SXM-80GB102h17mbatch_size=12840G3090/4090(24GB)batch_size
wandb:
https://wandb.ai/thinksoso/llama_med/runs/a5wgcnzt/overview?workspace=user-thinksoso
20233
Llama | Alpaca | BenTsao | |
---|---|---|---|
**** | |||
**** | |||
**** |
Q: ""
A: SCIR""
Q: """"
A: LLaMAAlpaca
Q:
A:
Q:
A: LLaMAAlpacabloom-based-based
Q: /
A: requirementscudalorallama-basedllamaloraissue
Q:
A:
: Huatuo: Tuning llama model with chinese medical knowledge
@misc{wang2023huatuo,
title={HuaTuo: Tuning LLaMA Model with Chinese Medical Knowledge},
author={Haochun Wang and Chi Liu and Nuwa Xi and Zewen Qiang and Sendong Zhao and Bing Qin and Ting Liu},
year={2023},
eprint={2304.06975},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
@misc{wang2023knowledgetuning,
title={Knowledge-tuning Large Language Models with Structured Medical Knowledge Bases for Reliable Response Generation in Chinese},
author={Haochun Wang and Sendong Zhao and Zewen Qiang and Zijian Li and Nuwa Xi and Yanrui Du and MuZhen Cai and Haoqiang Guo and Yuhan Chen and Haoming Xu and Bing Qin and Ting Liu},
year={2023},
eprint={2309.04175},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
The CALLA Dataset: Probing LLMs Interactive Knowledge Acquisition from Chinese Medical Literature
@misc{du2023calla,
title={The CALLA Dataset: Probing LLMs' Interactive Knowledge Acquisition from Chinese Medical Literature},
author={Yanrui Du and Sendong Zhao and Muzhen Cai and Jianyu Chen and Haochun Wang and Yuhan Chen and Haoqiang Guo and Bing Qin},
year={2023},
eprint={2309.04198},
archivePrefix={arXiv},
primaryClass={cs.CL}
}