Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model

Oct 13, 2023

Qichen Ye, Junling Liu, Dading Chong, Peilin Zhou, Yining Hua, Andrew Liu

Figure 1 for Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model

Figure 2 for Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model

Figure 3 for Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model

Figure 4 for Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model

Share this with someone who'll enjoy it:

Abstract:Integrating large language models (LLMs) into healthcare presents potential but faces challenges. Directly pre-training LLMs for domains like medicine is resource-heavy and sometimes unfeasible. Sole reliance on Supervised Fine-tuning (SFT) can result in overconfident predictions and may not tap into domain specific insights. Addressing these challenges, we present a multi-stage training method combining Domain-specific Continued Pre-training (DCPT), SFT, and Direct Preference Optimization (DPO). A notable contribution of our study is the introduction of a 3Gb Chinese Medicine (ChiMed) dataset, encompassing medical question answering, plain texts, knowledge graphs, and dialogues, segmented into three training stages. The medical LLM trained with our pipeline, Qilin-Med, exhibits significant performance boosts. In the CPT and SFT phases, it achieves 38.4% and 40.0% accuracy on the CMExam, surpassing Baichuan-7B's 33.5%. In the DPO phase, on the Huatuo-26M test set, it scores 16.66 in BLEU-1 and 27.44 in ROUGE1, outperforming the SFT's 12.69 and 24.21. This highlights the strength of our training approach in refining LLMs for medical applications.

View paper on

Share this with someone who'll enjoy it:

Title:Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model

Paper and Code