Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Exploring LLM-based Data Annotation Strategies for Medical Dialogue Preference Alignment

Oct 05, 2024

Chengfeng Dou, Ying Zhang, Zhi Jin, Wenpin Jiao, Haiyan Zhao, Yongqiang Zhao, Zhengwei Tao

Figure 1 for Exploring LLM-based Data Annotation Strategies for Medical Dialogue Preference Alignment

Figure 2 for Exploring LLM-based Data Annotation Strategies for Medical Dialogue Preference Alignment

Figure 3 for Exploring LLM-based Data Annotation Strategies for Medical Dialogue Preference Alignment

Figure 4 for Exploring LLM-based Data Annotation Strategies for Medical Dialogue Preference Alignment

Share this with someone who'll enjoy it:

Abstract:This research examines the use of Reinforcement Learning from AI Feedback (RLAIF) techniques to improve healthcare dialogue models, with the aim of tackling the challenges of preference-aligned data annotation while reducing the reliance on medical experts. We argue that the primary challenges in current RLAIF research for healthcare are the limitations of automated evaluation methods and the difficulties in accurately representing physician preferences. To address these challenges, we present a new evaluation framework based on standardized patient examinations. This framework is designed to objectively assess the effectiveness of large language models (LLMs) in guiding users and following instructions, enabling a comprehensive comparison across different models. Furthermore, our investigation of effective ways to express physician preferences using Constitutional AI algorithms highlighted the particular effectiveness of flowcharts. Utilizing this finding, we introduce an innovative agent-based approach for annotating preference data. This approach autonomously creates medical dialogue flows tailored to the patient's condition, demonstrates strong generalization abilities, and reduces the need for expert involvement. Our results show that the agent-based approach outperforms existing RLAIF annotation methods in standardized patient examinations and surpasses current open source medical dialogue LLMs in various test scenarios.

* 14 Pages, 12 figures

View paper on

Share this with someone who'll enjoy it:

Title:Exploring LLM-based Data Annotation Strategies for Medical Dialogue Preference Alignment

Paper and Code