Rajat Rawat

DiversityMedQA: Assessing Demographic Biases in Medical Diagnosis using Large Language Models

Sep 02, 2024

Rajat Rawat, Hudson McBride, Dhiyaan Nirmal, Rajarshi Ghosh, Jong Moon, Dhruv Alamuri, Sean O'Brien, Kevin Zhu

Figures 1-4 for DiversityMedQA: Assessing Demographic Biases in Medical Diagnosis using Large Language Models

Abstract: As large language models (LLMs) gain traction in healthcare, concerns about their susceptibility to demographic biases are growing. We introduce DiversityMedQA, a novel benchmark designed to assess LLM responses to medical queries across diverse patient demographics, such as gender and ethnicity. By perturbing questions from the MedQA dataset, which comprises medical board exam questions, we created a benchmark that captures the nuanced differences in medical diagnosis across varying patient profiles. Our findings reveal notable discrepancies in model performance when tested against these demographic variations. Furthermore, to ensure the perturbations were accurate, we also propose a filtering strategy that validates each perturbation. By releasing DiversityMedQA, we provide a resource for evaluating and mitigating demographic bias in LLM medical diagnoses.
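
The abstract does not say how the perturbations were generated or how the performance discrepancies were measured. As a rough illustration of the general idea only, the sketch below swaps gendered terms in a question using a small rule-based table and computes the change in accuracy between original and perturbed questions. The swap table, the function names (`perturb_gender`, `accuracy`, `accuracy_gap`), and the example question are all hypothetical and are not taken from the paper or from MedQA.

```python
import re

# Hypothetical swap table for illustration; not the authors' perturbation method.
# Possessives ("his"/"her") are left out because "her" is ambiguous
# between possessive and object forms.
GENDER_SWAPS = {
    "man": "woman", "woman": "man",
    "male": "female", "female": "male",
    "boy": "girl", "girl": "boy",
    "he": "she", "she": "he",
}

_PATTERN = re.compile(r"\b(" + "|".join(GENDER_SWAPS) + r")\b", re.IGNORECASE)


def perturb_gender(question: str) -> str:
    """Swap gendered words, keeping a leading capital letter if present."""
    def swap(match: re.Match) -> str:
        word = match.group(0)
        replacement = GENDER_SWAPS[word.lower()]
        return replacement.capitalize() if word[0].isupper() else replacement

    return _PATTERN.sub(swap, question)


def accuracy(predictions, answers) -> float:
    """Fraction of predictions that equal the reference answers."""
    return sum(p == a for p, a in zip(predictions, answers)) / len(answers)


def accuracy_gap(original_preds, perturbed_preds, answers) -> float:
    """Accuracy on original questions minus accuracy on perturbed ones."""
    return accuracy(original_preds, answers) - accuracy(perturbed_preds, answers)


if __name__ == "__main__":
    q = "A 45-year-old man presents with chest pain. He reports nausea."
    print(perturb_gender(q))
```

A nonzero `accuracy_gap` on questions whose correct answer should not depend on gender would be one simple signal of the kind of discrepancy the abstract describes. A rule-based swap like this cannot tell whether a change alters the clinically correct answer (for example, in a pregnancy-related question), which is the sort of case a validation or filtering step would need to catch.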
