Title: LLM-RadJudge: Achieving Radiologist-Level Evaluation for X-Ray Report Generation

Apr 01, 2024

Zilong Wang, Xufang Luo, Xinyang Jiang, Dongsheng Li, Lili Qiu

Abstract: Evaluating generated radiology reports is crucial for the development of radiology AI, but existing metrics fail to reflect the task's clinical requirements. This study proposes a novel evaluation framework that uses large language models (LLMs) to compare radiology reports for assessment. We compare the performance of various LLMs and demonstrate that, with GPT-4, our proposed metric achieves evaluation consistency close to that of radiologists. Furthermore, to reduce costs and improve accessibility, and so make the method practical, we construct a dataset from LLM evaluation results and perform knowledge distillation to train a smaller model. The distilled model achieves evaluation capabilities comparable to GPT-4. Our framework and distilled model offer an accessible and efficient evaluation method for radiology report generation, facilitating the development of more clinically relevant models. The model will be open-sourced and made accessible.

* 11 pages, 6 figures
