In clinical practice, the radiology report is crucial for guiding a patient's treatment. Unfortunately, report writing imposes a heavy burden on radiologists. To reduce this burden, we present an automatic, multi-modal approach for report generation from chest X-ray images. Our approach, motivated by the observation that the descriptions in radiology reports are highly correlated with the X-ray images, features two distinct modules: (i) Learned knowledge base. To absorb the knowledge embedded in the above-mentioned correlation, we automatically build a knowledge base from textual embeddings. (ii) Multi-modal alignment. To promote semantic alignment among reports, disease labels, and images, we explicitly use textual embeddings to guide the learning of the visual feature space. We evaluate the proposed model using both natural language generation and clinical efficacy metrics on the public IU and MIMIC-CXR datasets. Our ablation study shows that each module contributes to improving the quality of the generated reports. Furthermore, with the aid of both modules, our approach clearly outperforms state-of-the-art methods.
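To make the multi-modal alignment idea concrete, the following is a minimal, illustrative sketch (not the authors' implementation) of how textual embeddings could guide the visual feature space: pooled image features are projected into the report embedding space and pulled toward the matched report embedding with a cosine-similarity loss. The module name, feature dimensions, and loss choice are assumptions introduced only for illustration.

```python
# Illustrative sketch of text-guided visual alignment (assumed details,
# not the paper's actual architecture or loss).
import torch
import torch.nn as nn
import torch.nn.functional as F


class TextGuidedAlignment(nn.Module):
    def __init__(self, visual_dim: int = 2048, text_dim: int = 512):
        super().__init__()
        # Project visual features into the text embedding space.
        self.visual_proj = nn.Linear(visual_dim, text_dim)

    def forward(self, visual_feats: torch.Tensor, text_embeds: torch.Tensor) -> torch.Tensor:
        """visual_feats: (batch, visual_dim) pooled image features.
        text_embeds: (batch, text_dim) report embeddings.
        Returns a loss encouraging matched image/report pairs to have
        high cosine similarity in the shared space."""
        v = F.normalize(self.visual_proj(visual_feats), dim=-1)
        t = F.normalize(text_embeds, dim=-1)
        # 1 - cosine similarity per matched pair, averaged over the batch.
        return (1.0 - (v * t).sum(dim=-1)).mean()


if __name__ == "__main__":
    align = TextGuidedAlignment()
    imgs = torch.randn(4, 2048)    # e.g., CNN-pooled chest X-ray features
    reports = torch.randn(4, 512)  # e.g., precomputed report embeddings
    print(align(imgs, reports).item())
```

In such a setup, the text embeddings would typically be held fixed so that the gradient only reshapes the visual feature space around the report semantics; the same projected space could also serve as the key space for querying a learned knowledge base.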