Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Khashayar

Ernest

Using Multi-modal Data for Improving Generalizability and Explainability of Disease Classification in Radiology

Jul 29, 2022

Pranav Agnihotri, Sara Ketabi, Khashayar, Namdar, Farzad Khalvati

Figure 1 for Using Multi-modal Data for Improving Generalizability and Explainability of Disease Classification in Radiology

Figure 2 for Using Multi-modal Data for Improving Generalizability and Explainability of Disease Classification in Radiology

Figure 3 for Using Multi-modal Data for Improving Generalizability and Explainability of Disease Classification in Radiology

Figure 4 for Using Multi-modal Data for Improving Generalizability and Explainability of Disease Classification in Radiology

Abstract:Traditional datasets for the radiological diagnosis tend to only provide the radiology image alongside the radiology report. However, radiology reading as performed by radiologists is a complex process, and information such as the radiologist's eye-fixations over the course of the reading has the potential to be an invaluable data source to learn from. Nonetheless, the collection of such data is expensive and time-consuming. This leads to the question of whether such data is worth the investment to collect. This paper utilizes the recently published Eye-Gaze dataset to perform an exhaustive study on the impact on performance and explainability of deep learning (DL) classification in the face of varying levels of input features, namely: radiology images, radiology report text, and radiologist eye-gaze data. We find that the best classification performance of X-ray images is achieved with a combination of radiology report free-text and radiology image, with the eye-gaze data providing no performance boost. Nonetheless, eye-gaze data serving as secondary ground truth alongside the class label results in highly explainable models that generate better attention maps compared to models trained to do classification and attention map generation without eye-gaze data.

Via

Access Paper or Ask Questions

Improving Disease Classification Performance and Explainability of Deep Learning Models in Radiology with Heatmap Generators

Jun 28, 2022

Akino Watanabe, Sara Ketabi, Khashayar, Namdar, Farzad Khalvati

Figure 1 for Improving Disease Classification Performance and Explainability of Deep Learning Models in Radiology with Heatmap Generators

Figure 2 for Improving Disease Classification Performance and Explainability of Deep Learning Models in Radiology with Heatmap Generators

Figure 3 for Improving Disease Classification Performance and Explainability of Deep Learning Models in Radiology with Heatmap Generators

Figure 4 for Improving Disease Classification Performance and Explainability of Deep Learning Models in Radiology with Heatmap Generators

Abstract:As deep learning is widely used in the radiology field, the explainability of such models is increasingly becoming essential to gain clinicians' trust when using the models for diagnosis. In this research, three experiment sets were conducted with a U-Net architecture to improve the classification performance while enhancing the heatmaps corresponding to the model's focus through incorporating heatmap generators during training. All of the experiments used the dataset that contained chest radiographs, associated labels from one of the three conditions ("normal", "congestive heart failure (CHF)", and "pneumonia"), and numerical information regarding a radiologist's eye-gaze coordinates on the images. The paper (A. Karargyris and Moradi, 2021) that introduced this dataset developed a U-Net model, which was treated as the baseline model for this research, to show how the eye-gaze data can be used in multi-modal training for explainability improvement. To compare the classification performances, the 95% confidence intervals (CI) of the area under the receiver operating characteristic curve (AUC) were measured. The best method achieved an AUC of 0.913 (CI: 0.860-0.966). The greatest improvements were for the "pneumonia" and "CHF" classes, which the baseline model struggled most to classify, resulting in AUCs of 0.859 (CI: 0.732-0.957) and 0.962 (CI: 0.933-0.989), respectively. The proposed method's decoder was also able to produce probability masks that highlight the determining image parts in model classifications, similarly as the radiologist's eye-gaze data. Hence, this work showed that incorporating heatmap generators and eye-gaze information into training can simultaneously improve disease classification and provide explainable visuals that align well with how the radiologist viewed the chest radiographs when making diagnosis.

Via

Access Paper or Ask Questions