Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Failure Detection in Medical Image Classification: A Reality Check and Benchmarking Testbed

May 27, 2022

Melanie Bernhardt, Fabio De Sousa Ribeiro, Ben Glocker

Figure 1 for Failure Detection in Medical Image Classification: A Reality Check and Benchmarking Testbed

Figure 2 for Failure Detection in Medical Image Classification: A Reality Check and Benchmarking Testbed

Figure 3 for Failure Detection in Medical Image Classification: A Reality Check and Benchmarking Testbed

Figure 4 for Failure Detection in Medical Image Classification: A Reality Check and Benchmarking Testbed

Share this with someone who'll enjoy it:

Abstract:Failure detection in automated image classification is a critical safeguard for clinical deployment. Detected failure cases can be referred to human assessment, ensuring patient safety in computer-aided clinical decision making. Despite its paramount importance, there is insufficient evidence about the ability of state-of-the-art confidence scoring methods to detect test-time failures of classification models in the context of medical imaging. This paper provides a reality check, establishing the performance of in-domain misclassification detection methods, benchmarking 9 confidence scores on 6 medical imaging datasets with different imaging modalities, in multiclass and binary classification settings. Our experiments show that the problem of failure detection is far from being solved. We found that none of the benchmarked advanced methods proposed in the computer vision and machine learning literature can consistently outperform a simple softmax baseline. Our developed testbed facilitates future work in this important area.

View paper on

Share this with someone who'll enjoy it:

Title:Failure Detection in Medical Image Classification: A Reality Check and Benchmarking Testbed

Paper and Code