Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Karin Dembrower

CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer

Dec 02, 2021

Moein Sorkhei, Yue Liu, Hossein Azizpour, Edward Azavedo, Karin Dembrower, Dimitra Ntoula, Athanasios Zouzos, Fredrik Strand, Kevin Smith

Figure 1 for CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer

Figure 2 for CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer

Figure 3 for CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer

Figure 4 for CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer

Abstract:Interval and large invasive breast cancers, which are associated with worse prognosis than other cancers, are usually detected at a late stage due to false negative assessments of screening mammograms. The missed screening-time detection is commonly caused by the tumor being obscured by its surrounding breast tissues, a phenomenon called masking. To study and benchmark mammographic masking of cancer, in this work we introduce CSAW-M, the largest public mammographic dataset, collected from over 10,000 individuals and annotated with potential masking. In contrast to the previous approaches which measure breast image density as a proxy, our dataset directly provides annotations of masking potential assessments from five specialists. We also trained deep learning models on CSAW-M to estimate the masking level and showed that the estimated masking is significantly more predictive of screening participants diagnosed with interval and large invasive cancers -- without being explicitly trained for these tasks -- than its breast density counterparts.

* 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks

Via

Access Paper or Ask Questions

Adding Seemingly Uninformative Labels Helps in Low Data Regimes

Aug 11, 2020

Christos Matsoukas, Albert Bou I Hernandez, Yue Liu, Karin Dembrower, Gisele Miranda, Emir Konuk, Johan Fredin Haslum, Athanasios Zouzos, Peter Lindholm, Fredrik Strand(+1 more)

Figure 1 for Adding Seemingly Uninformative Labels Helps in Low Data Regimes

Figure 2 for Adding Seemingly Uninformative Labels Helps in Low Data Regimes

Figure 3 for Adding Seemingly Uninformative Labels Helps in Low Data Regimes

Figure 4 for Adding Seemingly Uninformative Labels Helps in Low Data Regimes

Abstract:Evidence suggests that networks trained on large datasets generalize well not solely because of the numerous training examples, but also class diversity which encourages learning of enriched features. This raises the question of whether this remains true when data is scarce - is there an advantage to learning with additional labels in low-data regimes? In this work, we consider a task that requires difficult-to-obtain expert annotations: tumor segmentation in mammography images. We show that, in low-data settings, performance can be improved by complementing the expert annotations with seemingly uninformative labels from non-expert annotators, turning the task into a multi-class problem. We reveal that these gains increase when less expert data is available, and uncover several interesting properties through further studies. We demonstrate our findings on CSAW-S, a new dataset that we introduce here, and confirm them on two public datasets.

* ICML 2020

Via

Access Paper or Ask Questions