Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Hidden Heterogeneity: When to Choose Similarity-Based Calibration

Feb 03, 2022

Kiri L. Wagstaff, Thomas G. Dietterich

Figure 1 for Hidden Heterogeneity: When to Choose Similarity-Based Calibration

Figure 2 for Hidden Heterogeneity: When to Choose Similarity-Based Calibration

Figure 3 for Hidden Heterogeneity: When to Choose Similarity-Based Calibration

Figure 4 for Hidden Heterogeneity: When to Choose Similarity-Based Calibration

Share this with someone who'll enjoy it:

Abstract:Trustworthy classifiers are essential to the adoption of machine learning predictions in many real-world settings. The predicted probability of possible outcomes can inform high-stakes decision making, particularly when assessing the expected value of alternative decisions or the risk of bad outcomes. These decisions require well calibrated probabilities, not just the correct prediction of the most likely class. Black-box classifier calibration methods can improve the reliability of a classifier's output without requiring retraining. However, these methods are unable to detect subpopulations where calibration could improve prediction accuracy. Such subpopulations are said to exhibit "hidden heterogeneity" (HH), because the original classifier did not detect them. The paper proposes a quantitative measure for HH. It also introduces two similarity-weighted calibration methods that can address HH by adapting locally to each test item: SWC weights the calibration set by similarity to the test item, and SWC-HH explicitly incorporates hidden heterogeneity to filter the calibration set. Experiments show that the improvements in calibration achieved by similarity-based calibration methods correlate with the amount of HH present and, given sufficient calibration data, generally exceed calibration achieved by global methods. HH can therefore serve as a useful diagnostic tool for identifying when local calibration methods are needed.

* Draft version currently under review. Do not cite. Comments and feedback welcome! 33 pages, 10 figures

View paper on

Share this with someone who'll enjoy it:

Title:Hidden Heterogeneity: When to Choose Similarity-Based Calibration

Paper and Code