Every uncalibrated classifier has a corresponding true calibration map that calibrates its confidence. Deviations of this idealized map from the identity map reveal miscalibration. Such calibration errors can be reduced with many post-hoc calibration methods, which fit some family of calibration maps on a validation dataset. In contrast, evaluating calibration with the expected calibration error (ECE) on the test set does not explicitly involve fitting. However, as we demonstrate, ECE can still be viewed as implicitly fitting a family of functions on the test data. This motivates the fit-on-the-test view of evaluation: first, approximate a calibration map on the test data, and second, quantify its distance from the identity. Exploiting this view allows us to unlock missed opportunities: (1) using the plethora of post-hoc calibration methods for evaluating calibration; (2) tuning the number of bins in ECE with cross-validation. Furthermore, we introduce: (3) benchmarking on pseudo-real data where the true calibration map can be estimated very precisely; and (4) novel calibration and evaluation methods using new calibration map families PL and PL3.
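To make the fit-on-the-test reading concrete, the minimal sketch below (plain NumPy; the function name `ece_as_fit` and the toy data are illustrative assumptions, not artifacts of the paper) computes the standard equal-width-binning ECE in two steps: it first fits a histogram-binning calibration map on the test predictions, and then measures that map's sample-weighted L1 distance from the identity map.

```python
import numpy as np

def ece_as_fit(confidences, correct, n_bins=15):
    """Equal-width-binning ECE via the fit-on-the-test view:
    (1) fit a histogram-binning calibration map on the test data,
    (2) measure its sample-weighted L1 distance from the identity map."""
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    # Assign each prediction to a bin (clip so that confidence == 1.0 lands in the last bin).
    bin_ids = np.clip(np.digitize(confidences, edges[1:-1]), 0, n_bins - 1)

    n = len(confidences)
    ece = 0.0
    for b in range(n_bins):
        mask = bin_ids == b
        if not mask.any():
            continue
        # Step 1: the fitted calibration map sends every confidence in this bin
        # to the bin's empirical accuracy.
        fitted_value = correct[mask].mean()
        # Step 2: distance from the identity map, evaluated at the bin's mean
        # confidence and weighted by the fraction of test points in the bin.
        ece += (mask.sum() / n) * abs(fitted_value - confidences[mask].mean())
    return ece

# Toy example: six test predictions with confidences and correctness indicators.
conf = [0.95, 0.90, 0.80, 0.75, 0.60, 0.55]
corr = [1, 1, 0, 1, 0, 1]
print(ece_as_fit(conf, corr, n_bins=10))
```

In this reading, swapping the histogram-binning step for any other post-hoc calibration method (opportunity (1) above) changes only the family of calibration maps being fitted on the test data; the distance-from-identity step stays the same.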