Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Markus Kängsepp

Calibrated Perception Uncertainty Across Objects and Regions in Bird's-Eye-View

Nov 08, 2022

Markus Kängsepp, Meelis Kull

Abstract:In driving scenarios with poor visibility or occlusions, it is important that the autonomous vehicle would take into account all the uncertainties when making driving decisions, including choice of a safe speed. The grid-based perception outputs, such as occupancy grids, and object-based outputs, such as lists of detected objects, must then be accompanied by well-calibrated uncertainty estimates. We highlight limitations in the state-of-the-art and propose a more complete set of uncertainties to be reported, particularly including undetected-object-ahead probabilities. We suggest a novel way to get these probabilistic outputs from bird's-eye-view probabilistic semantic segmentation, in the example of the FIERY model. We demonstrate that the obtained probabilities are not calibrated out-of-the-box and propose methods to achieve well-calibrated uncertainties.

Via

Access Paper or Ask Questions

On the Usefulness of the Fit-on-the-Test View on Evaluating Calibration of Classifiers

Mar 21, 2022

Markus Kängsepp, Kaspar Valk, Meelis Kull

Figure 1 for On the Usefulness of the Fit-on-the-Test View on Evaluating Calibration of Classifiers

Figure 2 for On the Usefulness of the Fit-on-the-Test View on Evaluating Calibration of Classifiers

Figure 3 for On the Usefulness of the Fit-on-the-Test View on Evaluating Calibration of Classifiers

Figure 4 for On the Usefulness of the Fit-on-the-Test View on Evaluating Calibration of Classifiers

Abstract:Every uncalibrated classifier has a corresponding true calibration map that calibrates its confidence. Deviations of this idealistic map from the identity map reveal miscalibration. Such calibration errors can be reduced with many post-hoc calibration methods which fit some family of calibration maps on a validation dataset. In contrast, evaluation of calibration with the expected calibration error (ECE) on the test set does not explicitly involve fitting. However, as we demonstrate, ECE can still be viewed as if fitting a family of functions on the test data. This motivates the fit-on-the-test view on evaluation: first, approximate a calibration map on the test data, and second, quantify its distance from the identity. Exploiting this view allows us to unlock missed opportunities: (1) use the plethora of post-hoc calibration methods for evaluating calibration; (2) tune the number of bins in ECE with cross-validation. Furthermore, we introduce: (3) benchmarking on pseudo-real data where the true calibration map can be estimated very precisely; and (4) novel calibration and evaluation methods using new calibration map families PL and PL3.

* ECML-PKDD journal track. Update 1: removed Statements and Declaration section, added a line about source code to Experiments section, fixed a couple of typos

Via

Access Paper or Ask Questions

Correlated daily time series and forecasting in the M4 competition

Mar 31, 2020

Anti Ingel, Novin Shahroudi, Markus Kängsepp, Andre Tättar, Viacheslav Komisarenko, Meelis Kull

Figure 1 for Correlated daily time series and forecasting in the M4 competition

Figure 2 for Correlated daily time series and forecasting in the M4 competition

Figure 3 for Correlated daily time series and forecasting in the M4 competition

Figure 4 for Correlated daily time series and forecasting in the M4 competition

Abstract:We participated in the M4 competition for time series forecasting and describe here our methods for forecasting daily time series. We used an ensemble of five statistical forecasting methods and a method that we refer to as the correlator. Our retrospective analysis using the ground truth values published by the M4 organisers after the competition demonstrates that the correlator was responsible for most of our gains over the naive constant forecasting method. We identify data leakage as one reason for its success, partly due to test data selected from different time intervals, and partly due to quality issues in the original time series. We suggest that future forecasting competitions should provide actual dates for the time series so that some of those leakages could be avoided by the participants.

* International Journal of Forecasting, 36(1), 121-128 (2020)

Via

Access Paper or Ask Questions

Beyond temperature scaling: Obtaining well-calibrated multiclass probabilities with Dirichlet calibration

Oct 28, 2019

Meelis Kull, Miquel Perello-Nieto, Markus Kängsepp, Telmo Silva Filho, Hao Song, Peter Flach

Figure 1 for Beyond temperature scaling: Obtaining well-calibrated multiclass probabilities with Dirichlet calibration

Figure 2 for Beyond temperature scaling: Obtaining well-calibrated multiclass probabilities with Dirichlet calibration

Figure 3 for Beyond temperature scaling: Obtaining well-calibrated multiclass probabilities with Dirichlet calibration

Figure 4 for Beyond temperature scaling: Obtaining well-calibrated multiclass probabilities with Dirichlet calibration

Abstract:Class probabilities predicted by most multiclass classifiers are uncalibrated, often tending towards over-confidence. With neural networks, calibration can be improved by temperature scaling, a method to learn a single corrective multiplicative factor for inputs to the last softmax layer. On non-neural models the existing methods apply binary calibration in a pairwise or one-vs-rest fashion. We propose a natively multiclass calibration method applicable to classifiers from any model class, derived from Dirichlet distributions and generalising the beta calibration method from binary classification. It is easily implemented with neural nets since it is equivalent to log-transforming the uncalibrated probabilities, followed by one linear layer and softmax. Experiments demonstrate improved probabilistic predictions according to multiple measures (confidence-ECE, classwise-ECE, log-loss, Brier score) across a wide range of datasets and classifiers. Parameters of the learned Dirichlet calibration map provide insights to the biases in the uncalibrated model.

* Accepted for presentation at NeurIPS 2019

Via

Access Paper or Ask Questions