Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Characterising Gradients for Unsupervised Accuracy Estimation under Distribution Shift

Jan 17, 2024

Renchunzi Xie, Ambroise Odonnat, Vasilii Feofanov, Ievgen Redko, Jianfeng Zhang, Bo An

Figure 1 for Characterising Gradients for Unsupervised Accuracy Estimation under Distribution Shift

Figure 2 for Characterising Gradients for Unsupervised Accuracy Estimation under Distribution Shift

Figure 3 for Characterising Gradients for Unsupervised Accuracy Estimation under Distribution Shift

Figure 4 for Characterising Gradients for Unsupervised Accuracy Estimation under Distribution Shift

Share this with someone who'll enjoy it:

Abstract:Estimating test accuracy without access to the ground-truth test labels under varying test environments is a challenging, yet extremely important problem in the safe deployment of machine learning algorithms. Existing works rely on the information from either the outputs or the extracted features of neural networks to formulate an estimation score correlating with the ground-truth test accuracy. In this paper, we investigate--both empirically and theoretically--how the information provided by the gradients can be predictive of the ground-truth test accuracy even under a distribution shift. Specifically, we use the norm of classification-layer gradients, backpropagated from the cross-entropy loss after only one gradient step over test data. Our key idea is that the model should be adjusted with a higher magnitude of gradients when it does not generalize to the test dataset with a distribution shift. We provide theoretical insights highlighting the main ingredients of such an approach ensuring its empirical success. Extensive experiments conducted on diverse distribution shifts and model structures demonstrate that our method significantly outperforms state-of-the-art algorithms.

View paper on

Share this with someone who'll enjoy it:

Title:Characterising Gradients for Unsupervised Accuracy Estimation under Distribution Shift

Paper and Code