Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Towards Understanding GD with Hard and Conjugate Pseudo-labels for Test-Time Adaptation

Oct 18, 2022

Jun-Kun Wang, Andre Wibisono

Figure 1 for Towards Understanding GD with Hard and Conjugate Pseudo-labels for Test-Time Adaptation

Figure 2 for Towards Understanding GD with Hard and Conjugate Pseudo-labels for Test-Time Adaptation

Figure 3 for Towards Understanding GD with Hard and Conjugate Pseudo-labels for Test-Time Adaptation

Figure 4 for Towards Understanding GD with Hard and Conjugate Pseudo-labels for Test-Time Adaptation

Share this with someone who'll enjoy it:

Abstract:We consider a setting that a model needs to adapt to a new domain under distribution shifts, given that only unlabeled test samples from the new domain are accessible at test time. A common idea in most of the related works is constructing pseudo-labels for the unlabeled test samples and applying gradient descent (GD) to a loss function with the pseudo-labels. Recently, Goyal et al. (2022) propose conjugate labels, which is a new kind of pseudo-labels for self-training at test time. They empirically show that the conjugate label outperforms other ways of pseudo-labeling on many domain adaptation benchmarks. However, provably showing that GD with conjugate labels learns a good classifier for test-time adaptation remains open. In this work, we aim at theoretically understanding GD with hard and conjugate labels for a binary classification problem. We show that for square loss, GD with conjugate labels converges to a solution that minimizes the testing 0-1 loss under a Gaussian model, while GD with hard pseudo-labels fails in this task. We also analyze them under different loss functions for the update. Our results shed lights on understanding when and why GD with hard labels or conjugate labels works in test-time adaptation.

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Towards Understanding GD with Hard and Conjugate Pseudo-labels for Test-Time Adaptation

Paper and Code