Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Distributionally Robust Models with Parametric Likelihood Ratios

Apr 13, 2022

Paul Michel, Tatsunori Hashimoto, Graham Neubig

Figure 1 for Distributionally Robust Models with Parametric Likelihood Ratios

Figure 2 for Distributionally Robust Models with Parametric Likelihood Ratios

Figure 3 for Distributionally Robust Models with Parametric Likelihood Ratios

Figure 4 for Distributionally Robust Models with Parametric Likelihood Ratios

Share this with someone who'll enjoy it:

Abstract:As machine learning models are deployed ever more broadly, it becomes increasingly important that they are not only able to perform well on their training distribution, but also yield accurate predictions when confronted with distribution shift. The Distributionally Robust Optimization (DRO) framework proposes to address this issue by training models to minimize their expected risk under a collection of distributions, to imitate test-time shifts. This is most commonly achieved by instance-level re-weighting of the training objective to emulate the likelihood ratio with possible test distributions, which allows for estimating their empirical risk via importance sampling (assuming that they are subpopulations of the training distribution). However, re-weighting schemes in the literature are usually limited due to the difficulty of keeping the optimization problem tractable and the complexity of enforcing normalization constraints. In this paper, we show that three simple ideas -- mini-batch level normalization, a KL penalty and simultaneous gradient updates -- allow us to train models with DRO using a broader class of parametric likelihood ratios. In a series of experiments on both image and text classification benchmarks, we find that models trained with the resulting parametric adversaries are consistently more robust to subpopulation shifts when compared to other DRO approaches, and that the method performs reliably well with little hyper-parameter tuning. Code to reproduce our experiments can be found at https://github.com/pmichel31415/P-DRO.

* ICLR 2022

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Distributionally Robust Models with Parametric Likelihood Ratios

Paper and Code