Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Focus on the Common Good: Group Distributional Robustness Follows

Oct 06, 2021

Vihari Piratla, Praneeth Netrapalli, Sunita Sarawagi

Figure 1 for Focus on the Common Good: Group Distributional Robustness Follows

Figure 2 for Focus on the Common Good: Group Distributional Robustness Follows

Figure 3 for Focus on the Common Good: Group Distributional Robustness Follows

Figure 4 for Focus on the Common Good: Group Distributional Robustness Follows

Share this with someone who'll enjoy it:

Abstract:We consider the problem of training a classification model with group annotated training data. Recent work has established that, if there is distribution shift across different groups, models trained using the standard empirical risk minimization (ERM) objective suffer from poor performance on minority groups and that group distributionally robust optimization (Group-DRO) objective is a better alternative. The starting point of this paper is the observation that though Group-DRO performs better than ERM on minority groups for some benchmark datasets, there are several other datasets where it performs much worse than ERM. Inspired by ideas from the closely related problem of domain generalization, this paper proposes a new and simple algorithm that explicitly encourages learning of features that are shared across various groups. The key insight behind our proposed algorithm is that while Group-DRO focuses on groups with worst regularized loss, focusing instead, on groups that enable better performance even on other groups, could lead to learning of shared/common features, thereby enhancing minority performance beyond what is achieved by Group-DRO. Empirically, we show that our proposed algorithm matches or achieves better performance compared to strong contemporary baselines including ERM and Group-DRO on standard benchmarks on both minority groups and across all groups. Theoretically, we show that the proposed algorithm is a descent method and finds first order stationary points of smooth nonconvex functions.

* Under review; Code can be found at: https://github.com/vps-anonconfs/cg-iclr22

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Focus on the Common Good: Group Distributional Robustness Follows

Paper and Code