Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Songkai Xue

Distributionally Robust Performative Prediction

Dec 05, 2024

Songkai Xue, Yuekai Sun

Figure 1 for Distributionally Robust Performative Prediction

Figure 2 for Distributionally Robust Performative Prediction

Figure 3 for Distributionally Robust Performative Prediction

Figure 4 for Distributionally Robust Performative Prediction

Abstract:Performative prediction aims to model scenarios where predictive outcomes subsequently influence the very systems they target. The pursuit of a performative optimum (PO) -- minimizing performative risk -- is generally reliant on modeling of the distribution map, which characterizes how a deployed ML model alters the data distribution. Unfortunately, inevitable misspecification of the distribution map can lead to a poor approximation of the true PO. To address this issue, we introduce a novel framework of distributionally robust performative prediction and study a new solution concept termed as distributionally robust performative optimum (DRPO). We show provable guarantees for DRPO as a robust approximation to the true PO when the nominal distribution map is different from the actual one. Moreover, distributionally robust performative prediction can be reformulated as an augmented performative prediction problem, enabling efficient optimization. The experimental results demonstrate that DRPO offers potential advantages over traditional PO approach when the distribution map is misspecified at either micro- or macro-level.

* In Proceedings of the 38th Conference on Neural Information Processing Systems (NeurIPS) 2024

Via

Access Paper or Ask Questions

Minimax Regret Learning for Data with Heterogeneous Subgroups

May 02, 2024

Weibin Mo, Weijing Tang, Songkai Xue, Yufeng Liu, Ji Zhu

Abstract:Modern complex datasets often consist of various sub-populations. To develop robust and generalizable methods in the presence of sub-population heterogeneity, it is important to guarantee a uniform learning performance instead of an average one. In many applications, prior information is often available on which sub-population or group the data points belong to. Given the observed groups of data, we develop a min-max-regret (MMR) learning framework for general supervised learning, which targets to minimize the worst-group regret. Motivated from the regret-based decision theoretic framework, the proposed MMR is distinguished from the value-based or risk-based robust learning methods in the existing literature. The regret criterion features several robustness and invariance properties simultaneously. In terms of generalizability, we develop the theoretical guarantee for the worst-case regret over a super-population of the meta data, which incorporates the observed sub-populations, their mixtures, as well as other unseen sub-populations that could be approximated by the observed ones. We demonstrate the effectiveness of our method through extensive simulation studies and an application to kidney transplantation data from hundreds of transplant centers.

Via

Access Paper or Ask Questions

Calibrated Data-Dependent Constraints with Exact Satisfaction Guarantees

Jan 15, 2023

Songkai Xue, Yuekai Sun, Mikhail Yurochkin

Figure 1 for Calibrated Data-Dependent Constraints with Exact Satisfaction Guarantees

Figure 2 for Calibrated Data-Dependent Constraints with Exact Satisfaction Guarantees

Figure 3 for Calibrated Data-Dependent Constraints with Exact Satisfaction Guarantees

Figure 4 for Calibrated Data-Dependent Constraints with Exact Satisfaction Guarantees

Abstract:We consider the task of training machine learning models with data-dependent constraints. Such constraints often arise as empirical versions of expected value constraints that enforce fairness or stability goals. We reformulate data-dependent constraints so that they are calibrated: enforcing the reformulated constraints guarantees that their expected value counterparts are satisfied with a user-prescribed probability. The resulting optimization problem is amendable to standard stochastic optimization algorithms, and we demonstrate the efficacy of our method on a fairness-sensitive classification task where we wish to guarantee the classifier's fairness (at test time).

* In Proceedings of the 36th Conference on Neural Information Processing Systems (NeurIPS) 2022

Via

Access Paper or Ask Questions

How does overparametrization affect performance on minority groups?

Jun 07, 2022

Subha Maity, Saptarshi Roy, Songkai Xue, Mikhail Yurochkin, Yuekai Sun

Figure 1 for How does overparametrization affect performance on minority groups?

Figure 2 for How does overparametrization affect performance on minority groups?

Figure 3 for How does overparametrization affect performance on minority groups?

Figure 4 for How does overparametrization affect performance on minority groups?

Abstract:The benefits of overparameterization for the overall performance of modern machine learning (ML) models are well known. However, the effect of overparameterization at a more granular level of data subgroups is less understood. Recent empirical studies demonstrate encouraging results: (i) when groups are not known, overparameterized models trained with empirical risk minimization (ERM) perform better on minority groups; (ii) when groups are known, ERM on data subsampled to equalize group sizes yields state-of-the-art worst-group-accuracy in the overparameterized regime. In this paper, we complement these empirical studies with a theoretical investigation of the risk of overparameterized random feature models on minority groups. In a setting in which the regression functions for the majority and minority groups are different, we show that overparameterization always improves minority group performance.

Via

Access Paper or Ask Questions

Statistical inference for individual fairness

Mar 30, 2021

Subha Maity, Songkai Xue, Mikhail Yurochkin, Yuekai Sun

Figure 1 for Statistical inference for individual fairness

Figure 2 for Statistical inference for individual fairness

Figure 3 for Statistical inference for individual fairness

Figure 4 for Statistical inference for individual fairness

Abstract:As we rely on machine learning (ML) models to make more consequential decisions, the issue of ML models perpetuating or even exacerbating undesirable historical biases (e.g., gender and racial biases) has come to the fore of the public's attention. In this paper, we focus on the problem of detecting violations of individual fairness in ML models. We formalize the problem as measuring the susceptibility of ML models against a form of adversarial attack and develop a suite of inference tools for the adversarial cost function. The tools allow auditors to assess the individual fairness of ML models in a statistically-principled way: form confidence intervals for the worst-case performance differential between similar individuals and test hypotheses of model fairness with (asymptotic) non-coverage/Type I error rate control. We demonstrate the utility of our tools in a real-world case study.

Via

Access Paper or Ask Questions

Auditing ML Models for Individual Bias and Unfairness

Mar 11, 2020

Songkai Xue, Mikhail Yurochkin, Yuekai Sun

Figure 1 for Auditing ML Models for Individual Bias and Unfairness

Figure 2 for Auditing ML Models for Individual Bias and Unfairness

Figure 3 for Auditing ML Models for Individual Bias and Unfairness

Figure 4 for Auditing ML Models for Individual Bias and Unfairness

Abstract:We consider the task of auditing ML models for individual bias/unfairness. We formalize the task in an optimization problem and develop a suite of inferential tools for the optimal value. Our tools permit us to obtain asymptotic confidence intervals and hypothesis tests that cover the target/control the Type I error rate exactly. To demonstrate the utility of our tools, we use them to reveal the gender and racial biases in Northpointe's COMPAS recidivism prediction instrument.

* In Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS) 2020

Via

Access Paper or Ask Questions