Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tobias Wegel

Learning Pareto manifolds in high dimensions: How can regularization help?

Mar 11, 2025

Tobias Wegel, Filip Kovačević, Alexandru Ţifrea, Fanny Yang

Abstract:Simultaneously addressing multiple objectives is becoming increasingly important in modern machine learning. At the same time, data is often high-dimensional and costly to label. For a single objective such as prediction risk, conventional regularization techniques are known to improve generalization when the data exhibits low-dimensional structure like sparsity. However, it is largely unexplored how to leverage this structure in the context of multi-objective learning (MOL) with multiple competing objectives. In this work, we discuss how the application of vanilla regularization approaches can fail, and propose a two-stage MOL framework that can successfully leverage low-dimensional structure. We demonstrate its effectiveness experimentally for multi-distribution learning and fairness-risk trade-offs.

* Published in Proceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS) 2025

Via

Access Paper or Ask Questions

Early-Stopped Mirror Descent for Linear Regression over Convex Bodies

Mar 05, 2025

Tobias Wegel, Gil Kur, Patrick Rebeschini

Abstract:Early-stopped iterative optimization methods are widely used as alternatives to explicit regularization, and direct comparisons between early-stopping and explicit regularization have been established for many optimization geometries. However, most analyses depend heavily on the specific properties of the optimization geometry or strong convexity of the empirical objective, and it remains unclear whether early-stopping could ever be less statistically efficient than explicit regularization for some particular shape constraint, especially in the overparameterized regime. To address this question, we study the setting of high-dimensional linear regression under additive Gaussian noise when the ground truth is assumed to lie in a known convex body and the task is to minimize the in-sample mean squared error. Our main result shows that for any convex body and any design matrix, up to an absolute constant factor, the worst-case risk of unconstrained early-stopped mirror descent with an appropriate potential is at most that of the least squares estimator constrained to the convex body. We achieve this by constructing algorithmic regularizers based on the Minkowski functional of the convex body.

Via

Access Paper or Ask Questions

A Framework for Verification of Wasserstein Adversarial Robustness

Oct 13, 2021

Tobias Wegel, Felix Assion, David Mickisch, Florens Greßner

Figure 1 for A Framework for Verification of Wasserstein Adversarial Robustness

Figure 2 for A Framework for Verification of Wasserstein Adversarial Robustness

Figure 3 for A Framework for Verification of Wasserstein Adversarial Robustness

Figure 4 for A Framework for Verification of Wasserstein Adversarial Robustness

Abstract:Machine learning image classifiers are susceptible to adversarial and corruption perturbations. Adding imperceptible noise to images can lead to severe misclassifications of the machine learning model. Using $L_p$-norms for measuring the size of the noise fails to capture human similarity perception, which is why optimal transport based distance measures like the Wasserstein metric are increasingly being used in the field of adversarial robustness. Verifying the robustness of classifiers using the Wasserstein metric can be achieved by proving the absence of adversarial examples (certification) or proving their presence (attack). In this work we present a framework based on the work by Levine and Feizi, which allows us to transfer existing certification methods for convex polytopes or $L_1$-balls to the Wasserstein threat model. The resulting certification can be complete or incomplete, depending on whether convex polytopes or $L_1$-balls were chosen. Additionally, we present a new Wasserstein adversarial attack that is projected gradient descent based and which has a significantly reduced computational burden compared to existing attack approaches.

* 10 pages, 4 figures

Via

Access Paper or Ask Questions