Abstract: Logistic regression is a classical model for describing the probabilistic dependence of binary responses on multivariate covariates. We consider the predictive performance of the maximum likelihood estimator (MLE) for logistic regression, assessed in terms of logistic risk. We study two questions: first, the existence of the MLE (which occurs when the dataset is not linearly separated), and second, its accuracy when it exists. These properties depend both on the dimension of the covariates and on the signal strength. In the case of Gaussian covariates and a well-specified logistic model, we obtain sharp non-asymptotic guarantees for the existence and the excess logistic risk of the MLE. We then generalize these results in two ways: first, to non-Gaussian covariates satisfying a certain two-dimensional margin condition, and second, to the general setting of statistical learning with a possibly misspecified logistic model. Finally, we consider the case of a Bernoulli design, where the behavior of the MLE is highly sensitive to the direction of the parameter.
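A minimal numerical sketch of the setting described above, assuming Gaussian covariates and a well-specified logistic model; the dimension, sample size, and signal strength below are illustrative choices, not values from the paper. The unpenalized scikit-learn fit stands in for the MLE, which exists only when the simulated dataset is not linearly separated.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import log_loss

rng = np.random.default_rng(0)
n, d = 2000, 20
beta = np.zeros(d)
beta[0] = 2.0                       # signal strength (illustrative)

# Well-specified logistic model with Gaussian covariates.
X = rng.standard_normal((n, d))
p = 1.0 / (1.0 + np.exp(-X @ beta))
y = rng.binomial(1, p)

# Unpenalized fit (scikit-learn >= 1.2) approximates the MLE; the MLE
# exists only if the data are not linearly separated.
mle = LogisticRegression(penalty=None, max_iter=10_000).fit(X, y)

# Logistic (cross-entropy) risk of the MLE on fresh data, compared with
# the risk of the true parameter, giving an estimate of the excess risk.
X_test = rng.standard_normal((100_000, d))
p_test = 1.0 / (1.0 + np.exp(-X_test @ beta))
y_test = rng.binomial(1, p_test)
risk_mle = log_loss(y_test, mle.predict_proba(X_test)[:, 1])
risk_true = log_loss(y_test, p_test)
print(f"excess logistic risk ~ {risk_mle - risk_true:.4f}")
```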
Abstract: Recent developments in advanced driver-assistance systems necessitate an increasing number of tests to validate new technologies. These tests cannot be carried out on track in a reasonable amount of time, and automotive groups rely on simulators to perform most of them. The reliability of these simulators for constantly refined tasks is becoming an issue, and, to increase the number of tests, the industry is now developing surrogate models that should mimic the behavior of the simulator while being much faster to run on specific tasks. In this paper, we aim to construct a surrogate model to mimic and replace the simulator. We first test several classical methods such as random forests, ridge regression, and convolutional neural networks. Then we build three hybrid models that combine these methods to obtain an efficient surrogate model.
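As a rough illustration of the kind of hybrid construction mentioned above, the sketch below stacks a random forest and a ridge regression with scikit-learn; it is not the paper's actual surrogate model, and the synthetic data-generating function is a placeholder for the simulator.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor, StackingRegressor
from sklearn.linear_model import Ridge
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error

rng = np.random.default_rng(0)

# Synthetic stand-in for (simulator inputs -> simulator output).
X = rng.uniform(-1.0, 1.0, size=(5000, 10))
y = np.sin(3 * X[:, 0]) + X[:, 1] ** 2 + 0.1 * rng.standard_normal(5000)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Hybrid surrogate: base learners combined by a ridge meta-model.
surrogate = StackingRegressor(
    estimators=[
        ("rf", RandomForestRegressor(n_estimators=200, random_state=0)),
        ("ridge", Ridge(alpha=1.0)),
    ],
    final_estimator=Ridge(alpha=1.0),
)
surrogate.fit(X_tr, y_tr)
print("test MSE:", mean_squared_error(y_te, surrogate.predict(X_te)))
```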
Abstract: We investigate the benign overfitting phenomenon in the large-deviation regime, where the bounds on the prediction risk hold with probability $1-e^{-\zeta n}$ for some absolute constant $\zeta$. We prove that these bounds can converge to $0$ for the quadratic loss. We obtain this result through a new analysis of the interpolating estimator with minimal Euclidean norm, relying on a preliminary localization of this estimator with respect to the Euclidean norm. This new analysis complements and strengthens particular cases obtained in previous works for the square loss, and it extends to other loss functions. To illustrate this, we also provide excess risk bounds for the Huber and absolute losses, two widely used losses in robust statistics.
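A short sketch of the estimator analyzed above: the interpolating estimator with minimal Euclidean norm in an overparameterized linear model, together with its excess quadratic risk. The dimensions, noise level, and isotropic Gaussian design are illustrative assumptions, not the paper's setting.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 100, 500                      # d > n, so exact interpolation is possible
theta = rng.standard_normal(d) / np.sqrt(d)

X = rng.standard_normal((n, d))
y = X @ theta + 0.1 * rng.standard_normal(n)

# Minimum Euclidean norm interpolator: the smallest-norm solution of
# X theta = y, computed via the Moore-Penrose pseudo-inverse.
theta_hat = np.linalg.pinv(X) @ y
assert np.allclose(X @ theta_hat, y)         # interpolates the training data

# For isotropic Gaussian covariates, the excess quadratic risk equals
# the squared Euclidean distance to the true parameter.
excess = np.sum((theta_hat - theta) ** 2)
print(f"excess quadratic risk ~ {excess:.4f}")
```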
Abstract: These notes gather recent results in robust statistical learning theory. The goal is to stress the main principles underlying the construction and theoretical analysis of robust estimators, rather than to provide an exhaustive account of this rapidly growing field. The notes are the basis of lectures given at the conference StatMathAppli 2019.
Abstract: The pair-matching problem appears in many applications where one wants to discover good matches between pairs of individuals. Formally, the set of individuals is represented by the nodes of a graph whose edges, unobserved at first, represent the good matches. The algorithm queries pairs of nodes and observes the presence or absence of edges. Its goal is to discover as many edges as possible with a fixed budget of queries. Pair-matching is a particular instance of the multi-armed bandit problem in which the arms are pairs of individuals and the rewards are edges linking these pairs. This bandit problem is non-standard, though, as each arm can only be played once. Given this constraint, sublinear regret can be expected only if the graph presents some underlying structure. This paper shows that sublinear regret is achievable when the graph is generated according to a Stochastic Block Model (SBM) with two communities. Optimal regret bounds are computed for this pair-matching problem; they exhibit a phase transition related to the Kesten-Stigum threshold for community detection in the SBM. To avoid undesirable features of optimal solutions, the pair-matching problem is also considered under the constraint that each node is sampled fewer than a given number of times. We show how this constraint deteriorates the optimal regret rates. The paper concludes with a conjecture regarding the optimal regret when the number of communities is larger than $2$. Contrary to the two-community case, we believe that a statistical-computational gap would appear in this problem.
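The toy simulation below illustrates the query model: a two-community SBM is drawn, and a baseline strategy queries uniformly random pairs (each pair at most once) within a fixed budget. The parameters and the naive strategy are illustrative; the point of the paper is that strategies exploiting the community structure discover more edges with the same budget, which this baseline does not attempt.

```python
import numpy as np

rng = np.random.default_rng(0)

# Two-community SBM: within-community edges more likely than across.
n, p_in, p_out, budget = 400, 0.10, 0.02, 5000
labels = rng.integers(0, 2, size=n)
probs = np.where(labels[:, None] == labels[None, :], p_in, p_out)
upper = np.triu(rng.random((n, n)) < probs, k=1)
adj = upper | upper.T

def random_pair_matching(adj, budget, rng):
    """Baseline strategy: query uniformly random, previously unqueried pairs
    (each pair can be played only once) and count the edges discovered."""
    n = adj.shape[0]
    queried, hits = set(), 0
    while len(queried) < budget:
        i, j = sorted(rng.choice(n, size=2, replace=False))
        if (i, j) in queried:
            continue
        queried.add((i, j))
        hits += int(adj[i, j])
    return hits

print("edges discovered:", random_pair_matching(adj, budget, rng))
```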
Abstract: Mean embeddings provide an extremely flexible and powerful tool in machine learning and statistics to represent probability distributions and to define a semi-metric (MMD, maximum mean discrepancy; also called N-distance or energy distance), with numerous successful applications. The representation is constructed as the expectation of the feature map defined by a kernel. Being a mean, its classical empirical estimator can be arbitrarily severely affected by even a single outlier when the features are unbounded. To the best of our knowledge, even the consistency of the few existing techniques that attempt to alleviate this serious sensitivity bottleneck is unknown. In this paper, we show how the recently emerged principle of median-of-means can be used to design estimators of the kernel mean embedding and MMD with strong resistance to outliers and optimal sub-Gaussian deviation bounds under mild assumptions.
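A simplified sketch of the median-of-means idea applied to MMD estimation: split the samples into blocks, compute a block-wise MMD estimate, and return the median, so that a few outlying points corrupt only a minority of blocks. The Gaussian kernel, the blocking scheme, and the contamination below are illustrative simplifications rather than the exact estimators studied in the paper.

```python
import numpy as np

def gaussian_kernel(X, Y, bandwidth=1.0):
    """Gaussian (RBF) kernel matrix between rows of X and rows of Y."""
    d2 = np.sum(X**2, 1)[:, None] + np.sum(Y**2, 1)[None, :] - 2 * X @ Y.T
    return np.exp(-d2 / (2 * bandwidth**2))

def mmd2(X, Y, bandwidth=1.0):
    """Plain (V-statistic) estimate of the squared MMD between two samples."""
    return (gaussian_kernel(X, X, bandwidth).mean()
            + gaussian_kernel(Y, Y, bandwidth).mean()
            - 2 * gaussian_kernel(X, Y, bandwidth).mean())

def mom_mmd2(X, Y, n_blocks=10, bandwidth=1.0):
    """Median-of-means: squared MMD computed on disjoint blocks, then the
    median over blocks is returned."""
    Xb, Yb = np.array_split(X, n_blocks), np.array_split(Y, n_blocks)
    return np.median([mmd2(xb, yb, bandwidth) for xb, yb in zip(Xb, Yb)])

rng = np.random.default_rng(0)
X = rng.standard_normal((1000, 2))
Y = rng.standard_normal((1000, 2)) + 0.5
X[:5] += 100.0                        # a handful of gross outliers
print("plain MMD^2:", mmd2(X, Y))
print("MoM   MMD^2:", mom_mmd2(X, Y))
```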
Abstract: This paper studies V-fold cross-validation for model selection in least-squares density estimation. The goal is to provide theoretical grounds for choosing $V$ in order to minimize the least-squares loss of the selected estimator. We first prove a non-asymptotic oracle inequality for V-fold cross-validation and its bias-corrected version (V-fold penalization). In particular, this result implies that V-fold penalization is asymptotically optimal in the nonparametric case. Then, we compute the variance of V-fold cross-validation and related criteria, as well as the variance of key quantities for model selection performance. We show that these variances depend on $V$ as $1+4/(V-1)$, at least in some particular cases, suggesting that the performance improves substantially from $V=2$ to $V=5$ or $10$ and is then almost constant. Overall, this can explain the common advice to take $V=5$, at least in our setting and when computational power is limited, as supported by simulation experiments. An oracle inequality and exact formulas for the variance are also proved for Monte-Carlo cross-validation, also known as repeated cross-validation, where the parameter $V$ is replaced by the number $B$ of random splits of the data.
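The sketch below illustrates V-fold cross-validation for least-squares density estimation: it selects the number of bins of a histogram estimator by minimizing the cross-validated least-squares contrast. The model collection (regular histograms on [0, 1]) and the data distribution are illustrative choices, not the paper's exact framework.

```python
import numpy as np

def hist_density(train, m):
    """Histogram density estimator on [0, 1] with m equal-width bins."""
    counts, edges = np.histogram(train, bins=m, range=(0.0, 1.0))
    heights = counts / (len(train) * (1.0 / m))   # piecewise-constant density
    return heights, edges

def ls_contrast(heights, edges, valid):
    """Empirical least-squares contrast: integral of fhat^2 minus
    2/n times the sum of fhat over the held-out points."""
    h = edges[1] - edges[0]
    idx = np.clip(np.searchsorted(edges, valid, side="right") - 1,
                  0, len(heights) - 1)
    return np.sum(heights**2) * h - 2.0 * np.mean(heights[idx])

def vfold_select(X, models, V, rng):
    """Pick the bin count minimizing the V-fold cross-validated contrast."""
    folds = np.array_split(rng.permutation(len(X)), V)
    scores = {}
    for m in models:
        cv = 0.0
        for k in range(V):
            valid = X[folds[k]]
            train = np.delete(X, folds[k])
            heights, edges = hist_density(train, m)
            cv += ls_contrast(heights, edges, valid)
        scores[m] = cv / V
    return min(scores, key=scores.get)

rng = np.random.default_rng(0)
X = rng.beta(2, 5, size=2000)                     # density supported on [0, 1]
print("selected bins (V=5):", vfold_select(X, models=range(2, 60), V=5, rng=rng))
```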