Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ieva Petrulionyte

Non-Stationary Functional Bilevel Optimization

Jan 21, 2026

Jason Bohne, Ieva Petrulionyte, Michael Arbel, Julien Mairal, Paweł Polak

Abstract:Functional bilevel optimization (FBO) provides a powerful framework for hierarchical learning in function spaces, yet current methods are limited to static offline settings and perform suboptimally in online, non-stationary scenarios. We propose SmoothFBO, the first algorithm for non-stationary FBO with both theoretical guarantees and practical scalability. SmoothFBO introduces a time-smoothed stochastic hypergradient estimator that reduces variance through a window parameter, enabling stable outer-loop updates with sublinear regret. Importantly, the classical parametric bilevel case is a special reduction of our framework, making SmoothFBO a natural extension to online, non-stationary settings. Empirically, SmoothFBO consistently outperforms existing FBO methods in non-stationary hyperparameter optimization and model-based reinforcement learning, demonstrating its practical effectiveness. Together, these results establish SmoothFBO as a general, theoretically grounded, and practically viable foundation for bilevel optimization in online, non-stationary scenarios.

Via

Access Paper or Ask Questions

Functional Bilevel Optimization for Machine Learning

Mar 29, 2024

Ieva Petrulionyte, Julien Mairal, Michael Arbel

Figure 1 for Functional Bilevel Optimization for Machine Learning

Figure 2 for Functional Bilevel Optimization for Machine Learning

Figure 3 for Functional Bilevel Optimization for Machine Learning

Figure 4 for Functional Bilevel Optimization for Machine Learning

Abstract:In this paper, we introduce a new functional point of view on bilevel optimization problems for machine learning, where the inner objective is minimized over a function space. These types of problems are most often solved by using methods developed in the parametric setting, where the inner objective is strongly convex with respect to the parameters of the prediction function. The functional point of view does not rely on this assumption and notably allows using over-parameterized neural networks as the inner prediction function. We propose scalable and efficient algorithms for the functional bilevel optimization problem and illustrate the benefits of our approach on instrumental regression and reinforcement learning tasks, which admit natural functional bilevel structures.

Via

Access Paper or Ask Questions