Abstract:Stein Variational Gradient Descent (SVGD) is a widely used sampling algorithm that has been successfully applied in several areas of Machine Learning. SVGD operates by iteratively moving a set of interacting particles (which represent the samples) to approximate the target distribution. Despite recent studies on the complexity of SVGD and its variants, their long-time asymptotic behavior (i.e., after numerous iterations) is still not understood in the finite-number-of-particles regime. We study the long-time asymptotic behavior of a noisy variant of SVGD. First, we establish that the limit set of noisy SVGD after a large number of iterations is well-defined. We then characterize this limit set, showing that it approaches the target distribution as the number of particles increases. In particular, noisy SVGD provably avoids the variance collapse observed for SVGD. Our approach involves demonstrating that the trajectories of noisy SVGD closely resemble those described by a McKean-Vlasov process.
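As a rough illustration (not the paper's exact scheme), the following minimal NumPy sketch runs noisy SVGD on a standard Gaussian target with an RBF kernel; the kernel bandwidth and the square-root-of-step noise scaling are assumptions made here for concreteness.

```python
import numpy as np

def rbf_kernel(x, bandwidth=1.0):
    """Pairwise RBF kernel matrix and its gradients w.r.t. the first argument."""
    diffs = x[:, None, :] - x[None, :, :]              # (n, n, d)
    sq_dists = np.sum(diffs ** 2, axis=-1)              # (n, n)
    K = np.exp(-sq_dists / (2 * bandwidth ** 2))        # (n, n)
    grad_K = -diffs / bandwidth ** 2 * K[:, :, None]    # d/dx_i k(x_i, x_j)
    return K, grad_K

def noisy_svgd_step(x, grad_log_p, step=1e-2, noise=1e-2, rng=None):
    """One iteration of noisy SVGD: the usual SVGD direction plus Gaussian noise."""
    rng = np.random.default_rng() if rng is None else rng
    n = x.shape[0]
    K, grad_K = rbf_kernel(x)
    # Attractive term (kernel-weighted scores) + repulsive term (kernel gradients).
    phi = (K @ grad_log_p(x) + grad_K.sum(axis=0)) / n
    return x + step * phi + np.sqrt(step) * noise * rng.standard_normal(x.shape)

# Example: sample from a standard 2-D Gaussian, whose score is grad log p(x) = -x.
rng = np.random.default_rng(0)
particles = rng.standard_normal((50, 2))
for _ in range(500):
    particles = noisy_svgd_step(particles, lambda x: -x, rng=rng)
```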
Abstract:In non-smooth stochastic optimization, we establish the non-convergence of stochastic subgradient descent (SGD) to the critical points recently called active strict saddles by Davis and Drusvyatskiy. Such points lie on a manifold $M$ where the function $f$ has a direction of second-order negative curvature. Off this manifold, the norm of the Clarke subdifferential of $f$ is lower-bounded. We require two conditions on $f$. The first assumption is a Verdier stratification condition, a refinement of the popular Whitney stratification; it allows us to establish a reinforced version of the projection formula of Bolte \emph{et al.} for Whitney stratifiable functions, which is of independent interest. The second assumption, termed the angle condition, allows us to control the distance of the iterates to $M$. When $f$ is weakly convex, our assumptions are generic. Consequently, generically in the class of definable weakly convex functions, SGD converges to a local minimizer.
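For intuition, the sketch below runs stochastic subgradient descent on a simple weakly convex function whose origin is an active strict saddle; the function, the noise model and the step sizes are illustrative choices made here, not taken from the paper.

```python
import numpy as np

def subgrad_f(z):
    """A Clarke subgradient of f(x, y) = |x| + y**4/4 - y**2/2.
    The origin lies on the manifold M = {x = 0}; along M the function has
    second-order negative curvature (an active strict saddle), while the
    local minimizers are (0, 1) and (0, -1)."""
    x, y = z
    gx = np.sign(x)          # any element of [-1, 1] is valid at x = 0
    return np.array([gx, y ** 3 - y])

rng = np.random.default_rng(0)
z = np.array([0.5, 0.01])                     # start near the saddle at the origin
for k in range(1, 20001):
    step = 0.5 / k ** 0.7                     # vanishing step sizes
    z = z - step * (subgrad_f(z) + 0.1 * rng.standard_normal(2))
print(z)  # typically ends near one of the local minimizers (0, 1) or (0, -1)
```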
Abstract:Actor-critic methods integrating target networks have exhibited remarkable empirical success in deep reinforcement learning. However, a theoretical understanding of the use of target networks in actor-critic methods is largely missing in the literature. In this paper, we bridge this gap between theory and practice by proposing the first theoretical analysis of an online target-based actor-critic algorithm with linear function approximation in the discounted reward setting. Our algorithm uses three different timescales: one for the actor and two for the critic. Instead of using the standard single-timescale temporal difference (TD) learning algorithm as a critic, we use a two-timescale target-based version of TD learning closely inspired by practical actor-critic algorithms implementing target networks. First, we establish asymptotic convergence results for both the critic and the actor under Markovian sampling. Then, we provide a finite-time analysis showing the impact of incorporating a target network into actor-critic methods.
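As a rough sketch of the critic only (the actor update and the paper's exact scheme are omitted), the following shows a two-timescale target-based TD(0) recursion with linear features; `env_step` and `featurize` are placeholder names, and the Polyak-style target update is one common choice assumed here.

```python
import numpy as np

def target_td_critic(env_step, featurize, num_steps, s0,
                     gamma=0.99, alpha=0.05, beta=0.005):
    """Two-timescale target-based TD(0) with linear value approximation.
    `theta` is the fast online weight vector; `theta_bar` is the slowly
    tracking target weight vector used to bootstrap (beta << alpha).
    `env_step(s)` is a placeholder returning (reward, next_state)."""
    d = featurize(s0).shape[0]
    theta, theta_bar = np.zeros(d), np.zeros(d)
    s = s0
    for _ in range(num_steps):
        r, s_next = env_step(s)
        phi, phi_next = featurize(s), featurize(s_next)
        td_error = r + gamma * theta_bar @ phi_next - theta @ phi
        theta = theta + alpha * td_error * phi               # fast timescale
        theta_bar = theta_bar + beta * (theta - theta_bar)   # slow target update
        s = s_next
    return theta, theta_bar
```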
Abstract:This paper introduces the \textit{weighted partial copula} function for testing conditional independence. The proposed test procedure combines two ingredients: (i) the test statistic is an explicit Cramér-von Mises transformation of the \textit{weighted partial copula}; (ii) the rejection regions are computed using a bootstrap procedure which mimics conditional independence by generating samples from the product measure of the estimated conditional marginals. Under conditional independence, the weak convergence of the \textit{weighted partial copula process} is established when the marginals are estimated using a smoothed local linear estimator. Finally, an experimental section demonstrates that the proposed test has competitive power compared to recent state-of-the-art methods such as kernel-based tests.
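As a schematic illustration of ingredient (ii), the sketch below calibrates a conditional independence test by bootstrap under the product of estimated conditional marginals; `statistic`, `sample_x_given_z` and `sample_y_given_z` are placeholders (e.g., a Cramér-von Mises statistic and samplers built from smoothed local linear estimates), not the paper's exact implementation.

```python
import numpy as np

def bootstrap_ci_test(x, y, z, statistic, sample_x_given_z, sample_y_given_z,
                      n_boot=500, level=0.05, rng=None):
    """Generic bootstrap calibration mimicking conditional independence:
    X and Y are resampled independently from their estimated conditional
    marginals given Z, and the test statistic is recomputed each time."""
    rng = np.random.default_rng() if rng is None else rng
    t_obs = statistic(x, y, z)
    t_boot = np.empty(n_boot)
    for b in range(n_boot):
        x_star = sample_x_given_z(z, rng)   # drawn from the product measure
        y_star = sample_y_given_z(z, rng)   # of the estimated conditionals
        t_boot[b] = statistic(x_star, y_star, z)
    threshold = np.quantile(t_boot, 1.0 - level)
    return t_obs > threshold, t_obs, threshold
```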
Abstract:Although ADAM is a very popular algorithm for optimizing the weights of neural networks, it has recently been shown that it can diverge even in simple convex optimization examples. Several variants of ADAM have been proposed to circumvent this convergence issue. In this work, we study the ADAM algorithm for smooth nonconvex optimization under a boundedness assumption on the adaptive learning rate. The bound on the adaptive step size depends on the Lipschitz constant of the gradient of the objective function and provides safe theoretical adaptive step sizes. Under this boundedness assumption, we show a novel first-order convergence rate result in both the deterministic and stochastic contexts. Furthermore, we establish convergence rates of the function value sequence using the Kurdyka-Łojasiewicz property.
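As a rough illustration, the sketch below runs ADAM with the per-coordinate adaptive step size capped by a bound tied to the Lipschitz constant of the gradient; the specific cap 1/L is an assumption standing in for the paper's exact boundedness condition.

```python
import numpy as np

def adam_bounded(grad, x0, lipschitz, num_iters=1000, lr=1e-3,
                 beta1=0.9, beta2=0.999, eps=1e-8):
    """ADAM with the adaptive learning rate clipped to a 'safe' bound
    depending on the Lipschitz constant of the gradient (illustrative cap)."""
    x = np.asarray(x0, dtype=float).copy()
    m = np.zeros_like(x)
    v = np.zeros_like(x)
    for t in range(1, num_iters + 1):
        g = grad(x)
        m = beta1 * m + (1 - beta1) * g
        v = beta2 * v + (1 - beta2) * g ** 2
        m_hat = m / (1 - beta1 ** t)
        v_hat = v / (1 - beta2 ** t)
        step = lr / (np.sqrt(v_hat) + eps)           # adaptive learning rate
        step = np.minimum(step, 1.0 / lipschitz)     # boundedness assumption
        x = x - step * m_hat
    return x

# Example: minimize the smooth nonconvex f(x) = sum(x**2 + 0.1*sin(5*x)),
# whose gradient 2*x + 0.5*cos(5*x) is Lipschitz with constant 4.5.
x_star = adam_bounded(lambda x: 2 * x + 0.5 * np.cos(5 * x), np.ones(3), lipschitz=4.5)
```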
Abstract:A new stochastic primal-dual algorithm for solving a composite optimization problem is proposed. It is assumed that all the functions and matrices used to define the optimization problem are given as statistical expectations. These expectations are unknown but revealed across time through i.i.d. realizations. This covers the case of convex optimization under stochastic linear constraints. The proposed algorithm is proven to converge to a saddle point of the Lagrangian function. In the framework of monotone operator theory, the convergence proof relies on recent results on the stochastic Forward-Backward algorithm involving random monotone operators.
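As a schematic sketch (iterate averaging and the paper's exact step-size policy are omitted), the loop below alternates a stochastic primal proximal-gradient step and a stochastic dual ascent step on the Lagrangian, using one i.i.d. realization per iteration; `sample` and `prox_g` are placeholders.

```python
import numpy as np

def stochastic_primal_dual(sample, prox_g, x0, lam0, gamma=1e-2, num_iters=10000):
    """Minimal stochastic primal-dual sketch for
        min_x  E[f(x, xi)] + g(x)   s.t.   E[A] x = E[b],
    where sample() returns one i.i.d. realization (grad_f, A, b) and
    prox_g(v, gamma) is the proximity operator of g."""
    x = np.asarray(x0, dtype=float).copy()
    lam = np.asarray(lam0, dtype=float).copy()
    for _ in range(num_iters):
        grad_f, A, b = sample()
        x = prox_g(x - gamma * (grad_f(x) + A.T @ lam), gamma)  # primal descent
        lam = lam + gamma * (A @ x - b)                          # dual ascent
    return x, lam
```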
Abstract:The Douglas-Rachford algorithm is an iterative method that converges to a minimizer of a sum of two convex functions. The algorithm consists of fixed-point iterations involving computations of the proximity operators of the two functions separately. This paper investigates a stochastic version of the algorithm where both functions are random and the step size is constant. We establish that the iterates of the algorithm stay close to the set of solutions with high probability when the step size is small enough. An application to structured regularization is considered.
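As a rough illustration, the sketch below applies the usual Douglas-Rachford recursion with a constant step size to a freshly sampled realization of the two random functions at each iteration; `sample_proxes` is a placeholder for the user-supplied random proximity operators.

```python
import numpy as np

def stochastic_douglas_rachford(sample_proxes, y0, gamma=0.1, num_iters=5000):
    """Stochastic Douglas-Rachford sketch with constant step size gamma.
    sample_proxes() returns (prox_f, prox_g), the proximity operators of one
    i.i.d. realization of the two random functions; each takes (y, gamma)."""
    y = np.asarray(y0, dtype=float).copy()
    x = y
    for _ in range(num_iters):
        prox_f, prox_g = sample_proxes()
        x = prox_f(y, gamma)                     # current iterate
        y = y + prox_g(2 * x - y, gamma) - x     # reflected fixed-point update
    return x
```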
Abstract:A regularized optimization problem over a large unstructured graph is studied, where the regularization term is tied to the graph geometry. Typical regularization examples include the total variation and the Laplacian regularization over the graph. When applying the proximal gradient algorithm to solve this problem, there exist computationally affordable methods to implement the proximity operator (backward step) in the special case where the graph is a simple path without loops. In this paper, an algorithm referred to as "Snake" is proposed to solve such regularized problems over general graphs by taking advantage of these fast methods. The algorithm consists in properly selecting random simple paths in the graph and performing the proximal gradient algorithm over these simple paths. This algorithm is an instance of a new general stochastic proximal gradient algorithm, whose convergence is proven. Applications to trend filtering and graph inpainting are provided, among others. Numerical experiments are conducted over large graphs.
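As a simplified illustration of the main idea (not the paper's exact algorithm), the sketch below performs one "Snake"-style iteration: draw a random simple path and take a proximal gradient step whose backward step is restricted to that path; `sample_path` and `prox_1d` (a fast one-dimensional solver, e.g. a taut-string method for total variation) are placeholders.

```python
import numpy as np

def snake_step(x, grad_f, sample_path, prox_1d, gamma=0.1):
    """One simplified 'Snake'-style iteration: a forward gradient step on the
    smooth term, followed by a backward (prox) step of the regularizer
    restricted to a randomly drawn simple path, where it reduces to a fast
    one-dimensional problem (e.g. 1-D total variation)."""
    path = sample_path()                  # ordered array of node indices
    z = x - gamma * grad_f(x)             # forward (gradient) step on all nodes
    z[path] = prox_1d(z[path], gamma)     # backward step along the path only
    return z
```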