Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Philippe Ciblat

Télécom Paris

How to craft a deep reinforcement learning policy for wind farm flow control

Jun 06, 2025

Elie Kadoche, Pascal Bianchi, Florence Carton, Philippe Ciblat, Damien Ernst

Abstract:Within wind farms, wake effects between turbines can significantly reduce overall energy production. Wind farm flow control encompasses methods designed to mitigate these effects through coordinated turbine control. Wake steering, for example, consists in intentionally misaligning certain turbines with the wind to optimize airflow and increase power output. However, designing a robust wake steering controller remains challenging, and existing machine learning approaches are limited to quasi-static wind conditions or small wind farms. This work presents a new deep reinforcement learning methodology to develop a wake steering policy that overcomes these limitations. Our approach introduces a novel architecture that combines graph attention networks and multi-head self-attention blocks, alongside a novel reward function and training strategy. The resulting model computes the yaw angles of each turbine, optimizing energy production in time-varying wind conditions. An empirical study conducted on steady-state, low-fidelity simulation, shows that our model requires approximately 10 times fewer training steps than a fully connected neural network and achieves more robust performance compared to a strong optimization baseline, increasing energy production by up to 14 %. To the best of our knowledge, this is the first deep reinforcement learning-based wake steering controller to generalize effectively across any time-varying wind conditions in a low-fidelity, steady-state numerical simulation setting.

Via

Access Paper or Ask Questions

Multi-Objective Decision Transformers for Offline Reinforcement Learning

Aug 31, 2023

Abdelghani Ghanem, Philippe Ciblat, Mounir Ghogho

Abstract:Offline Reinforcement Learning (RL) is structured to derive policies from static trajectory data without requiring real-time environment interactions. Recent studies have shown the feasibility of framing offline RL as a sequence modeling task, where the sole aim is to predict actions based on prior context using the transformer architecture. However, the limitation of this single task learning approach is its potential to undermine the transformer model's attention mechanism, which should ideally allocate varying attention weights across different tokens in the input context for optimal prediction. To address this, we reformulate offline RL as a multi-objective optimization problem, where the prediction is extended to states and returns. We also highlight a potential flaw in the trajectory representation used for sequence modeling, which could generate inaccuracies when modeling the state and return distributions. This is due to the non-smoothness of the action distribution within the trajectory dictated by the behavioral policy. To mitigate this issue, we introduce action space regions to the trajectory representation. Our experiments on D4RL benchmark locomotion tasks reveal that our propositions allow for more effective utilization of the attention mechanism in the transformer model, resulting in performance that either matches or outperforms current state-of-the art methods.

Via

Access Paper or Ask Questions

Neural network approaches to point lattice decoding

Dec 13, 2020

Vincent Corlay, Joseph J. Boutros, Philippe Ciblat, Loïc Brunel

Figure 1 for Neural network approaches to point lattice decoding

Figure 2 for Neural network approaches to point lattice decoding

Figure 3 for Neural network approaches to point lattice decoding

Figure 4 for Neural network approaches to point lattice decoding

Abstract:We characterize the complexity of the lattice decoding problem from a neural network perspective. The notion of Voronoi-reduced basis is introduced to restrict the space of solutions to a binary set. On the one hand, this problem is shown to be equivalent to computing a continuous piecewise linear (CPWL) function restricted to the fundamental parallelotope. On the other hand, it is known that any function computed by a ReLU feed-forward neural network is CPWL. As a result, we count the number of affine pieces in the CPWL decoding function to characterize the complexity of the decoding problem. It is exponential in the space dimension $n$, which induces shallow neural networks of exponential size. For structured lattices we show that folding, a technique equivalent to using a deep neural network, enables to reduce this complexity from exponential in $n$ to polynomial in $n$. Regarding unstructured MIMO lattices, in contrary to dense lattices many pieces in the CPWL decoding function can be neglected for quasi-optimal decoding on the Gaussian channel. This makes the decoding problem easier and it explains why shallow neural networks of reasonable size are more efficient with this category of lattices (in low to moderate dimensions).

* submitted to IEEE Transactions on Information Theory. arXiv admin note: text overlap with arXiv:1902.11294

Via

Access Paper or Ask Questions

Deep Reinforcement Learning for Wireless Scheduling with Multiclass Services

Nov 27, 2020

Apostolos Avranas, Marios Kountouris, Philippe Ciblat

Figure 1 for Deep Reinforcement Learning for Wireless Scheduling with Multiclass Services

Figure 2 for Deep Reinforcement Learning for Wireless Scheduling with Multiclass Services

Figure 3 for Deep Reinforcement Learning for Wireless Scheduling with Multiclass Services

Figure 4 for Deep Reinforcement Learning for Wireless Scheduling with Multiclass Services

Abstract:In this paper, we investigate the problem of scheduling and resource allocation over a time varying set of clients with heterogeneous demands.In this context, a service provider has to schedule traffic destined to users with different classes of requirements and to allocate bandwidth resources over time as a means to efficiently satisfy service demands within a limited time horizon. This is a highly intricate problem, in particular in wireless communication systems, and solutions may involve tools stemming from diverse fields, including combinatorics and constrained optimization. Although recent work has successfully proposed solutions based on Deep Reinforcement Learning (DRL), the challenging setting of heterogeneous user traffic and demands has not been addressed. We propose a deep deterministic policy gradient algorithm that combines state-of-the-art techniques, namely Distributional RL and Deep Sets, to train a model for heterogeneous traffic scheduling. We test on diverse scenarios with different time dependence dynamics, users' requirements, and resources available, demonstrating consistent results using both synthetic and real data. We evaluate the algorithm on a wireless communication setting using both synthetic and real data and show significant gains in terms of Quality of Service (QoS) defined by the classes, against state-of-the-art conventional algorithms from combinatorics, optimization and scheduling metric(e.g. Knapsack, Integer Linear Programming, Frank-Wolfe, Exponential Rule).

Via

Access Paper or Ask Questions

GraphCL: Contrastive Self-Supervised Learning of Graph Representations

Jul 15, 2020

Hakim Hafidi, Mounir Ghogho, Philippe Ciblat, Ananthram Swami

Figure 1 for GraphCL: Contrastive Self-Supervised Learning of Graph Representations

Figure 2 for GraphCL: Contrastive Self-Supervised Learning of Graph Representations

Figure 3 for GraphCL: Contrastive Self-Supervised Learning of Graph Representations

Abstract:We propose Graph Contrastive Learning (GraphCL), a general framework for learning node representations in a self supervised manner. GraphCL learns node embeddings by maximizing the similarity between the representations of two randomly perturbed versions of the intrinsic features and link structure of the same node's local subgraph. We use graph neural networks to produce two representations of the same node and leverage a contrastive learning loss to maximize agreement between them. In both transductive and inductive learning setups, we demonstrate that our approach significantly outperforms the state-of-the-art in unsupervised learning on a number of node classification benchmarks.

* Under review for Neurips 2020

Via

Access Paper or Ask Questions

A lattice-based approach to the expressivity of deep ReLU neural networks

Feb 28, 2019

Vincent Corlay, Joseph J. Boutros, Philippe Ciblat, Loic Brunel

Figure 1 for A lattice-based approach to the expressivity of deep ReLU neural networks

Figure 2 for A lattice-based approach to the expressivity of deep ReLU neural networks

Figure 3 for A lattice-based approach to the expressivity of deep ReLU neural networks

Figure 4 for A lattice-based approach to the expressivity of deep ReLU neural networks

Abstract:We present new families of continuous piecewise linear (CPWL) functions in Rn having a number of affine pieces growing exponentially in $n$. We show that these functions can be seen as the high-dimensional generalization of the triangle wave function used by Telgarsky in 2016. We prove that they can be computed by ReLU networks with quadratic depth and linear width in the space dimension. We also investigate the approximation error of one of these functions by shallower networks and prove a separation result. The main difference between our functions and other constructions is their practical interest: they arise in the scope of channel coding. Hence, computing such functions amounts to performing a decoding operation.

* arXiv admin note: text overlap with arXiv:1902.05146

Via

Access Paper or Ask Questions

On the CVP for the root lattices via folding with deep ReLU neural networks

Feb 28, 2019

Vincent Corlay, Joseph J. Boutros, Philippe Ciblat, Loic Brunel

Figure 1 for On the CVP for the root lattices via folding with deep ReLU neural networks

Figure 2 for On the CVP for the root lattices via folding with deep ReLU neural networks

Figure 3 for On the CVP for the root lattices via folding with deep ReLU neural networks

Figure 4 for On the CVP for the root lattices via folding with deep ReLU neural networks

Abstract:Point lattices and their decoding via neural networks are considered in this paper. Lattice decoding in Rn, known as the closest vector problem (CVP), becomes a classification problem in the fundamental parallelotope with a piecewise linear function defining the boundary. Theoretical results are obtained by studying root lattices. We show how the number of pieces in the boundary function reduces dramatically with folding, from exponential to linear. This translates into a two-layer ReLU network requiring a number of neurons growing exponentially in n to solve the CVP, whereas this complexity becomes polynomial in n for a deep ReLU network.

* Submitted to 2019 IEEE International Symposium on Information Theory (ISIT)

Via

Access Paper or Ask Questions