Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Benjamin Donnot

RTE

RL2Grid: Benchmarking Reinforcement Learning in Power Grid Operations

Mar 29, 2025

Enrico Marchesini, Benjamin Donnot, Constance Crozier, Ian Dytham, Christian Merz, Lars Schewe, Nico Westerbeck, Cathy Wu, Antoine Marot, Priya L. Donti

Abstract:Reinforcement learning (RL) can transform power grid operations by providing adaptive and scalable controllers essential for grid decarbonization. However, existing methods struggle with the complex dynamics, aleatoric uncertainty, long-horizon goals, and hard physical constraints that occur in real-world systems. This paper presents RL2Grid, a benchmark designed in collaboration with power system operators to accelerate progress in grid control and foster RL maturity. Built on a power simulation framework developed by RTE France, RL2Grid standardizes tasks, state and action spaces, and reward structures within a unified interface for a systematic evaluation and comparison of RL approaches. Moreover, we integrate real control heuristics and safety constraints informed by the operators' expertise to ensure RL2Grid aligns with grid operation requirements. We benchmark popular RL baselines on the grid control tasks represented within RL2Grid, establishing reference performance metrics. Our results and discussion highlight the challenges that power grids pose for RL methods, emphasizing the need for novel algorithms capable of handling real-world physical systems.

Via

Access Paper or Ask Questions

Reinforcement learning for Energies of the future and carbon neutrality: a Challenge Design

Jul 21, 2022

Gaëtan Serré, Eva Boguslawski, Benjamin Donnot, Adrien Pavão, Isabelle Guyon, Antoine Marot

Figure 1 for Reinforcement learning for Energies of the future and carbon neutrality: a Challenge Design

Figure 2 for Reinforcement learning for Energies of the future and carbon neutrality: a Challenge Design

Figure 3 for Reinforcement learning for Energies of the future and carbon neutrality: a Challenge Design

Figure 4 for Reinforcement learning for Energies of the future and carbon neutrality: a Challenge Design

Abstract:Current rapid changes in climate increase the urgency to change energy production and consumption management, to reduce carbon and other green-house gas production. In this context, the French electricity network management company RTE (R{\'e}seau de Transport d'{\'E}lectricit{\'e}) has recently published the results of an extensive study outlining various scenarios for tomorrow's French power management. We propose a challenge that will test the viability of such a scenario. The goal is to control electricity transportation in power networks, while pursuing multiple objectives: balancing production and consumption, minimizing energetic losses, and keeping people and equipment safe and particularly avoiding catastrophic failures. While the importance of the application provides a goal in itself, this challenge also aims to push the state-of-the-art in a branch of Artificial Intelligence (AI) called Reinforcement Learning (RL), which offers new possibilities to tackle control problems. In particular, various aspects of the combination of Deep Learning and RL called Deep Reinforcement Learning remain to be harnessed in this application domain. This challenge belongs to a series started in 2019 under the name "Learning to run a power network" (L2RPN). In this new edition, we introduce new more realistic scenarios proposed by RTE to reach carbon neutrality by 2050, retiring fossil fuel electricity production, increasing proportions of renewable and nuclear energy and introducing batteries. Furthermore, we provide a baseline using state-of-the-art reinforcement learning algorithm to stimulate the future participants.

* IEEE SSCI ADPRL, IEEE, Dec 2022, Singapour, Singapore

Via

Access Paper or Ask Questions

Learning to run a power network with trust

Oct 21, 2021

Antoine Marot, Benjamin Donnot, Karim Chaouache, Adrian Kelly, Qiuhua Huang, Ramij-Raja Hossain, Jochen L. Cremer

Figure 1 for Learning to run a power network with trust

Figure 2 for Learning to run a power network with trust

Figure 3 for Learning to run a power network with trust

Figure 4 for Learning to run a power network with trust

Abstract:Artificial agents are promising for realtime power system operations, particularly, to compute remedial actions for congestion management. Currently, these agents are limited to only autonomously run by themselves. However, autonomous agents will not be deployed any time soon. Operators will still be in charge of taking action in the future. Aiming at designing an assistant for operators, we here consider humans in the loop and propose an original formulation for this problem. We first advance an agent with the ability to send to the operator alarms ahead of time when the proposed actions are of low confidence. We further model the operator's available attention as a budget that decreases when alarms are sent. We present the design and results of our competition "Learning to run a power network with trust" in which we benchmark the ability of submitted agents to send relevant alarms while operating the network to their best.

Via

Access Paper or Ask Questions

Learning to run a Power Network Challenge: a Retrospective Analysis

Mar 02, 2021

Antoine Marot, Benjamin Donnot, Gabriel Dulac-Arnold, Adrian Kelly, Aïdan O'Sullivan, Jan Viebahn, Mariette Awad, Isabelle Guyon, Patrick Panciatici, Camilo Romero

Figure 1 for Learning to run a Power Network Challenge: a Retrospective Analysis

Figure 2 for Learning to run a Power Network Challenge: a Retrospective Analysis

Figure 3 for Learning to run a Power Network Challenge: a Retrospective Analysis

Figure 4 for Learning to run a Power Network Challenge: a Retrospective Analysis

Abstract:Power networks, responsible for transporting electricity across large geographical regions, are complex infrastructures on which modern life critically depend. Variations in demand and production profiles, with increasing renewable energy integration, as well as the high voltage network technology, constitute a real challenge for human operators when optimizing electricity transportation while avoiding blackouts. Motivated to investigate the potential of Artificial Intelligence methods in enabling adaptability in power network operation, we have designed a L2RPN challenge to encourage the development of reinforcement learning solutions to key problems present in the next-generation power networks. The NeurIPS 2020 competition was well received by the international community attracting over 300 participants worldwide. The main contribution of this challenge is our proposed comprehensive Grid2Op framework, and associated benchmark, which plays realistic sequential network operations scenarios. The framework is open-sourced and easily re-usable to define new environments with its companion GridAlive ecosystem. It relies on existing non-linear physical simulators and let us create a series of perturbations and challenges that are representative of two important problems: a) the uncertainty resulting from the increased use of unpredictable renewable energy sources, and b) the robustness required with contingent line disconnections. In this paper, we provide details about the competition highlights. We present the benchmark suite and analyse the winning solutions of the challenge, observing one super-human performance demonstration by the best agent. We propose our organizational insights for a successful competition and conclude on open research avenues. We expect our work will foster research to create more sustainable solutions for power network operations.

Via

Access Paper or Ask Questions

Adversarial Training for a Continuous Robustness Control Problem in Power Systems

Jan 14, 2021

Loïc Omnes, Antoine Marot, Benjamin Donnot

Figure 1 for Adversarial Training for a Continuous Robustness Control Problem in Power Systems

Figure 2 for Adversarial Training for a Continuous Robustness Control Problem in Power Systems

Figure 3 for Adversarial Training for a Continuous Robustness Control Problem in Power Systems

Figure 4 for Adversarial Training for a Continuous Robustness Control Problem in Power Systems

Abstract:We propose a new adversarial training approach for injecting robustness when designing controllers for upcoming cyber-physical power systems. Previous approaches relying deeply on simulations are not able to cope with the rising complexity and are too costly when used online in terms of computation budget. In comparison, our method proves to be computationally efficient online while displaying useful robustness properties. To do so we model an adversarial framework, propose the implementation of a fixed opponent policy and test it on a L2RPN (Learning to Run a Power Network) environment. That environment is a synthetic but realistic modeling of a cyber-physical system accounting for one third of the IEEE 118 grid. Using adversarial testing, we analyze the results of submitted trained agents from the robustness track of the L2RPN competition. We then further assess the performance of those agents in regards to the continuous N-1 problem through tailored evaluation metrics. We discover that some agents trained in an adversarial way demonstrate interesting preventive behaviors in that regard, which we discuss.

Via

Access Paper or Ask Questions

Towards an AI assistant for human grid operators

Dec 03, 2020

Antoine Marot, Alexandre Rozier, Matthieu Dussartre, Laure Crochepierre, Benjamin Donnot

Figure 1 for Towards an AI assistant for human grid operators

Figure 2 for Towards an AI assistant for human grid operators

Figure 3 for Towards an AI assistant for human grid operators

Abstract:Power systems are becoming more complex to operate in the digital age. As a result, real-time decision-making is getting more challenging as the human operator has to deal with more information, more uncertainty, more applications and more coordination. While supervision has been primarily used to help them make decisions over the last decades, it cannot reasonably scale up anymore. There is a great need for rethinking the human-machine interface under more unified and interactive frameworks. Taking advantage of the latest developments in Human-machine Interactions and Artificial intelligence, we share the vision of a new assistant framework relying on an hypervision interface and greater bidirectional interactions. We review the known principles of decision-making that drives the assistant design and supporting assistance functions we present. We finally share some guidelines to make progress towards the development of such an assistant.

Via

Access Paper or Ask Questions

Exploring grid topology reconfiguration using a simple deep reinforcement learning approach

Nov 26, 2020

Medha Subramanian, Jan Viebahn, Simon Tindemans, Benjamin Donnot, Antoine Marot

Figure 1 for Exploring grid topology reconfiguration using a simple deep reinforcement learning approach

Figure 2 for Exploring grid topology reconfiguration using a simple deep reinforcement learning approach

Figure 3 for Exploring grid topology reconfiguration using a simple deep reinforcement learning approach

Figure 4 for Exploring grid topology reconfiguration using a simple deep reinforcement learning approach

Abstract:System operators are faced with increasingly volatile operating conditions. In order to manage system reliability in a cost-effective manner, control room operators are turning to computerised decision support tools based on AI and machine learning. Specifically, Reinforcement Learning (RL) is a promising technique to train agents that suggest grid control actions to operators. In this paper, a simple baseline approach is presented using RL to represent an artificial control room operator that can operate a IEEE 14-bus test case for a duration of 1 week. This agent takes topological switching actions to control power flows on the grid, and is trained on only a single well-chosen scenario. The behaviour of this agent is tested on different time-series of generation and demand, demonstrating its ability to operate the grid successfully in 965 out of 1000 scenarios. The type and variability of topologies suggested by the agent are analysed across the test scenarios, demonstrating efficient and diverse agent behaviour.

Via

Access Paper or Ask Questions

Learning to run a power network challenge for training topology controllers

Dec 05, 2019

Antoine Marot, Benjamin Donnot, Camilo Romero, Luca Veyrin-Forrer, Marvin Lerousseau, Balthazar Donon, Isabelle Guyon

Figure 1 for Learning to run a power network challenge for training topology controllers

Figure 2 for Learning to run a power network challenge for training topology controllers

Figure 3 for Learning to run a power network challenge for training topology controllers

Figure 4 for Learning to run a power network challenge for training topology controllers

Abstract:For power grid operations, a large body of research focuses on using generation redispatching, load shedding or demand side management flexibilities. However, a less costly and potentially more flexible option would be grid topology reconfiguration, as already partially exploited by Coreso (European RSC) and RTE (French TSO) operations. Beyond previous work on branch switching, bus reconfigurations are a broader class of action and could provide some substantial benefits to route electricity and optimize the grid capacity to keep it within safety margins. Because of its non-linear and combinatorial nature, no existing optimal power flow solver can yet tackle this problem. We here propose a new framework to learn topology controllers through imitation and reinforcement learning. We present the design and the results of the first "Learning to Run a Power Network" challenge released with this framework. We finally develop a method providing performance upper-bounds (oracle), which highlights remaining unsolved challenges and suggests future directions of improvement.

Via

Access Paper or Ask Questions

LEAP nets for power grid perturbations

Aug 22, 2019

Benjamin Donnot, Balthazar Donon, Isabelle Guyon, Zhengying Liu, Antoine Marot, Patrick Panciatici, Marc Schoenauer

Figure 1 for LEAP nets for power grid perturbations

Figure 2 for LEAP nets for power grid perturbations

Figure 3 for LEAP nets for power grid perturbations

Abstract:We propose a novel neural network embedding approach to model power transmission grids, in which high voltage lines are disconnected and reconnected with one-another from time to time, either accidentally or willfully. We call our architeture LEAP net, for Latent Encoding of Atypical Perturbation. Our method implements a form of transfer learning, permitting to train on a few source domains, then generalize to new target domains, without learning on any example of that domain. We evaluate the viability of this technique to rapidly assess cu-rative actions that human operators take in emergency situations, using real historical data, from the French high voltage power grid.

* ESANN, Apr 2019, Bruges, Belgium

Via

Access Paper or Ask Questions

Optimization of computational budget for power system risk assessment

May 03, 2018

Benjamin Donnot, Isabelle Guyon, Antoine Marot, Marc Schoenauer, Patrick Panciatici

Figure 1 for Optimization of computational budget for power system risk assessment

Figure 2 for Optimization of computational budget for power system risk assessment

Abstract:We address the problem of maintaining high voltage power transmission networks in security at all time, namely anticipating exceeding of thermal limit for eventual single line disconnection (whatever its cause may be) by running slow, but accurate, physical grid simulators. New conceptual frameworks are calling for a probabilistic risk-based security criterion. However, these approaches suffer from high requirements in terms of tractability. Here, we propose a new method to assess the risk. This method uses both machine learning techniques (artificial neural networks) and more standard simulators based on physical laws. More specifically we train neural networks to estimate the overall dangerousness of a grid state. A classical benchmark problem (manpower 118 buses test case) is used to show the strengths of the proposed method.

* 2018 IEEE PES Innovative Smart Grid Technologies Conference Europe, Oct 2018, Sarajevo, Bosnia and Herzegovina. 2018

Via

Access Paper or Ask Questions