Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Multi-objective evolution for Generalizable Policy Gradient Algorithms

Apr 08, 2022

Juan Jose Garau-Luis, Yingjie Miao, John D. Co-Reyes, Aaron Parisi, Jie Tan, Esteban Real, Aleksandra Faust

Figure 1 for Multi-objective evolution for Generalizable Policy Gradient Algorithms

Figure 2 for Multi-objective evolution for Generalizable Policy Gradient Algorithms

Figure 3 for Multi-objective evolution for Generalizable Policy Gradient Algorithms

Figure 4 for Multi-objective evolution for Generalizable Policy Gradient Algorithms

Share this with someone who'll enjoy it:

Abstract:Performance, generalizability, and stability are three Reinforcement Learning (RL) challenges relevant to many practical applications in which they present themselves in combination. Still, state-of-the-art RL algorithms fall short when addressing multiple RL objectives simultaneously and current human-driven design practices might not be well-suited for multi-objective RL. In this paper we present MetaPG, an evolutionary method that discovers new RL algorithms represented as graphs, following a multi-objective search criteria in which different RL objectives are encoded in separate fitness scores. Our findings show that, when using a graph-based implementation of Soft Actor-Critic (SAC) to initialize the population, our method is able to find new algorithms that improve upon SAC's performance and generalizability by 3% and 17%, respectively, and reduce instability up to 65%. In addition, we analyze the graph structure of the best algorithms in the population and offer an interpretation of specific elements that help trading performance for generalizability and vice versa. We validate our findings in three different continuous control tasks: RWRL Cartpole, RWRL Walker, and Gym Pendulum.

* 23 pages, 12 figures, 10 tables

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Multi-objective evolution for Generalizable Policy Gradient Algorithms

Paper and Code