Title: A Multifidelity Sim-to-Real Pipeline for Verifiable and Compositional Reinforcement Learning

Dec 02, 2023

Cyrus Neary, Christian Ellis, Aryaman Singh Samyal, Craig Lennon, Ufuk Topcu

Abstract: We propose and demonstrate a compositional framework for training and verifying reinforcement learning (RL) systems within a multifidelity sim-to-real pipeline, in order to deploy reliable and adaptable RL policies on physical hardware. By decomposing complex robotic tasks into component subtasks and defining mathematical interfaces between them, the framework allows the corresponding subtask policies to be trained and tested independently, while simultaneously providing guarantees on the overall behavior that results from their composition. By verifying the performance of these subtask policies using a multifidelity simulation pipeline, the framework allows not only for efficient RL training, but also for refinement of the subtasks and their interfaces in response to challenges arising from discrepancies between simulation and reality. In an experimental case study, we apply the framework to train and deploy a compositional RL system that successfully pilots a Warthog unmanned ground robot.
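
The idea of deriving a guarantee on the composite behavior from per-subtask guarantees can be illustrated with a small sketch. The abstract does not specify the form of the interfaces, so the following is only an assumption made for illustration: subtasks run in sequence, each subtask's exit condition satisfies the next subtask's entry condition, and each subtask policy has been verified to succeed with at least some probability from any allowed entry state. Under those assumptions the product of the per-subtask probabilities is a lower bound on overall task success. The function names and the numbers are hypothetical and are not taken from the paper.

```python
from math import prod


def composed_success_lower_bound(subtask_success_probs):
    """Lower bound on overall success for subtasks executed in sequence.

    Assumption (not from the paper): each value is a verified lower bound on
    the probability that one subtask policy reaches its exit condition from
    any allowed entry state, and exit conditions chain into the next entry.
    """
    probs = list(subtask_success_probs)
    for p in probs:
        if not 0.0 <= p <= 1.0:
            raise ValueError(f"not a probability: {p}")
    return prod(probs)


def meets_requirement(subtask_success_probs, required_success_prob):
    """Check whether the composed lower bound satisfies a task-level requirement."""
    return composed_success_lower_bound(subtask_success_probs) >= required_success_prob


def weakest_subtask(subtask_success_probs):
    """Index of the subtask with the lowest bound, a candidate for refinement."""
    probs = list(subtask_success_probs)
    return min(range(len(probs)), key=probs.__getitem__)


if __name__ == "__main__":
    # Hypothetical per-subtask bounds, e.g. estimated in simulation.
    bounds = [0.99, 0.95, 0.98]
    print(composed_success_lower_bound(bounds))
    print(meets_requirement(bounds, 0.9))
    print(weakest_subtask(bounds))
```

In this simplified picture, a subtask whose measured performance drops at a higher simulation fidelity or on hardware would lower its bound, and the check above would flag whether that subtask or its interface needs to be revisited.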