Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use

Oct 02, 2023

Lirui Wang, Kaiqing Zhang, Allan Zhou, Max Simchowitz, Russ Tedrake

Figure 1 for Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use

Figure 2 for Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use

Figure 3 for Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use

Figure 4 for Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use

Share this with someone who'll enjoy it:

Abstract:Fleets of robots ingest massive amounts of streaming data generated by interacting with their environments, far more than those that can be stored or transmitted with ease. At the same time, we hope that teams of robots can co-acquire diverse skills through their experiences in varied settings. How can we enable such fleet-level learning without having to transmit or centralize fleet-scale data? In this paper, we investigate distributed learning of policies as a potential solution. To efficiently merge policies in the distributed setting, we propose fleet-merge, an instantiation of distributed learning that accounts for the symmetries that can arise in learning policies that are parameterized by recurrent neural networks. We show that fleet-merge consolidates the behavior of policies trained on 50 tasks in the Meta-World environment, with the merged policy achieving good performance on nearly all training tasks at test time. Moreover, we introduce a novel robotic tool-use benchmark, fleet-tools, for fleet policy learning in compositional and contact-rich robot manipulation tasks, which might be of broader interest, and validate the efficacy of fleet-merge on the benchmark.

* See the code https://github.com/liruiw/Fleet-Tools for more details

View paper on

Share this with someone who'll enjoy it:

Title:Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use

Paper and Code