Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sashwat Mahalingam

SpawnNet: Learning Generalizable Visuomotor Skills from Pre-trained Networks

Jul 07, 2023

Xingyu Lin, John So, Sashwat Mahalingam, Fangchen Liu, Pieter Abbeel

Figure 1 for SpawnNet: Learning Generalizable Visuomotor Skills from Pre-trained Networks

Figure 2 for SpawnNet: Learning Generalizable Visuomotor Skills from Pre-trained Networks

Figure 3 for SpawnNet: Learning Generalizable Visuomotor Skills from Pre-trained Networks

Figure 4 for SpawnNet: Learning Generalizable Visuomotor Skills from Pre-trained Networks

Abstract:The existing internet-scale image and video datasets cover a wide range of everyday objects and tasks, bringing the potential of learning policies that have broad generalization. Prior works have explored visual pre-training with different self-supervised objectives, but the generalization capabilities of the learned policies remain relatively unknown. In this work, we take the first step towards this challenge, focusing on how pre-trained representations can help the generalization of the learned policies. We first identify the key bottleneck in using a frozen pre-trained visual backbone for policy learning. We then propose SpawnNet, a novel two-stream architecture that learns to fuse pre-trained multi-layer representations into a separate network to learn a robust policy. Through extensive simulated and real experiments, we demonstrate significantly better categorical generalization compared to prior approaches in imitation learning settings.

Via

Access Paper or Ask Questions