Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:VIOLA: Imitation Learning for Vision-Based Manipulation with Object Proposal Priors

Oct 20, 2022

Yifeng Zhu, Abhishek Joshi, Peter Stone, Yuke Zhu

Figure 1 for VIOLA: Imitation Learning for Vision-Based Manipulation with Object Proposal Priors

Figure 2 for VIOLA: Imitation Learning for Vision-Based Manipulation with Object Proposal Priors

Figure 3 for VIOLA: Imitation Learning for Vision-Based Manipulation with Object Proposal Priors

Figure 4 for VIOLA: Imitation Learning for Vision-Based Manipulation with Object Proposal Priors

Share this with someone who'll enjoy it:

Abstract:We introduce VIOLA, an object-centric imitation learning approach to learning closed-loop visuomotor policies for robot manipulation. Our approach constructs object-centric representations based on general object proposals from a pre-trained vision model. VIOLA uses a transformer-based policy to reason over these representations and attend to the task-relevant visual factors for action prediction. Such object-based structural priors improve deep imitation learning algorithm's robustness against object variations and environmental perturbations. We quantitatively evaluate VIOLA in simulation and on real robots. VIOLA outperforms the state-of-the-art imitation learning methods by $45.8\%$ in success rate. It has also been deployed successfully on a physical robot to solve challenging long-horizon tasks, such as dining table arrangement and coffee making. More videos and model details can be found in supplementary material and the project website: https://ut-austin-rpl.github.io/VIOLA .

* To be published at the 6th Conference on Robot Learning

View paper on

Share this with someone who'll enjoy it:

Title:VIOLA: Imitation Learning for Vision-Based Manipulation with Object Proposal Priors

Paper and Code