Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation

Sep 27, 2023

Shizhe Chen, Ricardo Garcia, Cordelia Schmid, Ivan Laptev

Figure 1 for PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation

Figure 2 for PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation

Figure 3 for PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation

Figure 4 for PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation

Share this with someone who'll enjoy it:

Abstract:The ability for robots to comprehend and execute manipulation tasks based on natural language instructions is a long-term goal in robotics. The dominant approaches for language-guided manipulation use 2D image representations, which face difficulties in combining multi-view cameras and inferring precise 3D positions and relationships. To address these limitations, we propose a 3D point cloud based policy called PolarNet for language-guided manipulation. It leverages carefully designed point cloud inputs, efficient point cloud encoders, and multimodal transformers to learn 3D point cloud representations and integrate them with language instructions for action prediction. PolarNet is shown to be effective and data efficient in a variety of experiments conducted on the RLBench benchmark. It outperforms state-of-the-art 2D and 3D approaches in both single-task and multi-task learning. It also achieves promising results on a real robot.

* Accepted to CoRL 2023. Project website: https://www.di.ens.fr/willow/research/polarnet/

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation

Paper and Code