Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

William D. Richards

Depth by Poking: Learning to Estimate Depth from Self-Supervised Grasping

Jun 16, 2020

Ben Goodrich, Alex Kuefler, William D. Richards

Figure 1 for Depth by Poking: Learning to Estimate Depth from Self-Supervised Grasping

Figure 2 for Depth by Poking: Learning to Estimate Depth from Self-Supervised Grasping

Figure 3 for Depth by Poking: Learning to Estimate Depth from Self-Supervised Grasping

Figure 4 for Depth by Poking: Learning to Estimate Depth from Self-Supervised Grasping

Abstract:Accurate depth estimation remains an open problem for robotic manipulation; even state of the art techniques including structured light and LiDAR sensors fail on reflective or transparent surfaces. We address this problem by training a neural network model to estimate depth from RGB-D images, using labels from physical interactions between a robot and its environment. Our network predicts, for each pixel in an input image, the z position that a robot's end effector would reach if it attempted to grasp or poke at the corresponding position. Given an autonomous grasping policy, our approach is self-supervised as end effector position labels can be recovered through forward kinematics, without human annotation. Although gathering such physical interaction data is expensive, it is necessary for training and routine operation of state of the art manipulation systems. Therefore, this depth estimator comes ``for free'' while collecting data for other tasks (e.g., grasping, pushing, placing). We show our approach achieves significantly lower root mean squared error than traditional structured light sensors and unsupervised deep learning methods on difficult, industry-scale jumbled bin datasets.

* IEEE International Conference on Robotics and Automation (ICRA) 2020

Via

Access Paper or Ask Questions