Title: ForceSight: Text-Guided Mobile Manipulation with Visual-Force Goals

Sep 24, 2023

Jeremy A. Collins, Cody Houff, You Liang Tan, Charles C. Kemp

Abstract: We present ForceSight, a system for text-guided mobile manipulation that predicts visual-force goals using a deep neural network. Given a single RGBD image combined with a text prompt, ForceSight determines a target end-effector pose in the camera frame (kinematic goal) and the associated forces (force goal). Together, these two components form a visual-force goal. Prior work has demonstrated that deep models outputting human-interpretable kinematic goals can enable dexterous manipulation by real robots. Forces are critical to manipulation, yet have typically been relegated to lower-level execution in these systems. When deployed on a mobile manipulator equipped with an eye-in-hand RGBD camera, ForceSight performed tasks such as precision grasps, drawer opening, and object handovers with an 81% success rate in unseen environments with object instances that differed significantly from the training data. In a separate experiment, relying exclusively on visual servoing and ignoring force goals dropped the success rate from 90% to 45%, demonstrating that force goals can significantly enhance performance. The appendix, videos, code, and trained models are available at https://force-sight.github.io/.
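To make the idea of a visual-force goal concrete, here is a minimal Python sketch of how a kinematic goal and a force goal might be paired and checked together. It is an illustration only, not the authors' implementation. The names `VisualForceGoal` and `goal_reached`, the tolerance values, and the reduction of the end-effector pose to a 3D position are all assumptions made for the example.

```python
import math
from dataclasses import dataclass
from typing import Tuple

Vec3 = Tuple[float, float, float]


@dataclass(frozen=True)
class VisualForceGoal:
    """Hypothetical container pairing a kinematic goal with a force goal."""

    # Kinematic goal: target end-effector position in the camera frame (metres).
    # Assumption: orientation is omitted to keep the sketch short.
    position: Vec3
    # Force goal: target force at the end effector (newtons).
    force: Vec3


def _distance(a: Vec3, b: Vec3) -> float:
    """Euclidean distance between two 3-vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def goal_reached(
    goal: VisualForceGoal,
    position: Vec3,
    force: Vec3,
    pos_tol: float = 0.01,    # assumed tolerance, metres
    force_tol: float = 1.0,   # assumed tolerance, newtons
    use_force: bool = True,
) -> bool:
    """Return True when the measured state satisfies the goal.

    With use_force=False only the kinematic goal is checked, which mimics
    a vision-only controller that ignores the force goal.
    """
    kinematic_ok = _distance(goal.position, position) <= pos_tol
    if not use_force:
        return kinematic_ok
    return kinematic_ok and _distance(goal.force, force) <= force_tol


# Example: the gripper is at the target position but is not yet pushing.
goal = VisualForceGoal(position=(0.0, 0.0, 0.3), force=(0.0, 0.0, 5.0))
vision_only_done = goal_reached(goal, (0.0, 0.0, 0.3), (0.0, 0.0, 0.0), use_force=False)
visual_force_done = goal_reached(goal, (0.0, 0.0, 0.3), (0.0, 0.0, 0.0))
```

In this toy case the vision-only check reports success as soon as the position matches, while the combined check keeps the goal open until the measured force is also close to the target. That is one plausible reading of why ignoring force goals could hurt task success, though the abstract does not describe the controller at this level of detail.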