Abstract: Reinforcement Learning (RL) has the potential to enable robots to learn from their own actions in the real world. Unfortunately, RL can be prohibitively expensive, in terms of on-robot runtime, due to inefficient exploration when learning from a sparse reward signal, and designing dense reward functions by hand is labour-intensive and requires domain expertise. In our work, we propose GCR (Goal-Contrastive Rewards), a dense reward function learning method that can be trained on passive video demonstrations. By using videos without actions, our method is easier to scale, as we can use arbitrary videos. GCR combines two loss functions: an implicit value loss that models how the reward increases when traversing a successful trajectory, and a goal-contrastive loss that discriminates between successful and failed trajectories. We perform experiments in simulated manipulation environments across RoboMimic and MimicGen tasks, as well as in the real world using a Franka arm and a Spot quadruped. We find that GCR leads to more sample-efficient RL, enabling model-free RL to solve about twice as many tasks as our baseline reward learning methods. We also demonstrate positive cross-embodiment transfer from videos of people and of other robots performing a task. Appendix: \url{https://tinyurl.com/gcr-appendix-2}.
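To make the two-part objective concrete, below is a minimal PyTorch sketch of how an implicit value loss and a goal-contrastive loss could be combined; the function names, the margin-based contrastive form, and the `reward_net` interface are illustrative assumptions rather than the GCR implementation.

```python
# Illustrative sketch (PyTorch): combining an implicit value loss with a
# goal-contrastive loss to train a dense reward model on passive videos.
# `reward_net`, the margin, and the exact loss forms are assumptions.
import torch.nn.functional as F

def implicit_value_loss(reward_net, success_frames):
    """Encourage predicted reward to increase along a successful trajectory.

    success_frames: (T, ...) observations ordered in time.
    """
    r = reward_net(success_frames).squeeze(-1)      # (T,) predicted rewards
    return F.relu(r[:-1] - r[1:]).mean()            # penalize any decrease over time

def goal_contrastive_loss(reward_net, success_goal, failure_goal, margin=1.0):
    """Final frames of successful trajectories should score higher than
    final frames of failed trajectories by at least `margin`."""
    r_pos = reward_net(success_goal).squeeze(-1)    # (B,)
    r_neg = reward_net(failure_goal).squeeze(-1)    # (B,)
    return F.relu(margin - (r_pos - r_neg)).mean()

def combined_loss(reward_net, success_frames, success_goal, failure_goal, lam=1.0):
    return (implicit_value_loss(reward_net, success_frames)
            + lam * goal_contrastive_loss(reward_net, success_goal, failure_goal))
```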
Abstract: We formulate grasp learning as a neural field and present Neural Grasp Distance Fields (NGDF). The input is the 6D pose of a robot end effector, and the output is the distance to a continuous manifold of valid grasps for an object. In contrast to current approaches that predict a set of discrete candidate grasps, the distance-based NGDF representation is easily interpreted as a cost, and minimizing this cost produces a successful grasp pose. This grasp distance cost can be incorporated directly into a trajectory optimizer for joint optimization with other costs such as trajectory smoothness and collision avoidance. During optimization, as the various costs are balanced and minimized, the grasp target is allowed to vary smoothly, since the learned grasp field is continuous. In simulation benchmarks with a Franka arm, we find that joint grasping and planning with NGDF outperforms baselines by 63% in execution success while generalizing to unseen query poses and unseen object shapes. Project page: https://sites.google.com/view/neural-grasp-distance-fields.
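As an illustration of treating the learned distance as a differentiable cost, the sketch below refines a single end-effector pose by gradient descent; the `ngdf` callable, the pose parameterization, and the use of Adam are assumptions, and the actual system jointly optimizes full arm trajectories together with smoothness and collision costs.

```python
# Illustrative sketch: refining an end-effector pose against a learned grasp
# distance field by gradient descent. `ngdf` is an assumed callable mapping a
# pose tensor (e.g., position + quaternion) to a scalar distance.
import torch

def refine_grasp_pose(ngdf, init_pose, steps=200, lr=1e-2):
    pose = init_pose.clone().detach().requires_grad_(True)
    optimizer = torch.optim.Adam([pose], lr=lr)
    for _ in range(steps):
        optimizer.zero_grad()
        cost = ngdf(pose)          # distance to the grasp manifold; could be
                                   # summed with smoothness / collision costs
        cost.backward()
        optimizer.step()
    return pose.detach()
```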
Abstract: Robotic manipulation of cloth has applications ranging from fabrics manufacturing to handling blankets and laundry. Cloth manipulation is challenging for robots largely due to the cloth's high degrees of freedom, complex dynamics, and severe self-occlusions when in folded or crumpled configurations. Prior work on robotic manipulation of cloth relies primarily on vision sensors alone, which may pose challenges for fine-grained manipulation tasks such as grasping a desired number of cloth layers from a stack. In this paper, we propose to use tactile sensing for cloth manipulation: we attach a tactile sensor (ReSkin) to one of the two fingertips of a Franka robot and train a classifier to determine whether the robot is grasping a specific number of cloth layers. During test-time experiments, the robot uses this classifier as part of its policy to grasp one or two cloth layers, using tactile feedback to determine suitable grasping points. Experimental results over 180 physical trials suggest that the proposed method outperforms baselines that do not use tactile feedback and generalizes better to unseen cloth than methods that use image classifiers. Code, data, and videos are available at https://sites.google.com/view/reskin-cloth.
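The sketch below shows one plausible form of such a tactile layer-count classifier, assuming each ReSkin reading is flattened into a fixed-length feature vector; the MLP architecture, input dimensionality, and three-way label set (0, 1, or 2 layers) are illustrative assumptions.

```python
# Illustrative sketch: a small classifier over flattened ReSkin readings that
# predicts how many cloth layers are grasped. Input dimensionality, MLP depth,
# and the three-way label set {0, 1, 2} are assumptions.
import torch.nn as nn

class LayerCountClassifier(nn.Module):
    def __init__(self, input_dim=15, num_classes=3):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(input_dim, 64), nn.ReLU(),
            nn.Linear(64, 64), nn.ReLU(),
            nn.Linear(64, num_classes),   # logits over number of grasped layers
        )

    def forward(self, tactile_reading):
        return self.net(tactile_reading)

# At test time, the policy can adjust the grasp point and re-grasp until
# classifier(reading).argmax(-1) matches the desired layer count.
```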
Abstract: We address the problem of goal-directed cloth manipulation, a challenging task due to the deformability of cloth. Our insight is that optical flow, a technique normally used for motion estimation in video, can also provide an effective representation for corresponding cloth poses across observation and goal images. We introduce FabricFlowNet (FFN), a cloth manipulation policy that leverages flow both as an input and as an action representation to improve performance. FabricFlowNet also elegantly switches between bimanual and single-arm actions based on the desired goal. We show that FabricFlowNet significantly outperforms state-of-the-art model-free and model-based cloth manipulation policies that take image input. We also present real-world experiments on a bimanual system, demonstrating effective sim-to-real transfer. Finally, we show that our method, trained only on a single square cloth, generalizes to other cloth shapes such as T-shirts and rectangular cloths. Video and other supplementary materials are available at: https://sites.google.com/view/fabricflownet.
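To illustrate what using flow as an action representation can look like, the sketch below converts a predicted flow field into a single pick-and-place action; selecting the pick point as the cloth pixel with the largest predicted displacement is a simplifying assumption, not the FabricFlowNet pick policy.

```python
# Illustrative sketch: converting a predicted flow field into one
# pick-and-place action. Flow is assumed to be (dx, dy) per pixel, with
# pixels indexed (row, col); the pick-point heuristic is a simplification.
import numpy as np

def flow_to_pick_and_place(flow, cloth_mask):
    """flow: (H, W, 2) displacement from observation to goal image.
    cloth_mask: (H, W) boolean mask of cloth pixels."""
    magnitude = np.linalg.norm(flow, axis=-1)
    magnitude[~cloth_mask] = 0.0
    pick = np.unravel_index(np.argmax(magnitude), magnitude.shape)  # (row, col)
    dx, dy = flow[pick]
    place = (pick[0] + dy, pick[1] + dx)       # move the grasped point along the flow
    return pick, place
```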
Abstract: Cloth detection and manipulation are common tasks in domestic and industrial settings, yet they remain a challenge for robots due to cloth deformability. Furthermore, in many cloth-related tasks like laundry folding and bed making, it is crucial to manipulate specific regions like edges and corners, as opposed to folds. In this work, we focus on the problem of segmenting and grasping these key regions. Our approach trains a network to segment the edges and corners of a cloth from a depth image, distinguishing such regions from wrinkles or folds. We also provide a novel algorithm for estimating the grasp location, direction, and directional uncertainty from the segmentation. We demonstrate our method on a real robot system and show that it outperforms baseline methods on grasping success. Video and other supplementary materials are available at: https://sites.google.com/view/cloth-segmentation.
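A minimal sketch of going from a predicted edge/corner mask to a grasp point and approach direction is shown below; choosing a random edge pixel and pointing the approach toward the cloth centroid are illustrative assumptions, and the paper's directional-uncertainty estimate is omitted.

```python
# Illustrative sketch: picking a grasp point from a predicted edge/corner mask
# and aiming the approach direction toward the cloth centroid. The random
# pixel choice and centroid heuristic are assumptions.
import numpy as np

def grasp_from_mask(edge_corner_mask, cloth_mask):
    ys, xs = np.nonzero(edge_corner_mask)
    i = np.random.randint(len(ys))                     # one segmented edge/corner pixel
    grasp_px = np.array([ys[i], xs[i]], dtype=float)
    centroid = np.argwhere(cloth_mask).mean(axis=0)    # center of the cloth region
    direction = centroid - grasp_px                    # approach from the boundary inward
    direction /= np.linalg.norm(direction) + 1e-8
    return grasp_px, direction
```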
Abstract: State-of-the-art object grasping methods rely on depth sensing to plan robust grasps, but commercially available depth sensors fail to detect transparent and specular objects. To improve grasping performance on such objects, we introduce a method for learning a multi-modal perception model by bootstrapping from an existing uni-modal model. This transfer learning approach requires only a pre-existing uni-modal grasping model and paired multi-modal image data for training, forgoing the need for ground-truth grasp success labels or real grasp attempts. Our experiments demonstrate that our approach is able to reliably grasp transparent and reflective objects. Video and supplementary material are available at https://sites.google.com/view/transparent-specular-grasping.
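The bootstrapping idea can be illustrated with a short distillation-style training step, where the multi-modal model is supervised by the outputs of the existing depth-only model on paired images; the model interfaces and the MSE objective below are assumptions, not the authors' training procedure.

```python
# Illustrative sketch: training a multi-modal (RGB + depth) grasp model by
# distilling predictions of an existing depth-only model on paired images,
# so no grasp-success labels or physical grasp attempts are required.
import torch
import torch.nn.functional as F

def bootstrap_step(multimodal_net, depth_only_net, rgb, depth, optimizer):
    with torch.no_grad():
        target = depth_only_net(depth)        # pseudo-labels from the uni-modal teacher
    pred = multimodal_net(rgb, depth)         # student sees both modalities
    loss = F.mse_loss(pred, target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```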