Abstract: Jointly estimating hand and object shape is key to a successful robot grasp in human-to-robot handovers. However, methods that rely on hand-crafted prior knowledge about the geometric structure of the object fail to generalise to unseen objects, and depth sensors cannot detect transparent objects such as drinking glasses. In this work, we propose a stereo-based method for hand-object reconstruction that probabilistically combines single-view reconstructions into a coherent stereo reconstruction. We learn 3D shape priors from a large synthetic hand-object dataset so that our method generalises, and we use RGB inputs instead of depth, as RGB better captures transparent objects. Our method achieves a lower object Chamfer distance than existing RGB-based hand-object reconstruction methods in both single-view and stereo settings. We process the reconstructed hand-object shape with a projection-based outlier removal step and use the output to guide a human-to-robot handover pipeline with wide-baseline stereo RGB cameras. Our hand-object reconstruction enables a robot to successfully receive a diverse range of household objects from a human.
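The abstract names a projection-based outlier removal step but not its exact formulation; below is a minimal NumPy sketch of one plausible variant, where reconstructed points are kept only if they project inside the hand-object segmentation mask of every camera. The function name, the use of segmentation masks, and the (3, 4) projection-matrix convention are all assumptions for illustration, not the paper's confirmed implementation.

```python
import numpy as np

def remove_outliers(points, projections, masks):
    """Keep only 3D points whose projection falls inside the
    hand-object segmentation mask of every camera view.
    (Hypothetical sketch; not the paper's confirmed method.)

    points      : (N, 3) reconstructed vertices in the world frame
    projections : list of (3, 4) camera projection matrices
    masks       : list of (H, W) boolean hand-object masks
    """
    keep = np.ones(len(points), dtype=bool)
    homog = np.hstack([points, np.ones((len(points), 1))])   # (N, 4)
    for P, mask in zip(projections, masks):
        uvw = homog @ P.T                       # (N, 3) homogeneous pixels
        in_front = uvw[:, 2] > 0                # drop points behind the camera
        uv = uvw[:, :2] / np.maximum(uvw[:, 2:3], 1e-9)
        u = np.round(uv[:, 0]).astype(int)
        v = np.round(uv[:, 1]).astype(int)
        h, w = mask.shape
        inside = (u >= 0) & (u < w) & (v >= 0) & (v < h)
        on_mask = np.zeros(len(points), dtype=bool)
        on_mask[inside] = mask[v[inside], u[inside]]
        keep &= in_front & on_mask
    return points[keep]
```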
Abstract: Recent work on hand-object reconstruction has mainly focused on the single-view and dense multi-view settings. On the one hand, single-view methods can leverage learned shape priors to generalise to unseen objects, but are prone to inaccuracies caused by occlusion. On the other hand, dense multi-view methods are very accurate, but cannot easily adapt to unseen objects without further data collection. In contrast, sparse multi-view methods can exploit the additional views to tackle occlusion while keeping the computational cost low compared to dense multi-view methods. In this paper, we consider the problem of hand-object reconstruction with unseen objects in the sparse multi-view setting. Given multiple RGB images of the hand and object captured at the same time, our model SVHO combines the predictions from each view into a unified reconstruction without optimisation across views. We train our model on a synthetic hand-object dataset and evaluate it directly on a real-world hand-object dataset recorded with unseen objects. We show that, while reconstructing unseen hands and objects from RGB is challenging, additional views help improve reconstruction quality.
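The fusion rule SVHO uses is not specified in the abstract; as a hedged illustration of combining per-view predictions without cross-view optimisation, the sketch below fuses per-voxel occupancy probabilities (already resampled into a shared frame) either by averaging or by log-odds pooling. The function name and the occupancy-grid representation are assumptions for illustration only.

```python
import numpy as np

def fuse_views(occupancy_probs, mode="mean"):
    """Fuse per-view occupancy predictions into a single volume.
    (Hypothetical sketch; SVHO's actual fusion rule may differ.)

    occupancy_probs : (V, D, D, D) array with one probability volume
                      per view, already resampled into a shared frame.
    mode            : 'mean' averages the views; 'logodds' treats the
                      views as independent evidence.
    """
    p = np.clip(occupancy_probs, 1e-6, 1 - 1e-6)
    if mode == "mean":
        fused = p.mean(axis=0)
    else:  # log-odds (logit) pooling across views
        logits = np.log(p) - np.log1p(-p)
        fused = 1.0 / (1.0 + np.exp(-logits.sum(axis=0)))
    return fused >= 0.5  # binary occupancy grid
```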
Abstract: The control of a robot for manipulation tasks generally relies on object detection and pose estimation. An attractive alternative is to learn control policies directly from raw input data. However, this approach is time-consuming and expensive, since learning a policy requires many trials with robot actions in the physical environment. To reduce the training cost, the policy can be learned in simulation with a large set of synthetic images; the limitation of this approach is the domain gap between simulation and the robot workspace. In this paper, we propose to learn a policy for robot reaching movements from a single image captured in the robot workspace by a camera mounted on the end-effector (a hand-eye camera). The idea behind the proposed policy learner is that the view changes seen from the hand-eye camera as actions are executed in the robot workspace are analogous to the view changes produced by sequentially localising a region of interest within a single image. This analogy enables object-reaching policies to be trained with reinforcement-learning-based sequential object localisation. To help the policy adapt to view changes in the robot workspace, we further present a dynamic filter that learns to bias the input state, removing information that is irrelevant to the action decision. The proposed policy learner can serve as a powerful representation for robotic tasks, and we validate it on both static and moving object-reaching tasks.
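As one way to picture the dynamic filter, the PyTorch sketch below predicts a per-feature gate from the current state and multiplies it into the state before the action head, so irrelevant features can be suppressed. The layer sizes, module names, and the sigmoid-gating formulation are hypothetical; the paper's actual filter architecture may differ.

```python
import torch
import torch.nn as nn

class DynamicFilter(nn.Module):
    """Gates the input state with weights predicted from the state
    itself, suppressing features irrelevant to the action decision.
    (Hypothetical sketch of the dynamic-filter idea.)"""

    def __init__(self, state_dim, num_actions, hidden=128):
        super().__init__()
        # Small network that predicts one gate per state feature.
        self.gate = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, state_dim), nn.Sigmoid(),
        )
        self.policy_head = nn.Linear(state_dim, num_actions)

    def forward(self, state):
        filtered = state * self.gate(state)   # element-wise biasing
        return self.policy_head(filtered)     # action logits

# Example: score 4 localisation actions for a 256-d state.
logits = DynamicFilter(256, 4)(torch.randn(1, 256))
```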
Abstract: Safe human-to-robot handovers of unknown objects require accurate estimation of hand poses and object properties, such as shape, trajectory, and weight. Accurately estimating these properties normally requires scanned 3D object models or expensive equipment, such as motion capture systems and markers, or both. Moreover, testing handover algorithms on physical robots may be dangerous for the human and, when the object is an open container filled with liquid, for the robot. In this paper, we propose a real-to-simulation framework for developing safe human-to-robot handovers: from videos of a human manipulating an unknown cup or drinking glass, we estimate the physical properties of the container and the poses of the human hands. We complete the handover in simulation and estimate a region of the container that is not occluded by the hand of the human holding it. We also quantify the safety of the human and of the object in simulation. We validate the framework using public recordings of containers manipulated before a handover and show that the handover remains safe even with noisy estimates from a range of perceptual algorithms.
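One simple way to realise the non-occluded region estimate is a clearance test between the container surface and the estimated hand: keep only container points farther than a threshold from every hand point. The sketch below assumes point-cloud inputs and a hypothetical 5 cm clearance; it is an illustration under those assumptions, not the framework's actual method.

```python
import numpy as np

def graspable_region(container_pts, hand_pts, clearance=0.05):
    """Return container surface points farther than `clearance`
    (metres) from every estimated hand point.
    (Hypothetical sketch; threshold and inputs are assumptions.)

    container_pts : (N, 3) points sampled on the container surface
    hand_pts      : (M, 3) estimated hand keypoints/vertices
    """
    # Pairwise distances between container and hand points: (N, M)
    d = np.linalg.norm(
        container_pts[:, None, :] - hand_pts[None, :, :], axis=-1
    )
    return container_pts[d.min(axis=1) > clearance]
```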