Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:CtRNet-X: Camera-to-Robot Pose Estimation in Real-world Conditions Using a Single Camera

Sep 16, 2024

Jingpei Lu, Zekai Liang, Tristin Xie, Florian Ritcher, Shan Lin, Sainan Liu, Michael C. Yip

Figure 1 for CtRNet-X: Camera-to-Robot Pose Estimation in Real-world Conditions Using a Single Camera

Figure 2 for CtRNet-X: Camera-to-Robot Pose Estimation in Real-world Conditions Using a Single Camera

Figure 3 for CtRNet-X: Camera-to-Robot Pose Estimation in Real-world Conditions Using a Single Camera

Figure 4 for CtRNet-X: Camera-to-Robot Pose Estimation in Real-world Conditions Using a Single Camera

Share this with someone who'll enjoy it:

Abstract:Camera-to-robot calibration is crucial for vision-based robot control and requires effort to make it accurate. Recent advancements in markerless pose estimation methods have eliminated the need for time-consuming physical setups for camera-to-robot calibration. While the existing markerless pose estimation methods have demonstrated impressive accuracy without the need for cumbersome setups, they rely on the assumption that all the robot joints are visible within the camera's field of view. However, in practice, robots usually move in and out of view, and some portion of the robot may stay out-of-frame during the whole manipulation task due to real-world constraints, leading to a lack of sufficient visual features and subsequent failure of these approaches. To address this challenge and enhance the applicability to vision-based robot control, we propose a novel framework capable of estimating the robot pose with partially visible robot manipulators. Our approach leverages the Vision-Language Models for fine-grained robot components detection, and integrates it into a keypoint-based pose estimation network, which enables more robust performance in varied operational conditions. The framework is evaluated on both public robot datasets and self-collected partial-view datasets to demonstrate our robustness and generalizability. As a result, this method is effective for robot pose estimation in a wider range of real-world manipulation scenarios.

* 7 pages, 5 figures, project website: https://sites.google.com/ucsd.edu/ctrnet-x

View paper on

Share this with someone who'll enjoy it:

Title:CtRNet-X: Camera-to-Robot Pose Estimation in Real-world Conditions Using a Single Camera

Paper and Code