Title: KITchen: A Real-World Benchmark and Dataset for 6D Object Pose Estimation in Kitchen Environments

Mar 24, 2024

Abdelrahman Younes, Tamim Asfour

Abstract: Despite recent progress on 6D object pose estimation methods for robotic grasping, a substantial gap persists between the performance of these methods on existing datasets and their efficacy in real-world mobile manipulation tasks, particularly when robots rely solely on their monocular egocentric field of view (FOV). Existing real-world datasets focus primarily on table-top grasping scenarios, where a robotic arm is placed in a fixed position and the objects are centered within the FOV of one or more fixed external cameras. Performance on such datasets may not accurately reflect the challenges encountered in everyday mobile manipulation tasks in kitchen environments, such as retrieving objects from high shelves, sinks, dishwashers, ovens, refrigerators, or microwaves. To address this gap, we present KITchen, a novel benchmark designed specifically for estimating the 6D poses of objects located in diverse positions within kitchen settings. For this purpose, we recorded a comprehensive dataset comprising around 205k real-world RGBD images of 111 kitchen objects, captured in two distinct kitchens from the egocentric perspective of a humanoid robot. We then developed a semi-automated annotation pipeline to streamline the labeling of such datasets, which generates 2D object labels, 2D object segmentation masks, and 6D object poses with minimal human effort. The benchmark, the dataset, and the annotation pipeline are available at https://kitchen-dataset.github.io/KITchen.
