Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Flat'n'Fold: A Diverse Multi-Modal Dataset for Garment Perception and Manipulation

Sep 26, 2024

Lipeng Zhuang, Shiyu Fan, Yingdong Ru, Florent Audonnet, Paul Henderson, Gerardo Aragon-Camarasa

Figure 1 for Flat'n'Fold: A Diverse Multi-Modal Dataset for Garment Perception and Manipulation

Figure 2 for Flat'n'Fold: A Diverse Multi-Modal Dataset for Garment Perception and Manipulation

Figure 3 for Flat'n'Fold: A Diverse Multi-Modal Dataset for Garment Perception and Manipulation

Figure 4 for Flat'n'Fold: A Diverse Multi-Modal Dataset for Garment Perception and Manipulation

Share this with someone who'll enjoy it:

Abstract:We present Flat'n'Fold, a novel large-scale dataset for garment manipulation that addresses critical gaps in existing datasets. Comprising 1,212 human and 887 robot demonstrations of flattening and folding 44 unique garments across 8 categories, Flat'n'Fold surpasses prior datasets in size, scope, and diversity. Our dataset uniquely captures the entire manipulation process from crumpled to folded states, providing synchronized multi-view RGB-D images, point clouds, and action data, including hand or gripper positions and rotations. We quantify the dataset's diversity and complexity compared to existing benchmarks and show that our dataset features natural and diverse manipulations of real-world demonstrations of human and robot demonstrations in terms of visual and action information. To showcase Flat'n'Fold's utility, we establish new benchmarks for grasping point prediction and subtask decomposition. Our evaluation of state-of-the-art models on these tasks reveals significant room for improvement. This underscores Flat'n'Fold's potential to drive advances in robotic perception and manipulation of deformable objects. Our dataset can be downloaded at https://cvas-ug.github.io/flat-n-fold

View paper on

Share this with someone who'll enjoy it:

Title:Flat'n'Fold: A Diverse Multi-Modal Dataset for Garment Perception and Manipulation

Paper and Code