Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Taozheng Yang

Exploring Visual Pre-training for Robot Manipulation: Datasets, Models and Methods

Aug 07, 2023

Ya Jing, Xuelin Zhu, Xingbin Liu, Qie Sima, Taozheng Yang, Yunhai Feng, Tao Kong

Figure 1 for Exploring Visual Pre-training for Robot Manipulation: Datasets, Models and Methods

Figure 2 for Exploring Visual Pre-training for Robot Manipulation: Datasets, Models and Methods

Figure 3 for Exploring Visual Pre-training for Robot Manipulation: Datasets, Models and Methods

Figure 4 for Exploring Visual Pre-training for Robot Manipulation: Datasets, Models and Methods

Abstract:Visual pre-training with large-scale real-world data has made great progress in recent years, showing great potential in robot learning with pixel observations. However, the recipes of visual pre-training for robot manipulation tasks are yet to be built. In this paper, we thoroughly investigate the effects of visual pre-training strategies on robot manipulation tasks from three fundamental perspectives: pre-training datasets, model architectures and training methods. Several significant experimental findings are provided that are beneficial for robot learning. Further, we propose a visual pre-training scheme for robot manipulation termed Vi-PRoM, which combines self-supervised learning and supervised learning. Concretely, the former employs contrastive learning to acquire underlying patterns from large-scale unlabeled data, while the latter aims learning visual semantics and temporal dynamics. Extensive experiments on robot manipulations in various simulation environments and the real robot demonstrate the superiority of the proposed scheme. Videos and more details can be found on \url{https://explore-pretrain-robot.github.io}.

* IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023

Via

Access Paper or Ask Questions

MOMA-Force: Visual-Force Imitation for Real-World Mobile Manipulation

Aug 07, 2023

Taozheng Yang, Ya Jing, Hongtao Wu, Jiafeng Xu, Kuankuan Sima, Guangzeng Chen, Qie Sima, Tao Kong

Figure 1 for MOMA-Force: Visual-Force Imitation for Real-World Mobile Manipulation

Figure 2 for MOMA-Force: Visual-Force Imitation for Real-World Mobile Manipulation

Figure 3 for MOMA-Force: Visual-Force Imitation for Real-World Mobile Manipulation

Figure 4 for MOMA-Force: Visual-Force Imitation for Real-World Mobile Manipulation

Abstract:In this paper, we present a novel method for mobile manipulators to perform multiple contact-rich manipulation tasks. While learning-based methods have the potential to generate actions in an end-to-end manner, they often suffer from insufficient action accuracy and robustness against noise. On the other hand, classical control-based methods can enhance system robustness, but at the cost of extensive parameter tuning. To address these challenges, we present MOMA-Force, a visual-force imitation method that seamlessly combines representation learning for perception, imitation learning for complex motion generation, and admittance whole-body control for system robustness and controllability. MOMA-Force enables a mobile manipulator to learn multiple complex contact-rich tasks with high success rates and small contact forces. In a real household setting, our method outperforms baseline methods in terms of task success rates. Moreover, our method achieves smaller contact forces and smaller force variances compared to baseline methods without force imitation. Overall, we offer a promising approach for efficient and robust mobile manipulation in the real world. Videos and more details can be found on \url{https://visual-force-imitation.github.io}

* IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023

Via

Access Paper or Ask Questions