Abstract: The ability to estimate joint parameters is essential for many applications in robotics and computer vision. In this paper, we propose CAPT: category-level articulation estimation from a point cloud using Transformer. CAPT uses an end-to-end Transformer-based architecture to estimate the joint parameters and states of articulated objects from a single point cloud. The proposed method estimates joint parameters and states for various articulated objects with high precision and robustness. The paper also introduces a motion loss approach, which improves articulation estimation performance by emphasizing the dynamic features of articulated objects. In addition, the paper presents a double voting strategy that provides the framework with coarse-to-fine parameter estimation. Experimental results on several category datasets demonstrate that our method outperforms existing alternatives for articulation estimation. Our research provides a promising solution for applying Transformer-based architectures to articulated object analysis.
Abstract: As the number of a robot's degrees of freedom increases, implementing robot motion becomes more complex and difficult. In this study, we focus on learning 6-DOF grasping motion and consider dividing the grasping motion into multiple tasks. We propose combining imitation learning and reinforcement learning to learn the desired motion more efficiently. To collect demonstration data as teacher data for imitation learning, we created a virtual reality (VR) interface that allows humans to operate the robot intuitively. Moreover, by dividing the motion into simpler tasks, we simplify the design of the reward functions for reinforcement learning, and our experiments show a reduction in the number of steps required to learn the grasping motion.
Abstract: We have been developing a paradigm, which we refer to as Learning-from-observation, that enables a robot to automatically acquire what-to-do through observation of human performance. Because simply mimicking exact joint angles does not work due to the kinematic and dynamic differences between a human and a robot, the method introduces an intermediate symbolic representation, task models, to conceptually represent what-to-do from observation. These task models are then mapped to appropriate robot motions depending on the robot hardware. This paper presents task models, designed based on Labanotation, for upper-body movements of humanoid robots. Given a human motion sequence, we first analyze the motions of the upper body and extract fixed poses at key frames. These key poses are translated into states represented by Labanotation symbols. Task models, identified from the state transitions, are then mapped to robot movements on particular robot hardware. Since the Labanotation-based task models are independent of robot hardware, we can share the same observation module; only the task-mapping modules depend on the specific robot hardware. The system was implemented and demonstrated that three different robots can automatically mimic human upper-body motions with a satisfactory level of resemblance.