Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ci Chen

Pretraining-finetuning Framework for Efficient Co-design: A Case Study on Quadruped Robot Parkour

Jul 09, 2024

Ci Chen, Jiyu Yu, Haojian Lu, Hongbo Gao, Rong Xiong, Yue Wang

Abstract:In nature, animals with exceptional locomotion abilities, such as cougars, often possess asymmetric fore and hind legs, with their powerful hind legs acting as reservoirs of energy for leaps. This observation inspired us: could optimize the leg length of quadruped robots endow them with similar locomotive capabilities? In this paper, we propose an approach that co-optimizes the mechanical structure and control policy to boost the locomotive prowess of quadruped robots. Specifically, we introduce a novel pretraining-finetuning framework, which not only guarantees optimal control strategies for each mechanical candidate but also ensures time efficiency. Additionally, we have devised an innovative training method for our pretraining network, integrating spatial domain randomization with regularization methods, markedly improving the network's generalizability. Our experimental results indicate that the proposed pretraining-finetuning framework significantly enhances the overall co-design performance with less time consumption. Moreover, the co-design strategy substantially exceeds the conventional method of independently optimizing control strategies, further improving the robot's locomotive performance and providing an innovative approach to enhancing the extreme parkour capabilities of quadruped robots.

Via

Access Paper or Ask Questions

Failure-aware Policy Learning for Self-assessable Robotics Tasks

Feb 25, 2023

Kechun Xu, Runjian Chen, Shuqi Zhao, Zizhang Li, Hongxiang Yu, Ci Chen, Yue Wang, Rong Xiong

Figure 1 for Failure-aware Policy Learning for Self-assessable Robotics Tasks

Figure 2 for Failure-aware Policy Learning for Self-assessable Robotics Tasks

Figure 3 for Failure-aware Policy Learning for Self-assessable Robotics Tasks

Figure 4 for Failure-aware Policy Learning for Self-assessable Robotics Tasks

Abstract:Self-assessment rules play an essential role in safe and effective real-world robotic applications, which verify the feasibility of the selected action before actual execution. But how to utilize the self-assessment results to re-choose actions remains a challenge. Previous methods eliminate the selected action evaluated as failed by the self-assessment rules, and re-choose one with the next-highest affordance~(i.e. process-of-elimination strategy [1]), which ignores the dependency between the self-assessment results and the remaining untried actions. However, this dependency is important since the previous failures might help trim the remaining over-estimated actions. In this paper, we set to investigate this dependency by learning a failure-aware policy. We propose two architectures for the failure-aware policy by representing the self-assessment results of previous failures as the variable state, and leveraging recurrent neural networks to implicitly memorize the previous failures. Experiments conducted on three tasks demonstrate that our method can achieve better performances with higher task success rates by less trials. Moreover, when the actions are correlated, learning a failure-aware policy can achieve better performance than the process-of-elimination strategy.

Via

Access Paper or Ask Questions

C^2:Co-design of Robots via Concurrent Networks Coupling Online and Offline Reinforcement Learning

Sep 14, 2022

Ci Chen, Pingyu Xiang, Haojian Lu, Yue Wang, Rong Xiong

Figure 1 for C^2:Co-design of Robots via Concurrent Networks Coupling Online and Offline Reinforcement Learning

Figure 2 for C^2:Co-design of Robots via Concurrent Networks Coupling Online and Offline Reinforcement Learning

Figure 3 for C^2:Co-design of Robots via Concurrent Networks Coupling Online and Offline Reinforcement Learning

Figure 4 for C^2:Co-design of Robots via Concurrent Networks Coupling Online and Offline Reinforcement Learning

Abstract:With the rise of computing power, using data-driven approaches for co-designing robots' morphology and controller has become a feasible way. Nevertheless, evaluating the fitness of the controller under each morphology is time-consuming. As a pioneering data-driven method, Co-adaptation utilizes a double-network mechanism with the aim of learning a Q function conditioned on morphology parameters to replace the traditional evaluation of a diverse set of candidates, thereby speeding up optimization. In this paper, we find that Co-adaptation ignores the existence of exploration error during training and state-action distribution shift during parameter transmitting, which hurt the performance. We propose the framework of the concurrent network that couples online and offline RL methods. By leveraging the behavior cloning term flexibly, we mitigate the impact of the above issues on the results. Simulation and physical experiments are performed to demonstrate that our proposed method outperforms baseline algorithms, which illustrates that the proposed method is an effective way of discovering the optimal combination of morphology and controller.

Via

Access Paper or Ask Questions

Motion Planning for Heterogeneous Unmanned Systems under Partial Observation from UAV

Jul 28, 2020

Ci Chen, Yuanfang Wan, Baowei Li, Chen Wang, Guangming Xie, Huanyu Jiang

Figure 1 for Motion Planning for Heterogeneous Unmanned Systems under Partial Observation from UAV

Figure 2 for Motion Planning for Heterogeneous Unmanned Systems under Partial Observation from UAV

Figure 3 for Motion Planning for Heterogeneous Unmanned Systems under Partial Observation from UAV

Figure 4 for Motion Planning for Heterogeneous Unmanned Systems under Partial Observation from UAV

Abstract:For heterogeneous unmanned systems composed of unmanned aerial vehicles (UAVs) and unmanned ground vehicles (UGVs), using UAVs serve as eyes to assist UGVs in motion planning is a promising research direction due to the UAVs' vast view scope. However, due to UAVs flight altitude limitations, it may be impossible to observe the global map, and motion planning in the local map is a POMDP (Partially Observable Markov Decision Process) problem. This paper proposes a motion planning algorithm for heterogeneous unmanned system under partial observation from UAV without reconstruction of global maps, which consists of two parts designed for perception and decision-making, respectively. For the perception part, we propose the Grid Map Generation Network (GMGN), which is used to perceive scenes from UAV's perspective and classify the pathways and obstacles. For the decision-making part, we propose the Motion Command Generation Network (MCGN). Due to the addition of memory mechanism, MCGN has planning and reasoning abilities under partial observation from UAVs. We evaluate our proposed algorithm by comparing with baseline algorithms. The results show that our method effectively plans the motion of heterogeneous unmanned systems and achieves a relatively high success rate.

Via

Access Paper or Ask Questions