Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Hardware as Policy: Mechanical and Computational Co-Optimization using Deep Reinforcement Learning

Aug 11, 2020

Tianjian Chen, Zhanpeng He, Matei Ciocarlie

Figure 1 for Hardware as Policy: Mechanical and Computational Co-Optimization using Deep Reinforcement Learning

Figure 2 for Hardware as Policy: Mechanical and Computational Co-Optimization using Deep Reinforcement Learning

Figure 3 for Hardware as Policy: Mechanical and Computational Co-Optimization using Deep Reinforcement Learning

Figure 4 for Hardware as Policy: Mechanical and Computational Co-Optimization using Deep Reinforcement Learning

Share this with someone who'll enjoy it:

Abstract:Deep Reinforcement Learning (RL) has shown great success in learning complex control policies for a variety of applications in robotics. However, in most such cases, the hardware of the robot has been considered immutable, modeled as part of the environment. In this study, we explore the problem of learning hardware and control parameters together in a unified RL framework. To achieve this, we propose to model aspects of the robot's hardware as a "mechanical policy", analogous to and optimized jointly with its computational counterpart. We show that, by modeling such mechanical policies as auto-differentiable computational graphs, the ensuing optimization problem can be solved efficiently by gradient-based algorithms from the Policy Optimization family. We present two such design examples: a toy mass-spring problem, and a real-world problem of designing an underactuated hand. We compare our method against traditional co-optimization approaches, and also demonstrate its effectiveness by building a physical prototype based on the learned hardware parameters.

* Submitted to Conference on Robot Learning (CoRL) 2020

View paper on

Share this with someone who'll enjoy it:

Title:Hardware as Policy: Mechanical and Computational Co-Optimization using Deep Reinforcement Learning

Paper and Code