Ryan Grindle

A good body is all you need: avoiding catastrophic interference via agent architecture search

Aug 20, 2021

Joshua Powers, Ryan Grindle, Lapo Frati, Josh Bongard

Abstract: In robotics, catastrophic interference continues to restrain policy training across environments. Efforts to combat catastrophic interference to date focus on novel neural architectures or training methods, with a recent emphasis on policies with good initial settings that facilitate training in new environments. However, none of these methods to date have taken into account how the physical architecture of the robot can obstruct or facilitate catastrophic interference, just as the choice of neural architecture can. In previous work we have shown how aspects of a robot's physical structure (specifically, sensor placement) can facilitate policy learning by increasing the fraction of optimal policies for a given physical structure. Here we show for the first time that this proxy measure of catastrophic interference correlates with sample efficiency across several search methods, proving that favorable loss landscapes can be induced by the correct choice of physical structure. We show that such structures can be found via co-optimization -- optimization of a robot's structure and control policy simultaneously -- yielding catastrophic interference resistant robot structures and policies, and that this is more efficient than control policy optimization alone. Finally, we show that such structures exhibit sensor homeostasis across environments and introduce this as the mechanism by which certain robots overcome catastrophic interference.

* arXiv admin note: text overlap with arXiv:1910.07487

Embodiment dictates learnability in neural controllers

Oct 15, 2019

Joshua Powers, Ryan Grindle, Sam Kriegman, Lapo Frati, Nick Cheney, Josh Bongard

Abstract: Catastrophic forgetting continues to severely restrict the learnability of controllers suitable for multiple task environments. Efforts to combat catastrophic forgetting reported in the literature to date have focused on how control systems can be updated more rapidly, hastening their adjustment from good initial settings to new environments, or more circumspectly, suppressing their ability to overfit to any one environment. When using robots, the environment includes the robot's own body, its shape and material properties, and how its actuators and sensors are distributed along its mechanical structure. Here we demonstrate for the first time how one such design decision (sensor placement) can alter the landscape of the loss function itself, either expanding or shrinking the weight manifolds containing suitable controllers for each individual task, thus increasing or decreasing their probability of overlap across tasks, and thus reducing or inducing the potential for catastrophic forgetting.
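
The overlap argument in this abstract can be illustrated with a toy Monte Carlo sketch. This is not the authors' method or code: the quadratic losses, the `width` parameter standing in for a design decision such as sensor placement, and the helper names `make_task`, `toy_embodiment`, and `solve_fractions` are all hypothetical. The sketch samples random weight vectors and estimates what fraction solves each task, and what fraction solves both.

```python
import random


def make_task(center, width):
    """Hypothetical task: quadratic loss whose low-loss region is a disc.

    `width` is an assumed stand-in for an embodiment choice that widens
    or narrows the set of weights that count as a suitable controller.
    """
    def loss(w):
        return sum(((wi - ci) / width) ** 2 for wi, ci in zip(w, center))
    return loss


def toy_embodiment(width):
    # Two tasks whose best weights sit at different points in weight space.
    return [make_task((-0.3, 0.0), width), make_task((0.3, 0.0), width)]


def solve_fractions(task_losses, threshold, n_samples=20000, dim=2, seed=0):
    """Estimate the fraction of random weights solving each task and all tasks."""
    rng = random.Random(seed)
    per_task = [0] * len(task_losses)
    joint = 0
    for _ in range(n_samples):
        # Sample a weight vector uniformly from the cube [-1, 1]^dim.
        w = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
        solved = [loss(w) <= threshold for loss in task_losses]
        for i, ok in enumerate(solved):
            per_task[i] += ok
        joint += all(solved)
    return [c / n_samples for c in per_task], joint / n_samples


narrow_per_task, narrow_joint = solve_fractions(toy_embodiment(0.2), threshold=1.0)
wide_per_task, wide_joint = solve_fractions(toy_embodiment(0.6), threshold=1.0)

print("narrow:", narrow_per_task, narrow_joint)
print("wide:  ", wide_per_task, wide_joint)
```

In this toy setting the narrow regions are disjoint, so no sampled weight vector solves both tasks. The wide regions intersect, so some do. That mirrors, in a much simplified form, the claim that expanding per-task weight manifolds raises their probability of overlap.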
