Recent meta-reinforcement learning work has emphasized the importance of mnemonic control for agents to quickly assimilate relevant experience in new contexts and suitably adapt their policy. However, which computational mechanisms support such flexible behavioral adaptation from past experience remains an open question. Inspired by neuroscience, we propose MetODS (Meta-Optimized Dynamical Synapses), a broadly applicable meta-reinforcement learning model that leverages fast synaptic dynamics influenced by action-reward feedback. We develop a theoretical interpretation of MetODS as a model that learns powerful control rules in policy space, and we demonstrate empirically that robust reinforcement learning programs emerge spontaneously from these rules. We further propose a formalism that efficiently optimizes the meta-parameters governing the MetODS synaptic processes. Across multiple experiments and domains, MetODS outperforms or compares favorably with previous meta-reinforcement learning approaches. Our agents can perform one-shot learning, approach optimal exploration/exploitation strategies, generalize navigation principles to unseen environments, and demonstrate a strong ability to learn adaptive motor policies.
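As a rough illustration of what reward-modulated fast synaptic dynamics with meta-learned plasticity parameters can look like, the sketch below implements a generic Hebbian fast-weight update gated by a scalar reward signal. It is a minimal sketch under assumed forms, not the MetODS update itself: all names and the specific update rule (`W_fast`, `W_slow`, `eta`, `decay`, `hebbian_step`) are hypothetical placeholders introduced here for illustration.

```python
# Illustrative sketch only: the actual MetODS update rules and meta-optimization
# are not specified in this abstract. This shows one generic form of
# reward-modulated Hebbian fast-weight dynamics; eta, decay, W_fast, W_slow and
# hebbian_step are assumed placeholders, not the paper's definitions.
import numpy as np

rng = np.random.default_rng(0)

# Meta-parameters that an outer (meta-learning) loop would optimize (assumed form).
eta = 0.1      # plasticity rate
decay = 0.95   # fast-weight decay toward the slow, meta-learned weights

n_in, n_out = 8, 4
W_slow = rng.normal(scale=0.1, size=(n_out, n_in))  # meta-learned baseline weights
W_fast = W_slow.copy()                               # synapses updated within an episode

def policy(obs, W):
    """Softmax policy over discrete actions given the current fast weights."""
    logits = W @ obs
    p = np.exp(logits - logits.max())
    return p / p.sum()

def hebbian_step(W_fast, obs, action_onehot, reward):
    """Reward-modulated Hebbian update of the fast weights (illustrative)."""
    outer = np.outer(action_onehot, obs)                      # pre/post coincidence
    return decay * W_fast + (1 - decay) * W_slow + eta * reward * outer

# One inner-loop interaction: act, receive reward feedback, update synapses.
obs = rng.normal(size=n_in)
p = policy(obs, W_fast)
a = rng.choice(n_out, p=p)
reward = 1.0                                                  # placeholder feedback signal
W_fast = hebbian_step(W_fast, obs, np.eye(n_out)[a], reward)
```

In this kind of scheme, only the inner loop changes `W_fast` during an episode, while a separate outer loop would tune `eta`, `decay`, and `W_slow` across tasks; the abstract's "meta-parameters governing synaptic processes" are assumed here to play that outer-loop role.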