Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sidhant Kaushik

Modularity in Reinforcement Learning via Algorithmic Independence in Credit Assignment

Jul 21, 2021

Michael Chang, Sidhant Kaushik, Sergey Levine, Thomas L. Griffiths

Figure 1 for Modularity in Reinforcement Learning via Algorithmic Independence in Credit Assignment

Figure 2 for Modularity in Reinforcement Learning via Algorithmic Independence in Credit Assignment

Figure 3 for Modularity in Reinforcement Learning via Algorithmic Independence in Credit Assignment

Figure 4 for Modularity in Reinforcement Learning via Algorithmic Independence in Credit Assignment

Abstract:Many transfer problems require re-using previously optimal decisions for solving new tasks, which suggests the need for learning algorithms that can modify the mechanisms for choosing certain actions independently of those for choosing others. However, there is currently no formalism nor theory for how to achieve this kind of modular credit assignment. To answer this question, we define modular credit assignment as a constraint on minimizing the algorithmic mutual information among feedback signals for different decisions. We introduce what we call the modularity criterion for testing whether a learning algorithm satisfies this constraint by performing causal analysis on the algorithm itself. We generalize the recently proposed societal decision-making framework as a more granular formalism than the Markov decision process to prove that for decision sequences that do not contain cycles, certain single-step temporal difference action-value methods meet this criterion while all policy-gradient methods do not. Empirical evidence suggests that such action-value methods are more sample efficient than policy-gradient methods on transfer problems that require only sparse changes to a sequence of previously optimal decisions.

* Long Presentation at the Thirty-eighth International Conference on Machine Learning (ICML) 2021. 21 pages, 11 figures. v2: updated acknowledgments. v3: clarified that the internal function nodes of the credit assignment mechanism are not considered O(1)

Via

Access Paper or Ask Questions

Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions

Jul 05, 2020

Michael Chang, Sidhant Kaushik, S. Matthew Weinberg, Thomas L. Griffiths, Sergey Levine

Figure 1 for Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions

Figure 2 for Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions

Figure 3 for Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions

Figure 4 for Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions

Abstract:This paper seeks to establish a framework for directing a society of simple, specialized, self-interested agents to solve what traditionally are posed as monolithic single-agent sequential decision problems. What makes it challenging to use a decentralized approach to collectively optimize a central objective is the difficulty in characterizing the equilibrium strategy profile of non-cooperative games. To overcome this challenge, we design a mechanism for defining the learning environment of each agent for which we know that the optimal solution for the global objective coincides with a Nash equilibrium strategy profile of the agents optimizing their own local objectives. The society functions as an economy of agents that learn the credit assignment process itself by buying and selling to each other the right to operate on the environment state. We derive a class of decentralized reinforcement learning algorithms that are broadly applicable not only to standard reinforcement learning but also for selecting options in semi-MDPs and dynamically composing computation graphs. Lastly, we demonstrate the potential advantages of a society's inherent modular structure for more efficient transfer learning.

* 17 pages, 12 figures, accepted to the International Conference on Machine Learning (ICML) 2020

Via

Access Paper or Ask Questions