Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Rachit Dubey

Modularity benefits reinforcement learning agents with competing homeostatic drives

Apr 13, 2022

Zack Dulberg, Rachit Dubey, Isabel M. Berwian, Jonathan D. Cohen

Figure 1 for Modularity benefits reinforcement learning agents with competing homeostatic drives

Figure 2 for Modularity benefits reinforcement learning agents with competing homeostatic drives

Figure 3 for Modularity benefits reinforcement learning agents with competing homeostatic drives

Abstract:The problem of balancing conflicting needs is fundamental to intelligence. Standard reinforcement learning algorithms maximize a scalar reward, which requires combining different objective-specific rewards into a single number. Alternatively, different objectives could also be combined at the level of action value, such that specialist modules responsible for different objectives submit different action suggestions to a decision process, each based on rewards that are independent of one another. In this work, we explore the potential benefits of this alternative strategy. We investigate a biologically relevant multi-objective problem, the continual homeostasis of a set of variables, and compare a monolithic deep Q-network to a modular network with a dedicated Q-learner for each variable. We find that the modular agent: a) requires minimal exogenously determined exploration; b) has improved sample efficiency; and c) is more robust to out-of-domain perturbation.

* 4 pages, accepted paper at the Multi-disciplinary Conference on Reinforcement Learning and Decision Making (RLDM) 2022

Via

Access Paper or Ask Questions

Connecting Context-specific Adaptation in Humans to Meta-learning

Dec 01, 2020

Rachit Dubey, Erin Grant, Michael Luo, Karthik Narasimhan, Thomas Griffiths

Figure 1 for Connecting Context-specific Adaptation in Humans to Meta-learning

Figure 2 for Connecting Context-specific Adaptation in Humans to Meta-learning

Figure 3 for Connecting Context-specific Adaptation in Humans to Meta-learning

Figure 4 for Connecting Context-specific Adaptation in Humans to Meta-learning

Abstract:Cognitive control, the ability of a system to adapt to the demands of a task, is an integral part of cognition. A widely accepted fact about cognitive control is that it is context-sensitive: Adults and children alike infer information about a task's demands from contextual cues and use these inferences to learn from ambiguous cues. However, the precise way in which people use contextual cues to guide adaptation to a new task remains poorly understood. This work connects the context-sensitive nature of cognitive control to a method for meta-learning with context-conditioned adaptation. We begin by identifying an essential difference between human learning and current approaches to meta-learning: In contrast to humans, existing meta-learning algorithms do not make use of task-specific contextual cues but instead rely exclusively on online feedback in the form of task-specific labels or rewards. To remedy this, we introduce a framework for using contextual information about a task to guide the initialization of task-specific models before adaptation to online feedback. We show how context-conditioned meta-learning can capture human behavior in a cognitive task and how it can be scaled to improve the speed of learning in various settings, including few-shot classification and low-sample reinforcement learning. Our work demonstrates that guiding meta-learning with task information can capture complex, human-like behavior, thereby deepening our understanding of cognitive control.

* 9 pages

Via

Access Paper or Ask Questions

Investigating Human Priors for Playing Video Games

Jul 25, 2018

Rachit Dubey, Pulkit Agrawal, Deepak Pathak, Thomas L. Griffiths, Alexei A. Efros

Figure 1 for Investigating Human Priors for Playing Video Games

Figure 2 for Investigating Human Priors for Playing Video Games

Figure 3 for Investigating Human Priors for Playing Video Games

Figure 4 for Investigating Human Priors for Playing Video Games

Abstract:What makes humans so good at solving seemingly complex video games? Unlike computers, humans bring in a great deal of prior knowledge about the world, enabling efficient decision making. This paper investigates the role of human priors for solving video games. Given a sample game, we conduct a series of ablation studies to quantify the importance of various priors on human performance. We do this by modifying the video game environment to systematically mask different types of visual information that could be used by humans as priors. We find that removal of some prior knowledge causes a drastic degradation in the speed with which human players solve the game, e.g. from 2 minutes to over 20 minutes. Furthermore, our results indicate that general priors, such as the importance of objects and visual consistency, are critical for efficient game-play. Videos and the game manipulations are available at https://rach0012.github.io/humanRL_website/

* ICML 2018
* ICML 2018

Via

Access Paper or Ask Questions

A rational analysis of curiosity

May 11, 2017

Rachit Dubey, Thomas L. Griffiths

Figure 1 for A rational analysis of curiosity

Figure 2 for A rational analysis of curiosity

Figure 3 for A rational analysis of curiosity

Figure 4 for A rational analysis of curiosity

Abstract:We present a rational analysis of curiosity, proposing that people's curiosity is driven by seeking stimuli that maximize their ability to make appropriate responses in the future. This perspective offers a way to unify previous theories of curiosity into a single framework. Experimental results confirm our model's predictions, showing how the relationship between curiosity and confidence can change significantly depending on the nature of the environment.

* Conference paper in CogSci 2017

Via

Access Paper or Ask Questions