Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jenna Reinen

Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL

May 12, 2020

Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf, Jenna Reinen, Irina Rish

Figure 1 for Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL

Figure 2 for Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL

Figure 3 for Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL

Figure 4 for Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL

Abstract:Artificial behavioral agents are often evaluated based on their consistent behaviors and performance to take sequential actions in an environment to maximize some notion of cumulative reward. However, human decision making in real life usually involves different strategies and behavioral trajectories that lead to the same empirical outcome. Motivated by clinical literature of a wide range of neurological and psychiatric disorders, we propose here a more general and flexible parametric framework for sequential decision making that involves a two-stream reward processing mechanism. We demonstrated that this framework is flexible and unified enough to incorporate a family of problems spanning multi-armed bandits (MAB), contextual bandits (CB) and reinforcement learning (RL), which decompose the sequential decision making process in different levels. Inspired by the known reward processing abnormalities of many mental disorders, our clinically-inspired agents demonstrated interesting behavioral trajectories and comparable performance on simulated tasks with particular reward distributions, a real-world dataset capturing human decision-making in gambling tasks, and the PacMan game across different reward stationarities in a lifelong learning setting.

* This article supersedes and extends our work arXiv:1706.02897 (MAB) and arXiv:1906.11286 (RL) into the Contextual Bandit (CB) framework. It generalized extensively into multi-armed bandits, contextual bandits and RL settings to create a unified framework of human behavioral agents

Via

Access Paper or Ask Questions

Reinforcement Learning Models of Human Behavior: Reward Processing in Mental Disorders

Jun 28, 2019

Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf, Jenna Reinen, Irina Rish

Figure 1 for Reinforcement Learning Models of Human Behavior: Reward Processing in Mental Disorders

Figure 2 for Reinforcement Learning Models of Human Behavior: Reward Processing in Mental Disorders

Figure 3 for Reinforcement Learning Models of Human Behavior: Reward Processing in Mental Disorders

Figure 4 for Reinforcement Learning Models of Human Behavior: Reward Processing in Mental Disorders

Abstract:Drawing an inspiration from behavioral studies of human decision making, we propose here a general parametric framework for a reinforcement learning problem, which extends the standard Q-learning approach to incorporate a two-stream framework of reward processing with biases biologically associated with several neurological and psychiatric conditions, including Parkinson's and Alzheimer's diseases, attention-deficit/hyperactivity disorder (ADHD), addiction, and chronic pain. For AI community, the development of agents that react differently to different types of rewards can enable us to understand a wide spectrum of multi-agent interactions in complex real-world socioeconomic systems. Empirically, the proposed model outperforms Q-Learning and Double Q-Learning in artificial scenarios with certain reward distributions and real-world human decision making gambling tasks. Moreover, from the behavioral modeling perspective, our parametric framework can be viewed as a first step towards a unifying computational model capturing reward processing abnormalities across multiple mental conditions and user preferences in long-term recommendation systems.

* arXiv admin note: substantial text overlap with arXiv:1706.02897

Via

Access Paper or Ask Questions

Autism Classification Using Brain Functional Connectivity Dynamics and Machine Learning

Dec 21, 2017

Ravi Tejwani, Adam Liska, Hongyuan You, Jenna Reinen, Payel Das

Figure 1 for Autism Classification Using Brain Functional Connectivity Dynamics and Machine Learning

Figure 2 for Autism Classification Using Brain Functional Connectivity Dynamics and Machine Learning

Figure 3 for Autism Classification Using Brain Functional Connectivity Dynamics and Machine Learning

Figure 4 for Autism Classification Using Brain Functional Connectivity Dynamics and Machine Learning

Abstract:The goal of the present study is to identify autism using machine learning techniques and resting-state brain imaging data, leveraging the temporal variability of the functional connections (FC) as the only information. We estimated and compared the FC variability across brain regions between typical, healthy subjects and autistic population by analyzing brain imaging data from a world-wide multi-site database known as ABIDE (Autism Brain Imaging Data Exchange). Our analysis revealed that patients diagnosed with autism spectrum disorder (ASD) show increased FC variability in several brain regions that are associated with low FC variability in the typical brain. We then used the enhanced FC variability of brain regions as features for training machine learning models for ASD classification and achieved 65% accuracy in identification of ASD versus control subjects within the dataset. We also used node strength estimated from number of functional connections per node averaged over the whole scan as features for ASD classification.The results reveal that the dynamic FC measures outperform or are comparable with the static FC measures in predicting ASD.

Via

Access Paper or Ask Questions