Gerald Tesauro

Learning in Factored Domains with Information-Constrained Visual Representations

Mar 30, 2023

Game-Theoretical Perspectives on Active Equilibria: A Preferred Solution Concept over Nash Equilibria

Oct 28, 2022

Influencing Long-Term Behavior in Multiagent Reinforcement Learning

Mar 07, 2022

AI Planning Annotation for Sample Efficient Reinforcement Learning

Mar 01, 2022

Context-Specific Representation Abstraction for Deep Option Learning

Sep 20, 2021

Consolidation via Policy Information Regularization in Deep RL for Multi-Agent Games

Nov 23, 2020

A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning

Oct 31, 2020

Deep RL With Information Constrained Policies: Generalization in Continuous Control

Oct 09, 2020

Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines

Oct 08, 2020

Finding Macro-Actions with Disentangled Effects for Efficient Planning with the Goal-Count Heuristic

Apr 28, 2020