Picture for Andre Barreto

Andre Barreto

Video as the New Language for Real-World Decision Making

Add code
Feb 27, 2024
Figure 1 for Video as the New Language for Real-World Decision Making
Figure 2 for Video as the New Language for Real-World Decision Making
Figure 3 for Video as the New Language for Real-World Decision Making
Figure 4 for Video as the New Language for Real-World Decision Making
Viaarxiv icon

Temporal Abstraction in Reinforcement Learning with the Successor Representation

Add code
Oct 12, 2021
Figure 1 for Temporal Abstraction in Reinforcement Learning with the Successor Representation
Figure 2 for Temporal Abstraction in Reinforcement Learning with the Successor Representation
Figure 3 for Temporal Abstraction in Reinforcement Learning with the Successor Representation
Figure 4 for Temporal Abstraction in Reinforcement Learning with the Successor Representation
Viaarxiv icon

Discovering Diverse Nearly Optimal Policies withSuccessor Features

Add code
Jun 01, 2021
Figure 1 for Discovering Diverse Nearly Optimal Policies withSuccessor Features
Figure 2 for Discovering Diverse Nearly Optimal Policies withSuccessor Features
Figure 3 for Discovering Diverse Nearly Optimal Policies withSuccessor Features
Figure 4 for Discovering Diverse Nearly Optimal Policies withSuccessor Features
Viaarxiv icon

Coverage as a Principle for Discovering Transferable Behavior in Reinforcement Learning

Add code
Feb 24, 2021
Figure 1 for Coverage as a Principle for Discovering Transferable Behavior in Reinforcement Learning
Figure 2 for Coverage as a Principle for Discovering Transferable Behavior in Reinforcement Learning
Figure 3 for Coverage as a Principle for Discovering Transferable Behavior in Reinforcement Learning
Figure 4 for Coverage as a Principle for Discovering Transferable Behavior in Reinforcement Learning
Viaarxiv icon

Discovering a set of policies for the worst case reward

Add code
Feb 08, 2021
Figure 1 for Discovering a set of policies for the worst case reward
Figure 2 for Discovering a set of policies for the worst case reward
Figure 3 for Discovering a set of policies for the worst case reward
Figure 4 for Discovering a set of policies for the worst case reward
Viaarxiv icon

Temporal Difference Uncertainties as a Signal for Exploration

Add code
Oct 05, 2020
Figure 1 for Temporal Difference Uncertainties as a Signal for Exploration
Figure 2 for Temporal Difference Uncertainties as a Signal for Exploration
Figure 3 for Temporal Difference Uncertainties as a Signal for Exploration
Figure 4 for Temporal Difference Uncertainties as a Signal for Exploration
Viaarxiv icon

Disentangled Cumulants Help Successor Representations Transfer to New Tasks

Add code
Nov 25, 2019
Figure 1 for Disentangled Cumulants Help Successor Representations Transfer to New Tasks
Figure 2 for Disentangled Cumulants Help Successor Representations Transfer to New Tasks
Figure 3 for Disentangled Cumulants Help Successor Representations Transfer to New Tasks
Figure 4 for Disentangled Cumulants Help Successor Representations Transfer to New Tasks
Viaarxiv icon

General non-linear Bellman equations

Add code
Jul 08, 2019
Figure 1 for General non-linear Bellman equations
Figure 2 for General non-linear Bellman equations
Viaarxiv icon

Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates

Add code
Jun 19, 2019
Figure 1 for Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates
Figure 2 for Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates
Figure 3 for Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates
Figure 4 for Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates
Viaarxiv icon

Fast Task Inference with Variational Intrinsic Successor Features

Add code
Jun 12, 2019
Figure 1 for Fast Task Inference with Variational Intrinsic Successor Features
Figure 2 for Fast Task Inference with Variational Intrinsic Successor Features
Figure 3 for Fast Task Inference with Variational Intrinsic Successor Features
Figure 4 for Fast Task Inference with Variational Intrinsic Successor Features
Viaarxiv icon