Pablo Sprechmann

University of Minnesota

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023

TacticAI: an AI assistant for football tactics

Oct 17, 2023

Unlocking the Power of Representations in Long-term Novelty-based Exploration

May 02, 2023

Coverage as a Principle for Discovering Transferable Behavior in Reinforcement Learning

Feb 24, 2021

Game Plan: What AI can do for Football, and What Football can do for AI

Nov 18, 2020

Temporal Difference Uncertainties as a Signal for Exploration

Oct 05, 2020

Agent57: Outperforming the Atari Human Benchmark

Mar 30, 2020

Never Give Up: Learning Directed Exploration Strategies

Feb 14, 2020

Meta-learning of Sequential Strategies

May 08, 2019

Fast deep reinforcement learning using online adjustments from the past

Oct 18, 2018