Picture for Simon Schmitt

Simon Schmitt

Exploration via Epistemic Value Estimation

Add code
Mar 07, 2023
Viaarxiv icon

Chaining Value Functions for Off-Policy Learning

Add code
Feb 02, 2022
Figure 1 for Chaining Value Functions for Off-Policy Learning
Figure 2 for Chaining Value Functions for Off-Policy Learning
Figure 3 for Chaining Value Functions for Off-Policy Learning
Figure 4 for Chaining Value Functions for Off-Policy Learning
Viaarxiv icon

Learning and Planning in Complex Action Spaces

Add code
Apr 13, 2021
Figure 1 for Learning and Planning in Complex Action Spaces
Figure 2 for Learning and Planning in Complex Action Spaces
Figure 3 for Learning and Planning in Complex Action Spaces
Figure 4 for Learning and Planning in Complex Action Spaces
Viaarxiv icon

Muesli: Combining Improvements in Policy Optimization

Add code
Apr 13, 2021
Figure 1 for Muesli: Combining Improvements in Policy Optimization
Figure 2 for Muesli: Combining Improvements in Policy Optimization
Figure 3 for Muesli: Combining Improvements in Policy Optimization
Figure 4 for Muesli: Combining Improvements in Policy Optimization
Viaarxiv icon

AlgebraNets

Add code
Jun 16, 2020
Figure 1 for AlgebraNets
Figure 2 for AlgebraNets
Figure 3 for AlgebraNets
Figure 4 for AlgebraNets
Viaarxiv icon

Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model

Add code
Nov 19, 2019
Figure 1 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Figure 2 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Figure 3 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Figure 4 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Viaarxiv icon

Gated Linear Networks

Add code
Sep 30, 2019
Figure 1 for Gated Linear Networks
Figure 2 for Gated Linear Networks
Figure 3 for Gated Linear Networks
Viaarxiv icon

Off-Policy Actor-Critic with Shared Experience Replay

Add code
Sep 25, 2019
Figure 1 for Off-Policy Actor-Critic with Shared Experience Replay
Figure 2 for Off-Policy Actor-Critic with Shared Experience Replay
Figure 3 for Off-Policy Actor-Critic with Shared Experience Replay
Figure 4 for Off-Policy Actor-Critic with Shared Experience Replay
Viaarxiv icon

Multi-task Deep Reinforcement Learning with PopArt

Add code
Sep 12, 2018
Figure 1 for Multi-task Deep Reinforcement Learning with PopArt
Figure 2 for Multi-task Deep Reinforcement Learning with PopArt
Figure 3 for Multi-task Deep Reinforcement Learning with PopArt
Figure 4 for Multi-task Deep Reinforcement Learning with PopArt
Viaarxiv icon

Kickstarting Deep Reinforcement Learning

Add code
Mar 10, 2018
Figure 1 for Kickstarting Deep Reinforcement Learning
Figure 2 for Kickstarting Deep Reinforcement Learning
Figure 3 for Kickstarting Deep Reinforcement Learning
Figure 4 for Kickstarting Deep Reinforcement Learning
Viaarxiv icon