Picture for Simon Schmitt

Simon Schmitt

General Uncertainty Estimation with Delta Variances

Add code
Feb 20, 2025
Viaarxiv icon

Exploration via Epistemic Value Estimation

Add code
Mar 07, 2023
Viaarxiv icon

Chaining Value Functions for Off-Policy Learning

Add code
Feb 02, 2022
Figure 1 for Chaining Value Functions for Off-Policy Learning
Figure 2 for Chaining Value Functions for Off-Policy Learning
Figure 3 for Chaining Value Functions for Off-Policy Learning
Figure 4 for Chaining Value Functions for Off-Policy Learning
Viaarxiv icon

Learning and Planning in Complex Action Spaces

Add code
Apr 13, 2021
Figure 1 for Learning and Planning in Complex Action Spaces
Figure 2 for Learning and Planning in Complex Action Spaces
Figure 3 for Learning and Planning in Complex Action Spaces
Figure 4 for Learning and Planning in Complex Action Spaces
Viaarxiv icon

Muesli: Combining Improvements in Policy Optimization

Add code
Apr 13, 2021
Figure 1 for Muesli: Combining Improvements in Policy Optimization
Figure 2 for Muesli: Combining Improvements in Policy Optimization
Figure 3 for Muesli: Combining Improvements in Policy Optimization
Figure 4 for Muesli: Combining Improvements in Policy Optimization
Viaarxiv icon

AlgebraNets

Add code
Jun 16, 2020
Figure 1 for AlgebraNets
Figure 2 for AlgebraNets
Figure 3 for AlgebraNets
Figure 4 for AlgebraNets
Viaarxiv icon

Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model

Add code
Nov 19, 2019
Figure 1 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Figure 2 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Figure 3 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Figure 4 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Viaarxiv icon

Gated Linear Networks

Add code
Sep 30, 2019
Figure 1 for Gated Linear Networks
Figure 2 for Gated Linear Networks
Figure 3 for Gated Linear Networks
Viaarxiv icon

Off-Policy Actor-Critic with Shared Experience Replay

Add code
Sep 25, 2019
Figure 1 for Off-Policy Actor-Critic with Shared Experience Replay
Figure 2 for Off-Policy Actor-Critic with Shared Experience Replay
Figure 3 for Off-Policy Actor-Critic with Shared Experience Replay
Figure 4 for Off-Policy Actor-Critic with Shared Experience Replay
Viaarxiv icon

Multi-task Deep Reinforcement Learning with PopArt

Add code
Sep 12, 2018
Figure 1 for Multi-task Deep Reinforcement Learning with PopArt
Figure 2 for Multi-task Deep Reinforcement Learning with PopArt
Figure 3 for Multi-task Deep Reinforcement Learning with PopArt
Figure 4 for Multi-task Deep Reinforcement Learning with PopArt
Viaarxiv icon