Picture for Julian Schrittwieser

Julian Schrittwieser

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning

Add code
Aug 07, 2023
Viaarxiv icon

Optimizing Memory Mapping Using Deep Reinforcement Learning

Add code
May 11, 2023
Viaarxiv icon

MuZero with Self-competition for Rate Control in VP9 Video Compression

Add code
Feb 14, 2022
Figure 1 for MuZero with Self-competition for Rate Control in VP9 Video Compression
Figure 2 for MuZero with Self-competition for Rate Control in VP9 Video Compression
Figure 3 for MuZero with Self-competition for Rate Control in VP9 Video Compression
Figure 4 for MuZero with Self-competition for Rate Control in VP9 Video Compression
Viaarxiv icon

Procedural Generalization by Planning with Self-Supervised World Models

Add code
Nov 02, 2021
Figure 1 for Procedural Generalization by Planning with Self-Supervised World Models
Figure 2 for Procedural Generalization by Planning with Self-Supervised World Models
Figure 3 for Procedural Generalization by Planning with Self-Supervised World Models
Figure 4 for Procedural Generalization by Planning with Self-Supervised World Models
Viaarxiv icon

Learning and Planning in Complex Action Spaces

Add code
Apr 13, 2021
Figure 1 for Learning and Planning in Complex Action Spaces
Figure 2 for Learning and Planning in Complex Action Spaces
Figure 3 for Learning and Planning in Complex Action Spaces
Figure 4 for Learning and Planning in Complex Action Spaces
Viaarxiv icon

Online and Offline Reinforcement Learning by Planning with a Learned Model

Add code
Apr 13, 2021
Figure 1 for Online and Offline Reinforcement Learning by Planning with a Learned Model
Figure 2 for Online and Offline Reinforcement Learning by Planning with a Learned Model
Figure 3 for Online and Offline Reinforcement Learning by Planning with a Learned Model
Figure 4 for Online and Offline Reinforcement Learning by Planning with a Learned Model
Viaarxiv icon

Local Search for Policy Iteration in Continuous Control

Add code
Oct 12, 2020
Figure 1 for Local Search for Policy Iteration in Continuous Control
Figure 2 for Local Search for Policy Iteration in Continuous Control
Figure 3 for Local Search for Policy Iteration in Continuous Control
Figure 4 for Local Search for Policy Iteration in Continuous Control
Viaarxiv icon

Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model

Add code
Nov 19, 2019
Figure 1 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Figure 2 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Figure 3 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Figure 4 for Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Viaarxiv icon