Picture for David Silver

David Silver

University College London

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning

Add code
Jun 30, 2022
Figure 1 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Figure 2 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Figure 3 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Figure 4 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Viaarxiv icon

Self-Consistent Models and Values

Add code
Oct 25, 2021
Figure 1 for Self-Consistent Models and Values
Figure 2 for Self-Consistent Models and Values
Figure 3 for Self-Consistent Models and Values
Figure 4 for Self-Consistent Models and Values
Viaarxiv icon

Bootstrapped Meta-Learning

Add code
Sep 09, 2021
Figure 1 for Bootstrapped Meta-Learning
Figure 2 for Bootstrapped Meta-Learning
Figure 3 for Bootstrapped Meta-Learning
Figure 4 for Bootstrapped Meta-Learning
Viaarxiv icon

The Option Keyboard: Combining Skills in Reinforcement Learning

Add code
Jun 24, 2021
Figure 1 for The Option Keyboard: Combining Skills in Reinforcement Learning
Figure 2 for The Option Keyboard: Combining Skills in Reinforcement Learning
Figure 3 for The Option Keyboard: Combining Skills in Reinforcement Learning
Figure 4 for The Option Keyboard: Combining Skills in Reinforcement Learning
Viaarxiv icon

Proper Value Equivalence

Add code
Jun 18, 2021
Figure 1 for Proper Value Equivalence
Figure 2 for Proper Value Equivalence
Figure 3 for Proper Value Equivalence
Figure 4 for Proper Value Equivalence
Viaarxiv icon

Learning and Planning in Complex Action Spaces

Add code
Apr 13, 2021
Figure 1 for Learning and Planning in Complex Action Spaces
Figure 2 for Learning and Planning in Complex Action Spaces
Figure 3 for Learning and Planning in Complex Action Spaces
Figure 4 for Learning and Planning in Complex Action Spaces
Viaarxiv icon

Online and Offline Reinforcement Learning by Planning with a Learned Model

Add code
Apr 13, 2021
Figure 1 for Online and Offline Reinforcement Learning by Planning with a Learned Model
Figure 2 for Online and Offline Reinforcement Learning by Planning with a Learned Model
Figure 3 for Online and Offline Reinforcement Learning by Planning with a Learned Model
Figure 4 for Online and Offline Reinforcement Learning by Planning with a Learned Model
Viaarxiv icon

Muesli: Combining Improvements in Policy Optimization

Add code
Apr 13, 2021
Figure 1 for Muesli: Combining Improvements in Policy Optimization
Figure 2 for Muesli: Combining Improvements in Policy Optimization
Figure 3 for Muesli: Combining Improvements in Policy Optimization
Figure 4 for Muesli: Combining Improvements in Policy Optimization
Viaarxiv icon