Picture for Fabio Viola

Fabio Viola

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Podracer architectures for scalable Reinforcement Learning

Add code
Apr 13, 2021
Figure 1 for Podracer architectures for scalable Reinforcement Learning
Figure 2 for Podracer architectures for scalable Reinforcement Learning
Figure 3 for Podracer architectures for scalable Reinforcement Learning
Figure 4 for Podracer architectures for scalable Reinforcement Learning
Viaarxiv icon

Muesli: Combining Improvements in Policy Optimization

Add code
Apr 13, 2021
Figure 1 for Muesli: Combining Improvements in Policy Optimization
Figure 2 for Muesli: Combining Improvements in Policy Optimization
Figure 3 for Muesli: Combining Improvements in Policy Optimization
Figure 4 for Muesli: Combining Improvements in Policy Optimization
Viaarxiv icon

Counterfactual Credit Assignment in Model-Free Reinforcement Learning

Add code
Nov 18, 2020
Figure 1 for Counterfactual Credit Assignment in Model-Free Reinforcement Learning
Figure 2 for Counterfactual Credit Assignment in Model-Free Reinforcement Learning
Figure 3 for Counterfactual Credit Assignment in Model-Free Reinforcement Learning
Figure 4 for Counterfactual Credit Assignment in Model-Free Reinforcement Learning
Viaarxiv icon

On the role of planning in model-based deep reinforcement learning

Add code
Nov 08, 2020
Figure 1 for On the role of planning in model-based deep reinforcement learning
Figure 2 for On the role of planning in model-based deep reinforcement learning
Figure 3 for On the role of planning in model-based deep reinforcement learning
Figure 4 for On the role of planning in model-based deep reinforcement learning
Viaarxiv icon

Neural Communication Systems with Bandwidth-limited Channel

Add code
Apr 01, 2020
Figure 1 for Neural Communication Systems with Bandwidth-limited Channel
Figure 2 for Neural Communication Systems with Bandwidth-limited Channel
Figure 3 for Neural Communication Systems with Bandwidth-limited Channel
Figure 4 for Neural Communication Systems with Bandwidth-limited Channel
Viaarxiv icon

Value-driven Hindsight Modelling

Add code
Feb 19, 2020
Figure 1 for Value-driven Hindsight Modelling
Figure 2 for Value-driven Hindsight Modelling
Figure 3 for Value-driven Hindsight Modelling
Figure 4 for Value-driven Hindsight Modelling
Viaarxiv icon

Causally Correct Partial Models for Reinforcement Learning

Add code
Feb 07, 2020
Figure 1 for Causally Correct Partial Models for Reinforcement Learning
Figure 2 for Causally Correct Partial Models for Reinforcement Learning
Figure 3 for Causally Correct Partial Models for Reinforcement Learning
Figure 4 for Causally Correct Partial Models for Reinforcement Learning
Viaarxiv icon

TF-Replicator: Distributed Machine Learning for Researchers

Add code
Feb 01, 2019
Figure 1 for TF-Replicator: Distributed Machine Learning for Researchers
Figure 2 for TF-Replicator: Distributed Machine Learning for Researchers
Figure 3 for TF-Replicator: Distributed Machine Learning for Researchers
Figure 4 for TF-Replicator: Distributed Machine Learning for Researchers
Viaarxiv icon