Hubert Soyer

Can foundation models actively gather information in interactive environments to test hypotheses?

Dec 09, 2024

Hierarchical Reinforcement Learning in Complex 3D Environments

Feb 28, 2023

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control

Sep 26, 2019

Making Efficient Use of Demonstrations to Solve Hard Exploration Problems

Sep 03, 2019

Multi-task Deep Reinforcement Learning with PopArt

Sep 12, 2018

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

Jun 28, 2018

Low-pass Recurrent Neural Networks - A memory architecture for longer-term correlation discovery

May 13, 2018

Grounded Language Learning in a Simulated 3D World

Jun 26, 2017

Learning to reinforcement learn

Jan 23, 2017

Learning to Navigate in Complex Environments

Jan 13, 2017