Picture for Bernardo Avila Pires

Bernardo Avila Pires

University of Alberta

A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning

Add code
Jun 04, 2024
Viaarxiv icon

Offline Regularised Reinforcement Learning for Large Language Models Alignment

Add code
May 29, 2024
Viaarxiv icon

Human Alignment of Large Language Models through Online Preference Optimisation

Add code
Mar 13, 2024
Viaarxiv icon

Understanding plasticity in neural networks

Add code
Mar 02, 2023
Viaarxiv icon

Hierarchical Reinforcement Learning in Complex 3D Environments

Add code
Feb 28, 2023
Viaarxiv icon

BYOL-Explore: Exploration by Bootstrapped Prediction

Add code
Jun 16, 2022
Figure 1 for BYOL-Explore: Exploration by Bootstrapped Prediction
Figure 2 for BYOL-Explore: Exploration by Bootstrapped Prediction
Figure 3 for BYOL-Explore: Exploration by Bootstrapped Prediction
Figure 4 for BYOL-Explore: Exploration by Bootstrapped Prediction
Viaarxiv icon

Neural Recursive Belief States in Multi-Agent Reinforcement Learning

Add code
Feb 03, 2021
Figure 1 for Neural Recursive Belief States in Multi-Agent Reinforcement Learning
Figure 2 for Neural Recursive Belief States in Multi-Agent Reinforcement Learning
Figure 3 for Neural Recursive Belief States in Multi-Agent Reinforcement Learning
Figure 4 for Neural Recursive Belief States in Multi-Agent Reinforcement Learning
Viaarxiv icon

Geometric Entropic Exploration

Add code
Jan 07, 2021
Figure 1 for Geometric Entropic Exploration
Figure 2 for Geometric Entropic Exploration
Figure 3 for Geometric Entropic Exploration
Figure 4 for Geometric Entropic Exploration
Viaarxiv icon

Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning

Add code
Jun 13, 2020
Figure 1 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Figure 2 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Figure 3 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Figure 4 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Viaarxiv icon

Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning

Add code
Apr 30, 2020
Figure 1 for Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Figure 2 for Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Figure 3 for Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Figure 4 for Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Viaarxiv icon