Remi Tachet des Combes

Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning

Oct 26, 2023

Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information

Oct 31, 2022

Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning

Jun 02, 2022

Non-Markovian policies occupancy measures

May 27, 2022

On the Regularity of Attention

Feb 10, 2021

A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms

Oct 02, 2020

A Mathematical Theory of Attention

Jul 06, 2020

Deep Reinforcement and InfoMax Learning

Jun 12, 2020

Domain Adaptation with Conditional Distribution Matching and Generalized Label Shift

Mar 10, 2020

A Reduction from Reinforcement Learning to No-Regret Online Learning

Jan 01, 2020