Picture for Remi Tachet

Remi Tachet

Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms

Add code
Feb 15, 2022
Figure 1 for Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms
Figure 2 for Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms
Figure 3 for Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms
Figure 4 for Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms
Viaarxiv icon

On the Chattering of SARSA with Linear Function Approximation

Add code
Feb 14, 2022
Viaarxiv icon

Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch

Add code
Nov 04, 2021
Figure 1 for Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch
Figure 2 for Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch
Figure 3 for Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch
Viaarxiv icon

Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates

Add code
Sep 29, 2021
Figure 1 for Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates
Figure 2 for Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates
Figure 3 for Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates
Figure 4 for Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates
Viaarxiv icon

Decomposed Mutual Information Estimation for Contrastive Representation Learning

Add code
Jun 25, 2021
Figure 1 for Decomposed Mutual Information Estimation for Contrastive Representation Learning
Figure 2 for Decomposed Mutual Information Estimation for Contrastive Representation Learning
Figure 3 for Decomposed Mutual Information Estimation for Contrastive Representation Learning
Figure 4 for Decomposed Mutual Information Estimation for Contrastive Representation Learning
Viaarxiv icon

Reinforcement Learning Framework for Deep Brain Stimulation Study

Add code
Feb 22, 2020
Figure 1 for Reinforcement Learning Framework for Deep Brain Stimulation Study
Figure 2 for Reinforcement Learning Framework for Deep Brain Stimulation Study
Figure 3 for Reinforcement Learning Framework for Deep Brain Stimulation Study
Figure 4 for Reinforcement Learning Framework for Deep Brain Stimulation Study
Viaarxiv icon

Robust Natural Language Inference Models with Example Forgetting

Add code
Nov 10, 2019
Figure 1 for Robust Natural Language Inference Models with Example Forgetting
Figure 2 for Robust Natural Language Inference Models with Example Forgetting
Figure 3 for Robust Natural Language Inference Models with Example Forgetting
Figure 4 for Robust Natural Language Inference Models with Example Forgetting
Viaarxiv icon