Picture for Pierre Ménard

Pierre Ménard

OVGU

Optimal Design for Reward Modeling in RLHF

Add code
Oct 23, 2024
Viaarxiv icon

Local and adaptive mirror descents in extensive-form games

Add code
Sep 01, 2023
Viaarxiv icon

Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice

Add code
May 22, 2023
Viaarxiv icon

Learning Generative Models with Goal-conditioned Reinforcement Learning

Add code
Mar 26, 2023
Viaarxiv icon

Adapting to game trees in zero-sum imperfect information games

Add code
Dec 23, 2022
Viaarxiv icon

KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal

Add code
May 27, 2022
Figure 1 for KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal
Figure 2 for KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal
Figure 3 for KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal
Figure 4 for KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal
Viaarxiv icon

Indexed Minimum Empirical Divergence for Unimodal Bandits

Add code
Dec 02, 2021
Figure 1 for Indexed Minimum Empirical Divergence for Unimodal Bandits
Viaarxiv icon

Adaptive Multi-Goal Exploration

Add code
Nov 23, 2021
Figure 1 for Adaptive Multi-Goal Exploration
Figure 2 for Adaptive Multi-Goal Exploration
Figure 3 for Adaptive Multi-Goal Exploration
Viaarxiv icon

Problem Dependent View on Structured Thresholding Bandit Problems

Add code
Jun 18, 2021
Figure 1 for Problem Dependent View on Structured Thresholding Bandit Problems
Figure 2 for Problem Dependent View on Structured Thresholding Bandit Problems
Figure 3 for Problem Dependent View on Structured Thresholding Bandit Problems
Figure 4 for Problem Dependent View on Structured Thresholding Bandit Problems
Viaarxiv icon

Model-Free Learning for Two-Player Zero-Sum Partially Observable Markov Games with Perfect Recall

Add code
Jun 11, 2021
Figure 1 for Model-Free Learning for Two-Player Zero-Sum Partially Observable Markov Games with Perfect Recall
Viaarxiv icon