Picture for Odalric-Ambrym Maillard

Odalric-Ambrym Maillard

Scool

Kriging and Gaussian Process Interpolation for Georeferenced Data Augmentation

Add code
Jan 13, 2025
Viaarxiv icon

Provably Efficient Exploration in Reward Machines with Low Regret

Add code
Dec 26, 2024
Viaarxiv icon

Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning

Add code
Dec 19, 2024
Viaarxiv icon

How to Shrink Confidence Sets for Many Equivalent Discrete Distributions?

Add code
Jul 22, 2024
Viaarxiv icon

Power Mean Estimation in Stochastic Monte-Carlo Tree_Search

Add code
Jun 04, 2024
Viaarxiv icon

CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic Corruption

Add code
Sep 28, 2023
Viaarxiv icon

Monte-Carlo tree search with uncertainty propagation via optimal transport

Add code
Sep 19, 2023
Viaarxiv icon

AdaStop: sequential testing for efficient and reliable comparisons of Deep RL Agents

Add code
Jun 19, 2023
Viaarxiv icon

Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration and Planning

Add code
Oct 05, 2022
Figure 1 for Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration and Planning
Figure 2 for Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration and Planning
Viaarxiv icon

Risk-aware linear bandits with convex loss

Add code
Sep 15, 2022
Figure 1 for Risk-aware linear bandits with convex loss
Figure 2 for Risk-aware linear bandits with convex loss
Figure 3 for Risk-aware linear bandits with convex loss
Figure 4 for Risk-aware linear bandits with convex loss
Viaarxiv icon