Picture for Pierre Gaillard

Pierre Gaillard

Thoth

Logarithmic Regret for Unconstrained Submodular Maximization Stochastic Bandit

Add code
Oct 11, 2024
Viaarxiv icon

Minimax Adaptive Boosting for Online Nonparametric Regression

Add code
Oct 04, 2024
Viaarxiv icon

Structured Prediction in Online Learning

Add code
Jun 18, 2024
Viaarxiv icon

MetaCURL: Non-stationary Concave Utility Reinforcement Learning

Add code
May 30, 2024
Figure 1 for MetaCURL: Non-stationary Concave Utility Reinforcement Learning
Viaarxiv icon

Stop Relying on No-Choice and Do not Repeat the Moves: Optimal, Efficient and Practical Algorithms for Assortment Optimization

Add code
Feb 29, 2024
Viaarxiv icon

Covariance-Adaptive Least-Squares Algorithm for Stochastic Combinatorial Semi-Bandits

Add code
Feb 23, 2024
Viaarxiv icon

Online Learning Approach for Survival Analysis

Add code
Feb 07, 2024
Viaarxiv icon

Efficient Model-Based Concave Utility Reinforcement Learning through Greedy Mirror Descent

Add code
Nov 30, 2023
Figure 1 for Efficient Model-Based Concave Utility Reinforcement Learning through Greedy Mirror Descent
Figure 2 for Efficient Model-Based Concave Utility Reinforcement Learning through Greedy Mirror Descent
Figure 3 for Efficient Model-Based Concave Utility Reinforcement Learning through Greedy Mirror Descent
Figure 4 for Efficient Model-Based Concave Utility Reinforcement Learning through Greedy Mirror Descent
Viaarxiv icon

Adaptive approximation of monotone functions

Add code
Sep 14, 2023
Viaarxiv icon

Sequential Counterfactual Risk Minimization

Add code
Feb 23, 2023
Viaarxiv icon