Claire Vernade

L2S

Quantization-Free Autoregressive Action Transformer

Mar 18, 2025

Clustered KL-barycenter design for policy evaluation

Mar 04, 2025

Efficient Risk-sensitive Planning via Entropic Risk Measures

Feb 27, 2025

Variational Bayes Portfolio Construction

Nov 09, 2024

Online Decision Deferral under Budget Constraints

Sep 30, 2024

A Pontryagin Perspective on Reinforcement Learning

May 28, 2024

Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits

Feb 08, 2024

Beyond Average Return in Markov Decision Processes

Oct 31, 2023

POMRL: No-Regret Learning-to-Plan with Increasing Horizons

Dec 30, 2022

Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms

Mar 13, 2022