Yichun Hu

Contextual Linear Optimization with Bandit Feedback

May 26, 2024
Via arXiv

Practical Policy Optimization with Personalized Experimentation

Mar 30, 2023
Via arXiv

Fast Rates for the Regret of Offline Reinforcement Learning

Jan 31, 2021
Via arXiv

Fast Rates for Contextual Linear Optimization

Nov 05, 2020
Via arXiv

DTR Bandit: Learning to Make Response-Adaptive Decisions With Low Regret

Jun 05, 2020
Via arXiv

Smooth Contextual Bandits: Bridging the Parametric and Non-differentiable Regret Regimes

Sep 05, 2019
Via arXiv