Picture for Kelly W. Zhang

Kelly W. Zhang

Tabular Foundation Models Can Do Survival Analysis

Add code
Jan 29, 2026
Viaarxiv icon

Statistical Reinforcement Learning in the Real World: A Survey of Challenges and Future Directions

Add code
Jan 21, 2026
Viaarxiv icon

Contextual Thompson Sampling via Generation of Missing Data

Add code
Feb 10, 2025
Viaarxiv icon

Impatient Bandits: Optimizing for the Long-Term Without Delay

Add code
Jan 14, 2025
Figure 1 for Impatient Bandits: Optimizing for the Long-Term Without Delay
Figure 2 for Impatient Bandits: Optimizing for the Long-Term Without Delay
Figure 3 for Impatient Bandits: Optimizing for the Long-Term Without Delay
Figure 4 for Impatient Bandits: Optimizing for the Long-Term Without Delay
Viaarxiv icon

A Deployed Online Reinforcement Learning Algorithm In An Oral Health Clinical Trial

Add code
Sep 03, 2024
Figure 1 for A Deployed Online Reinforcement Learning Algorithm In An Oral Health Clinical Trial
Figure 2 for A Deployed Online Reinforcement Learning Algorithm In An Oral Health Clinical Trial
Figure 3 for A Deployed Online Reinforcement Learning Algorithm In An Oral Health Clinical Trial
Figure 4 for A Deployed Online Reinforcement Learning Algorithm In An Oral Health Clinical Trial
Viaarxiv icon

Oralytics Reinforcement Learning Algorithm

Add code
Jun 19, 2024
Figure 1 for Oralytics Reinforcement Learning Algorithm
Figure 2 for Oralytics Reinforcement Learning Algorithm
Figure 3 for Oralytics Reinforcement Learning Algorithm
Figure 4 for Oralytics Reinforcement Learning Algorithm
Viaarxiv icon

The Fallacy of Minimizing Local Regret in the Sequential Task Setting

Add code
Mar 16, 2024
Figure 1 for The Fallacy of Minimizing Local Regret in the Sequential Task Setting
Viaarxiv icon

Monitoring Fidelity of Online Reinforcement Learning Algorithms in Clinical Trials

Add code
Feb 26, 2024
Figure 1 for Monitoring Fidelity of Online Reinforcement Learning Algorithms in Clinical Trials
Viaarxiv icon

Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care

Add code
Aug 15, 2022
Figure 1 for Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care
Figure 2 for Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care
Figure 3 for Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care
Viaarxiv icon

A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes

Add code
Jul 30, 2022
Figure 1 for A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes
Figure 2 for A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes
Figure 3 for A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes
Figure 4 for A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes
Viaarxiv icon