Picture for Runzhe Wan

Runzhe Wan

Zero-Inflated Bandits

Add code
Dec 25, 2023
Viaarxiv icon

Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches

Add code
Dec 20, 2023
Viaarxiv icon

Robust Offline Policy Evaluation and Optimization with Heavy-Tailed Rewards

Add code
Oct 28, 2023
Viaarxiv icon

Experimentation Platforms Meet Reinforcement Learning: Bayesian Sequential Decision-Making for Continuous Monitoring

Add code
Apr 02, 2023
Viaarxiv icon

Multiplier Bootstrap-based Exploration

Add code
Feb 03, 2023
Viaarxiv icon

STEEL: Singularity-aware Reinforcement Learning

Add code
Jan 31, 2023
Viaarxiv icon

Mining the Factor Zoo: Estimation of Latent Factor Models with Sufficient Proxies

Add code
Jan 03, 2023
Viaarxiv icon

Heterogeneous Synthetic Learner for Panel Data

Add code
Dec 30, 2022
Viaarxiv icon

Safe Exploration for Efficient Policy Evaluation and Comparison

Add code
Feb 26, 2022
Figure 1 for Safe Exploration for Efficient Policy Evaluation and Comparison
Figure 2 for Safe Exploration for Efficient Policy Evaluation and Comparison
Figure 3 for Safe Exploration for Efficient Policy Evaluation and Comparison
Figure 4 for Safe Exploration for Efficient Policy Evaluation and Comparison
Viaarxiv icon

Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework

Add code
Feb 26, 2022
Figure 1 for Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework
Figure 2 for Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework
Figure 3 for Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework
Figure 4 for Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework
Viaarxiv icon