Picture for Lei Ying

Lei Ying

Achieving O(1/N) Optimality Gap in Restless Bandits through Diffusion Approximation

Add code
Oct 19, 2024
Viaarxiv icon

Zeroth-Order Policy Gradient for Reinforcement Learning from Human Feedback without Reward Inference

Add code
Sep 25, 2024
Viaarxiv icon

Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence

Add code
May 23, 2024
Viaarxiv icon

Learning-Based Pricing and Matching for Two-Sided Queues

Add code
Mar 17, 2024
Viaarxiv icon

Cost Aware Best Arm Identification

Add code
Feb 26, 2024
Viaarxiv icon

Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration

Add code
Dec 22, 2023
Viaarxiv icon

Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs

Add code
Sep 27, 2023
Viaarxiv icon

Fast and Regret Optimal Best Arm Identification: Fundamental Limits and Low-Complexity Algorithms

Add code
Sep 01, 2023
Viaarxiv icon

Reconstructing Graph Diffusion History from a Single Snapshot

Add code
Jun 04, 2023
Viaarxiv icon

Sample Efficient Reinforcement Learning in Mixed Systems through Augmented Samples and Its Applications to Queueing Networks

Add code
May 25, 2023
Viaarxiv icon