
Zhengyuan Zhou

Learning an Optimal Assortment Policy under Observational Data

Feb 10, 2025
Via arXiv

Nonconvex Stochastic Optimization under Heavy-Tailed Noises: Optimal Convergence without Gradient Clipping

Dec 27, 2024
Via arXiv

Distributionally Robust Policy Learning under Concept Drifts

Dec 18, 2024
Via arXiv

Statistical Learning of Distributionally Robust Stochastic Control in Continuous State Spaces

Jun 17, 2024
Via arXiv

Adaptively Learning to Select-Rank in Online Platforms

Jun 07, 2024
Via arXiv

Simulation-Based Benchmarking of Reinforcement Learning Agents for Personalized Retail Promotions

May 16, 2024
Via arXiv

On the Last-Iterate Convergence of Shuffling Gradient Methods

Mar 12, 2024
Via arXiv

Stochastic contextual bandits with graph feedback: from independence number to MAS number

Feb 12, 2024
Via arXiv

Revisiting the Last-Iterate Convergence of Stochastic Gradient Methods

Dec 13, 2023
Via arXiv

On the Foundation of Distributionally Robust Reinforcement Learning

Nov 15, 2023
Via arXiv