Xiangcheng Zhang

ID policy (with reassignment) is asymptotically optimal for heterogeneous weakly-coupled MDPs

Feb 09, 2025
Via arXiv

Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation

Feb 28, 2024
Via arXiv

Improved Regret Bounds for Linear Adversarial MDPs via Linear Optimization

Feb 14, 2023
Via arXiv