Xihong Su

Stationary Policies are Optimal in Risk-averse Total-reward MDPs with EVaR

Aug 30, 2024
Via arXiv

Solving Multi-Model MDPs by Coordinate Ascent and Dynamic Programming

Jul 08, 2024
Via arXiv