Y. J. Ma

Regret Bounds for Risk-Sensitive Reinforcement Learning

Oct 11, 2022

O. Bastani, Y. J. Ma, E. Shen, W. Xu


Abstract: In safety-critical applications of reinforcement learning, such as healthcare and robotics, it is often desirable to optimize risk-sensitive objectives that account for tail outcomes rather than expected reward. We prove the first regret bounds for reinforcement learning under a general class of risk-sensitive objectives including the popular CVaR objective. Our theory is based on a novel characterization of the CVaR objective as well as a novel optimistic MDP construction.
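For readers unfamiliar with the CVaR objective mentioned in the abstract, the following is a minimal illustrative sketch, not the paper's algorithm. It assumes the common lower-tail convention for returns, in which CVaR at level alpha is the average of the worst alpha-fraction of outcomes, and it estimates that quantity from a finite sample. The function name `empirical_cvar` and the sample data are hypothetical.

```python
import math


def empirical_cvar(returns, alpha):
    """Estimate lower-tail CVaR from sampled returns.

    Assumption (not taken from the paper): CVaR at level alpha is
    approximated by the mean of the worst ceil(alpha * n) samples.
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    if not returns:
        raise ValueError("returns must be non-empty")
    ordered = sorted(returns)  # ascending, so the worst outcomes come first
    k = max(1, math.ceil(alpha * len(ordered)))  # size of the lower tail
    tail = ordered[:k]
    return sum(tail) / k


# Hypothetical episode returns from some policy.
sample_returns = [10, -5, 3, 0, 8, -20, 4, 6, 1, 2]

mean_return = sum(sample_returns) / len(sample_returns)
cvar_20 = empirical_cvar(sample_returns, 0.2)
print("expected return:", mean_return)
print("CVaR at alpha = 0.2:", cvar_20)
```

With alpha = 1 the estimate reduces to the ordinary sample mean, while smaller values of alpha weight only the worst outcomes. This is why a policy with a good expected return can still score poorly under CVaR.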
