Picture for Olivier Lepel

Olivier Lepel

Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning

Add code
Oct 03, 2024
Viaarxiv icon