Robert Denkert

Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching

Apr 30, 2024
Via arXiv