Picture for Alessandro Montenegro

Alessandro Montenegro

Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning

Add code
Jul 15, 2024
Viaarxiv icon

Learning Optimal Deterministic Policies with Stochastic Policy Gradients

Add code
May 03, 2024
Viaarxiv icon

Best Arm Identification for Stochastic Rising Bandits

Add code
Feb 15, 2023
Viaarxiv icon