Stefana Anita

On the Convergence Rate of the Stochastic Gradient Descent and application to a modified policy gradient for the Multi Armed Bandit

Feb 09, 2024
Via arXiv