Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Stefana Anita

On the Convergence Rate of the Stochastic Gradient Descent and application to a modified policy gradient for the Multi Armed Bandit

Feb 09, 2024

Stefana Anita, Gabriel Turinici

Abstract:We present a self-contained proof of the convergence rate of the Stochastic Gradient Descent (SGD) when the learning rate follows an inverse time decays schedule; we next apply the results to the convergence of a modified form of policy gradient Multi-Armed Bandit (MAB) with $L2$ regularization.

Via

Access Paper or Ask Questions