Picture for Reza Babanezhad

Reza Babanezhad

Fast Convergence of Softmax Policy Mirror Ascent

Add code
Nov 18, 2024
Viaarxiv icon

Noise-adaptive (Accelerated) Stochastic Heavy-Ball Momentum

Add code
Jan 12, 2024
Viaarxiv icon

Fast Online Node Labeling for Very Large Graphs

Add code
May 25, 2023
Viaarxiv icon

Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees

Add code
May 24, 2023
Viaarxiv icon

Target-based Surrogates for Stochastic Optimization

Add code
Feb 06, 2023
Viaarxiv icon

Towards Painless Policy Optimization for Constrained MDPs

Add code
Apr 11, 2022
Figure 1 for Towards Painless Policy Optimization for Constrained MDPs
Figure 2 for Towards Painless Policy Optimization for Constrained MDPs
Figure 3 for Towards Painless Policy Optimization for Constrained MDPs
Figure 4 for Towards Painless Policy Optimization for Constrained MDPs
Viaarxiv icon

Towards Noise-adaptive, Problem-adaptive Stochastic Gradient Descent

Add code
Oct 21, 2021
Figure 1 for Towards Noise-adaptive, Problem-adaptive Stochastic Gradient Descent
Viaarxiv icon

SVRG Meets AdaGrad: Painless Variance Reduction

Add code
Feb 18, 2021
Figure 1 for SVRG Meets AdaGrad: Painless Variance Reduction
Figure 2 for SVRG Meets AdaGrad: Painless Variance Reduction
Figure 3 for SVRG Meets AdaGrad: Painless Variance Reduction
Figure 4 for SVRG Meets AdaGrad: Painless Variance Reduction
Viaarxiv icon

Geometry-Aware Universal Mirror-Prox

Add code
Nov 23, 2020
Viaarxiv icon

To Each Optimizer a Norm, To Each Norm its Generalization

Add code
Jun 11, 2020
Figure 1 for To Each Optimizer a Norm, To Each Norm its Generalization
Figure 2 for To Each Optimizer a Norm, To Each Norm its Generalization
Figure 3 for To Each Optimizer a Norm, To Each Norm its Generalization
Figure 4 for To Each Optimizer a Norm, To Each Norm its Generalization
Viaarxiv icon