Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ozgur Martin

Bolstering Stochastic Gradient Descent with Model Building

Nov 13, 2021

S. Ilker Birbil, Ozgur Martin, Gonenc Onay, Figen Oztoprak

Figure 1 for Bolstering Stochastic Gradient Descent with Model Building

Figure 2 for Bolstering Stochastic Gradient Descent with Model Building

Figure 3 for Bolstering Stochastic Gradient Descent with Model Building

Figure 4 for Bolstering Stochastic Gradient Descent with Model Building

Abstract:Stochastic gradient descent method and its variants constitute the core optimization algorithms that achieve good convergence rates for solving machine learning problems. These rates are obtained especially when these algorithms are fine-tuned for the application at hand. Although this tuning process can require large computational costs, recent work has shown that these costs can be reduced by line search methods that iteratively adjust the stepsize. We propose an alternative approach to stochastic line search by using a new algorithm based on forward step model building. This model building step incorporates a second-order information that allows adjusting not only the stepsize but also the search direction. Noting that deep learning model parameters come in groups (layers of tensors), our method builds its model and calculates a new step for each parameter group. This novel diagonalization approach makes the selected step lengths adaptive. We provide convergence rate analysis, and experimentally show that the proposed algorithm achieves faster convergence and better generalization in most problems. Moreover, our experiments show that the proposed method is quite robust as it converges for a wide range of initial stepsizes.

Via

Access Paper or Ask Questions