A proof of convergence for the gradient descent optimization method with random initializations in the training of neural networks with ReLU activation for piecewise linear target functions

Add code
Aug 10, 2021

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: