Benefit of deep learning with non-convex noisy gradient descent: Provable excess risk bound and superiority to kernel methods

Dec 06, 2020

View paper on arXiv