Mor Shpigel Nacson

How Uniform Random Weights Induce Non-uniform Bias: Typical Interpolating Neural Networks Generalize with Narrow Teachers

Feb 09, 2024
Via arXiv

The Implicit Bias of Minima Stability in Multivariate Shallow ReLU Networks

Jun 30, 2023
Via arXiv

Gradient Descent Monotonically Decreases the Sharpness of Gradient Flow Solutions in Scalar Networks and Beyond

May 22, 2023
Via arXiv

On the Implicit Bias of Initialization Shape: Beyond Infinitesimal Mirror Descent

Feb 19, 2021
Via arXiv

At Stability's Edge: How to Adjust Hyperparameters to Preserve Minima Selection in Asynchronous Training of Neural Networks?

Sep 26, 2019
Via arXiv

Lexicographic and Depth-Sensitive Margins in Homogeneous and Non-Homogeneous Deep Models

May 17, 2019
Via arXiv

Convergence of Gradient Descent on Separable Data

Jun 12, 2018
Via arXiv

Stochastic Gradient Descent on Separable Data: Exact Convergence with a Fixed Learning Rate

Jun 05, 2018
Via arXiv

The Implicit Bias of Gradient Descent on Separable Data

Mar 21, 2018
Via arXiv