Picture for Yaoyu Zhang

Yaoyu Zhang

Local Linear Recovery Guarantee of Deep Neural Networks at Overparameterization

Add code
Jun 26, 2024
Viaarxiv icon

Geometry of Critical Sets and Existence of Saddle Branches for Two-layer Neural Networks

Add code
May 26, 2024
Viaarxiv icon

A rationale from frequency perspective for grokking in training neural network

Add code
May 24, 2024
Viaarxiv icon

Towards Understanding How Transformer Perform Multi-step Reasoning with Matching Operation

Add code
May 24, 2024
Figure 1 for Towards Understanding How Transformer Perform Multi-step Reasoning with Matching Operation
Figure 2 for Towards Understanding How Transformer Perform Multi-step Reasoning with Matching Operation
Figure 3 for Towards Understanding How Transformer Perform Multi-step Reasoning with Matching Operation
Figure 4 for Towards Understanding How Transformer Perform Multi-step Reasoning with Matching Operation
Viaarxiv icon

Disentangle Sample Size and Initialization Effect on Perfect Generalization for Single-Neuron Target

Add code
May 22, 2024
Viaarxiv icon

Connectivity Shapes Implicit Regularization in Matrix Factorization Models for Matrix Completion

Add code
May 22, 2024
Viaarxiv icon

Initialization is Critical to Whether Transformers Fit Composite Functions by Inference or Memorizing

Add code
May 08, 2024
Viaarxiv icon

Structure and Gradient Dynamics Near Global Minima of Two-layer Neural Networks

Add code
Sep 01, 2023
Viaarxiv icon

Optimistic Estimate Uncovers the Potential of Nonlinear Models

Add code
Jul 18, 2023
Viaarxiv icon

Linear Stability Hypothesis and Rank Stratification for Nonlinear Models

Add code
Nov 21, 2022
Viaarxiv icon