Shuning Shang

Initialization Matters: On the Benign Overfitting of Two-Layer ReLU CNN with Fully Trainable Layers

Oct 24, 2024
Via arXiv

Benign Overfitting in Single-Head Attention

Oct 10, 2024
Via arXiv