Picture for Zheng-An Chen

Zheng-An Chen

From Condensation to Rank Collapse: A Two-Stage Analysis of Transformer Training Dynamics

Add code
Oct 08, 2025
Viaarxiv icon

On Multi-Stage Loss Dynamics in Neural Networks: Mechanisms of Plateau and Descent Stages

Add code
Nov 06, 2024
Viaarxiv icon

Analyzing Multi-Stage Loss Curve: Plateau and Descent Mechanisms in Neural Networks

Add code
Oct 26, 2024
Viaarxiv icon

On the dynamics of three-layer neural networks: initial condensation

Add code
Feb 27, 2024
Viaarxiv icon