Zheng-An Chen

From Condensation to Rank Collapse: A Two-Stage Analysis of Transformer Training Dynamics

Oct 08, 2025
Via arXiv

On Multi-Stage Loss Dynamics in Neural Networks: Mechanisms of Plateau and Descent Stages

Nov 06, 2024
Via arXiv

Analyzing Multi-Stage Loss Curve: Plateau and Descent Mechanisms in Neural Networks

Oct 26, 2024
Via arXiv

On the dynamics of three-layer neural networks: initial condensation

Feb 27, 2024
Via arXiv