Picture for Zhanpeng Zhou

Zhanpeng Zhou

IGU-LoRA: Adaptive Rank Allocation via Integrated Gradients and Uncertainty-Aware Scoring

Add code
Mar 14, 2026
Viaarxiv icon

On the Learning Dynamics of Two-layer Linear Networks with Label Noise SGD

Add code
Mar 11, 2026
Viaarxiv icon

Fast Catch-Up, Late Switching: Optimal Batch Size Scheduling via Functional Scaling Laws

Add code
Feb 15, 2026
Viaarxiv icon

A Single Merging Suffices: Recovering Server-based Learning Performance in Decentralized Learning

Add code
Jul 09, 2025
Viaarxiv icon

On Path to Multimodal Historical Reasoning: HistBench and HistAgent

Add code
May 26, 2025
Viaarxiv icon

On the Role of Label Noise in the Feature Learning Process

Add code
May 25, 2025
Viaarxiv icon

New Evidence of the Two-Phase Learning Dynamics of Neural Networks

Add code
May 20, 2025
Figure 1 for New Evidence of the Two-Phase Learning Dynamics of Neural Networks
Figure 2 for New Evidence of the Two-Phase Learning Dynamics of Neural Networks
Figure 3 for New Evidence of the Two-Phase Learning Dynamics of Neural Networks
Figure 4 for New Evidence of the Two-Phase Learning Dynamics of Neural Networks
Viaarxiv icon

On the Cone Effect in the Learning Dynamics

Add code
Mar 20, 2025
Viaarxiv icon

The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training

Add code
Feb 26, 2025
Viaarxiv icon

Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late in Training

Add code
Oct 14, 2024
Figure 1 for Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late in Training
Figure 2 for Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late in Training
Figure 3 for Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late in Training
Figure 4 for Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late in Training
Viaarxiv icon