Picture for Jingru Li

Jingru Li

Variance-Adaptive Muon: Accelerating LLM Pretraining with NSR-Modulated and Variance-Scaled Momentum

Add code
Jan 21, 2026
Viaarxiv icon

RainFusion: Adaptive Video Generation Acceleration via Multi-Dimensional Visual Redundancy

Add code
May 27, 2025
Viaarxiv icon

AFD-STA: Adaptive Filtering Denoising with Spatiotemporal Attention for Chaotic System Prediction

Add code
May 23, 2025
Viaarxiv icon

How to Teach: Learning Data-Free Knowledge Distillation from Curriculum

Add code
Aug 29, 2022
Figure 1 for How to Teach: Learning Data-Free Knowledge Distillation from Curriculum
Figure 2 for How to Teach: Learning Data-Free Knowledge Distillation from Curriculum
Figure 3 for How to Teach: Learning Data-Free Knowledge Distillation from Curriculum
Figure 4 for How to Teach: Learning Data-Free Knowledge Distillation from Curriculum
Viaarxiv icon