Picture for Jinping Zou

Jinping Zou

Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training

Add code
Jan 13, 2025
Viaarxiv icon

Sharpness-Aware Minimization with Adaptive Regularization for Training Deep Neural Networks

Add code
Dec 22, 2024
Viaarxiv icon