Picture for Jinping Zou

Jinping Zou

Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training

Add code
Jan 13, 2025
Figure 1 for Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training
Figure 2 for Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training
Figure 3 for Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training
Figure 4 for Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training
Viaarxiv icon

Sharpness-Aware Minimization with Adaptive Regularization for Training Deep Neural Networks

Add code
Dec 22, 2024
Figure 1 for Sharpness-Aware Minimization with Adaptive Regularization for Training Deep Neural Networks
Figure 2 for Sharpness-Aware Minimization with Adaptive Regularization for Training Deep Neural Networks
Figure 3 for Sharpness-Aware Minimization with Adaptive Regularization for Training Deep Neural Networks
Figure 4 for Sharpness-Aware Minimization with Adaptive Regularization for Training Deep Neural Networks
Viaarxiv icon