Xiaoge Deng

Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training

Jan 13, 2025
Via arXiv

Sharpness-Aware Minimization with Adaptive Regularization for Training Deep Neural Networks

Dec 22, 2024
Via arXiv

Federated Prediction-Powered Inference from Decentralized Data

Sep 03, 2024
Via arXiv

Score-based Generative Models with Adaptive Momentum

May 22, 2024
Via arXiv

Accelerating Federated Learning by Selecting Beneficial Herd of Local Gradients

Mar 25, 2024
Via arXiv

Towards Understanding the Generalizability of Delayed Stochastic Gradient Descent

Aug 18, 2023
Via arXiv

S2 Reducer: High-Performance Sparse Communication to Accelerate Distributed Deep Learning

Oct 05, 2021
Via arXiv