Picture for Satyen Kale

Satyen Kale

Google AI

Eager Updates For Overlapped Communication and Computation in DiLoCo

Add code
Feb 18, 2025
Viaarxiv icon

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Add code
Jan 30, 2025
Viaarxiv icon

Stacking as Accelerated Gradient Descent

Add code
Mar 08, 2024
Figure 1 for Stacking as Accelerated Gradient Descent
Figure 2 for Stacking as Accelerated Gradient Descent
Figure 3 for Stacking as Accelerated Gradient Descent
Figure 4 for Stacking as Accelerated Gradient Descent
Viaarxiv icon

Efficient Stagewise Pretraining via Progressive Subnetworks

Add code
Feb 08, 2024
Viaarxiv icon

Asynchronous Local-SGD Training for Language Modeling

Add code
Jan 17, 2024
Figure 1 for Asynchronous Local-SGD Training for Language Modeling
Figure 2 for Asynchronous Local-SGD Training for Language Modeling
Figure 3 for Asynchronous Local-SGD Training for Language Modeling
Figure 4 for Asynchronous Local-SGD Training for Language Modeling
Viaarxiv icon

Improved Differentially Private and Lazy Online Convex Optimization

Add code
Dec 20, 2023
Viaarxiv icon

On the Convergence of Federated Averaging with Cyclic Client Participation

Add code
Feb 06, 2023
Viaarxiv icon

From Gradient Flow on Population Loss to Learning with Stochastic Gradient Descent

Add code
Oct 13, 2022
Viaarxiv icon

Private Matrix Approximation and Geometry of Unitary Orbits

Add code
Jul 06, 2022
Viaarxiv icon

Beyond Uniform Lipschitz Condition in Differentially Private Optimization

Add code
Jun 21, 2022
Figure 1 for Beyond Uniform Lipschitz Condition in Differentially Private Optimization
Figure 2 for Beyond Uniform Lipschitz Condition in Differentially Private Optimization
Figure 3 for Beyond Uniform Lipschitz Condition in Differentially Private Optimization
Figure 4 for Beyond Uniform Lipschitz Condition in Differentially Private Optimization
Viaarxiv icon