Picture for Alexander Tyurin

Alexander Tyurin

Gradient Descent as a Perceptron Algorithm: Understanding Dynamics and Implicit Acceleration

Add code
Dec 12, 2025
Viaarxiv icon

Near-Optimal Convergence of Accelerated Gradient Methods under Generalized and $(L_0, L_1)$-Smoothness

Add code
Aug 09, 2025
Viaarxiv icon

Birch SGD: A Tree Graph Framework for Local and Asynchronous SGD Methods

Add code
May 14, 2025
Viaarxiv icon

Ringmaster ASGD: The First Asynchronous SGD with Optimal Time Complexity

Add code
Jan 27, 2025
Viaarxiv icon

From Logistic Regression to the Perceptron Algorithm: Exploring Gradient Descent with Large Step Sizes

Add code
Dec 11, 2024
Viaarxiv icon

Tighter Performance Theory of FedExProx

Add code
Oct 20, 2024
Figure 1 for Tighter Performance Theory of FedExProx
Figure 2 for Tighter Performance Theory of FedExProx
Figure 3 for Tighter Performance Theory of FedExProx
Figure 4 for Tighter Performance Theory of FedExProx
Viaarxiv icon

Freya PAGE: First Optimal Time Complexity for Large-Scale Nonconvex Finite-Sum Optimization with Heterogeneous Asynchronous Computations

Add code
May 24, 2024
Figure 1 for Freya PAGE: First Optimal Time Complexity for Large-Scale Nonconvex Finite-Sum Optimization with Heterogeneous Asynchronous Computations
Figure 2 for Freya PAGE: First Optimal Time Complexity for Large-Scale Nonconvex Finite-Sum Optimization with Heterogeneous Asynchronous Computations
Figure 3 for Freya PAGE: First Optimal Time Complexity for Large-Scale Nonconvex Finite-Sum Optimization with Heterogeneous Asynchronous Computations
Figure 4 for Freya PAGE: First Optimal Time Complexity for Large-Scale Nonconvex Finite-Sum Optimization with Heterogeneous Asynchronous Computations
Viaarxiv icon

Improving the Worst-Case Bidirectional Communication Complexity for Nonconvex Distributed Optimization under Function Similarity

Add code
Feb 09, 2024
Figure 1 for Improving the Worst-Case Bidirectional Communication Complexity for Nonconvex Distributed Optimization under Function Similarity
Figure 2 for Improving the Worst-Case Bidirectional Communication Complexity for Nonconvex Distributed Optimization under Function Similarity
Figure 3 for Improving the Worst-Case Bidirectional Communication Complexity for Nonconvex Distributed Optimization under Function Similarity
Figure 4 for Improving the Worst-Case Bidirectional Communication Complexity for Nonconvex Distributed Optimization under Function Similarity
Viaarxiv icon

Shadowheart SGD: Distributed Asynchronous SGD with Optimal Time Complexity Under Arbitrary Computation and Communication Heterogeneity

Add code
Feb 07, 2024
Figure 1 for Shadowheart SGD: Distributed Asynchronous SGD with Optimal Time Complexity Under Arbitrary Computation and Communication Heterogeneity
Figure 2 for Shadowheart SGD: Distributed Asynchronous SGD with Optimal Time Complexity Under Arbitrary Computation and Communication Heterogeneity
Figure 3 for Shadowheart SGD: Distributed Asynchronous SGD with Optimal Time Complexity Under Arbitrary Computation and Communication Heterogeneity
Figure 4 for Shadowheart SGD: Distributed Asynchronous SGD with Optimal Time Complexity Under Arbitrary Computation and Communication Heterogeneity
Viaarxiv icon

Momentum Provably Improves Error Feedback!

Add code
May 24, 2023
Viaarxiv icon