Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:EF21 with Bells & Whistles: Practical Algorithmic Extensions of Modern Error Feedback

Oct 07, 2021

Ilyas Fatkhullin, Igor Sokolov, Eduard Gorbunov, Zhize Li, Peter Richtárik

Figure 1 for EF21 with Bells & Whistles: Practical Algorithmic Extensions of Modern Error Feedback

Figure 2 for EF21 with Bells & Whistles: Practical Algorithmic Extensions of Modern Error Feedback

Figure 3 for EF21 with Bells & Whistles: Practical Algorithmic Extensions of Modern Error Feedback

Figure 4 for EF21 with Bells & Whistles: Practical Algorithmic Extensions of Modern Error Feedback

Share this with someone who'll enjoy it:

Abstract:First proposed by Seide (2014) as a heuristic, error feedback (EF) is a very popular mechanism for enforcing convergence of distributed gradient-based optimization methods enhanced with communication compression strategies based on the application of contractive compression operators. However, existing theory of EF relies on very strong assumptions (e.g., bounded gradients), and provides pessimistic convergence rates (e.g., while the best known rate for EF in the smooth nonconvex regime, and when full gradients are compressed, is $O(1/T^{2/3})$, the rate of gradient descent in the same regime is $O(1/T)$). Recently, Richt\'{a}rik et al. (2021) proposed a new error feedback mechanism, EF21, based on the construction of a Markov compressor induced by a contractive compressor. EF21 removes the aforementioned theoretical deficiencies of EF and at the same time works better in practice. In this work we propose six practical extensions of EF21, all supported by strong convergence theory: partial participation, stochastic approximation, variance reduction, proximal setting, momentum and bidirectional compression. Several of these techniques were never analyzed in conjunction with EF before, and in cases where they were (e.g., bidirectional compression), our rates are vastly superior.

* 71 pages, 7 algorithms, 18 Lemmas, 13 Theorems, 18 Corollaries, 6 Tables, 7 Figures

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:EF21 with Bells & Whistles: Practical Algorithmic Extensions of Modern Error Feedback

Paper and Code