Title: Recycling Model Updates in Federated Learning: Are Gradient Subspaces Low-Rank?

Feb 01, 2022

Sheikh Shams Azam, Seyyedali Hosseinalipour, Qiang Qiu, Christopher Brinton

Abstract: In this paper, we question the rationale behind propagating large numbers of parameters through a distributed system during federated learning. We start by examining the rank characteristics of the subspace spanned by gradients across epochs (i.e., the gradient-space) in centralized model training, and observe that this gradient-space often consists of a few leading principal components accounting for an overwhelming majority (95-99%) of the explained variance. Motivated by this, we propose the "Look-back Gradient Multiplier" (LBGM) algorithm, which exploits this low-rank property to enable gradient recycling between model update rounds of federated learning, reducing transmissions of large parameters to single scalars for aggregation. We analytically characterize the convergence behavior of LBGM, revealing the nature of the trade-off between communication savings and model performance. Our subsequent experimental results demonstrate the improvement LBGM obtains in communication overhead compared to conventional federated learning on several datasets and deep learning models. Additionally, we show that LBGM is a general plug-and-play algorithm that can be used standalone or stacked on top of existing sparsification techniques for distributed model training.
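The two ideas in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `components_for_variance` and `lookback_projection`, the mean-centering step, and the synthetic data are all assumptions made for illustration. The first function counts how many principal components of a stack of per-epoch gradients are needed to reach a given share of explained variance. The second shows one plausible way a new gradient could be summarized by a single scalar relative to a previously transmitted gradient, together with an error measure for that approximation.

```python
import numpy as np


def components_for_variance(gradients, threshold=0.95):
    """Count principal components needed to reach `threshold` explained variance.

    `gradients` has shape (num_epochs, num_params): one flattened gradient per row.
    Centering the rows is an assumption borrowed from standard PCA.
    """
    G = np.asarray(gradients, dtype=float)
    G = G - G.mean(axis=0, keepdims=True)
    singular_values = np.linalg.svd(G, compute_uv=False)
    explained = singular_values**2 / np.sum(singular_values**2)
    cumulative = np.cumsum(explained)
    # First index whose cumulative share meets the threshold, as a count.
    return int(np.searchsorted(cumulative, threshold) + 1)


def lookback_projection(grad, lookback_grad):
    """Project `grad` onto `lookback_grad` (hypothetical helper).

    Returns the scalar multiplier rho, so that rho * lookback_grad approximates
    grad, and sin^2 of the angle between the two vectors as an error measure.
    """
    dot = float(grad @ lookback_grad)
    rho = dot / float(lookback_grad @ lookback_grad)
    cos = dot / (np.linalg.norm(grad) * np.linalg.norm(lookback_grad))
    return rho, 1.0 - cos**2


# Synthetic example: 40 "epochs" of 500-dimensional gradients that lie close
# to a 2-dimensional subspace, plus a small amount of noise.
rng = np.random.default_rng(0)
basis = rng.normal(size=(2, 500))
coeffs = rng.normal(size=(40, 2))
G = coeffs @ basis + 1e-3 * rng.normal(size=(40, 500))

num_components = components_for_variance(G, threshold=0.99)
rho, error = lookback_projection(G[1], G[0])
print(num_components, rho, error)
```

In this sketch a small `error` would mean the scalar `rho` alone captures most of the new gradient, which is the situation in which sending one number instead of the full parameter vector loses little; how the paper thresholds this error and when it refreshes the stored gradient are not specified here.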

* In Proceedings of the 10th International Conference on Learning Representations (ICLR) 2022

View paper on OpenReview