Decentralized learning is crucial for enabling on-device learning over large distributed datasets, eliminating the need for a central server. However, communication overhead remains a major bottleneck for the practical realization of such decentralized setups. To address this issue, several algorithms for decentralized training with compressed communication have been proposed in the literature. Most of these algorithms introduce an additional hyperparameter, referred to as the consensus step-size, which is tuned according to the compression ratio before training begins. In this work, we propose AdaGossip, a novel technique that adaptively adjusts the consensus step-size based on the compressed model differences between neighboring agents. We demonstrate the effectiveness of the proposed method through an exhaustive set of experiments on various computer vision datasets (CIFAR-10, CIFAR-100, Fashion MNIST, Imagenette, and ImageNet), model architectures, and network topologies. Our experiments show that the proposed method achieves superior performance ($0-2\%$ improvement in test accuracy) compared to the current state-of-the-art method for decentralized learning with communication compression.
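To make the idea of an adaptive consensus step-size concrete, the sketch below shows one possible gossip update in which the step-size is scaled per coordinate by a running estimate of the magnitude of the compressed model differences received from neighbors. This is a minimal illustration, not the paper's exact algorithm: the function name `adagossip_step`, the exponential-moving-average second-moment estimate, and all parameter names are assumptions introduced here for exposition.

```python
import numpy as np

def adagossip_step(x_i, compressed_diffs, weights, v,
                   gamma_max=1.0, beta=0.9, eps=1e-8):
    """One hypothetical gossip update with an adaptive consensus step-size.

    x_i              : this agent's flattened model parameters
    compressed_diffs : {neighbor_id: compressed estimate of (x_j - x_i)}
    weights          : {neighbor_id: mixing weight w_ij}
    v                : running second-moment estimate of received differences
    """
    # Weighted aggregate of the (compressed) model differences from neighbors.
    agg = np.zeros_like(x_i)
    for j, d in compressed_diffs.items():
        agg += weights[j] * d

    # Assumption: track the magnitude of the aggregated differences with an
    # exponential moving average of their element-wise square.
    v = beta * v + (1.0 - beta) * agg ** 2

    # Per-coordinate consensus step-size: shrink where compressed differences
    # are large or noisy, stay near gamma_max where they are small.
    gamma = np.minimum(gamma_max / (np.sqrt(v) + eps), gamma_max)

    # Gossip (consensus) update toward the neighbors' models.
    x_i = x_i + gamma * agg
    return x_i, v
```

In this sketch, coordinates whose compressed neighbor differences fluctuate strongly receive a smaller consensus step, while stable coordinates keep a step-size close to the maximum, which removes the need to hand-tune a single global consensus step-size for each compression ratio.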