We consider machine learning applications that train a model by leveraging data distributed over a network, where communication constraints can create a performance bottleneck. A number of recent approaches propose to overcome this bottleneck by compressing gradient updates. However, as models become larger, so does the size of the gradient updates. In this paper, we propose an alternative approach that quantizes data instead of gradients, and can therefore support learning in applications where the size of gradient updates is prohibitive. Our approach combines aspects of: (1) sample selection; (2) dataset quantization; and (3) gradient compensation. We analyze the convergence of the proposed approach for smooth convex and non-convex objective functions and show that it achieves order-optimal convergence rates with communication that depends primarily on the data dimension rather than the model (gradient) dimension. We use the proposed algorithm to train ResNet models on the CIFAR-10 and ImageNet datasets, and show that it achieves an order-of-magnitude saving in communication over gradient compression methods.
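The abstract does not spell out the algorithm, so the following is only a toy, single-process sketch of how the three named ingredients (sample selection, dataset quantization, gradient compensation) could fit together for a linear least-squares model. The functions `quantize`, `select_samples`, and `train_round`, and the error-feedback-style compensation term, are illustrative assumptions, not the paper's actual method.

```python
import numpy as np

def quantize(x, num_levels=16):
    # Uniform scalar quantization over the array's range (dataset quantization).
    lo, hi = x.min(), x.max()
    if hi == lo:
        return x.copy()
    step = (hi - lo) / (num_levels - 1)
    return lo + np.round((x - lo) / step) * step

def select_samples(X, y, batch_size, rng):
    # (1) sample selection: only a subset of the local data is used per round.
    idx = rng.choice(len(X), size=batch_size, replace=False)
    return X[idx], y[idx]

def train_round(w, X, y, lr, residual, rng, batch_size=32):
    # One round of least-squares training on (2) quantized data, with
    # (3) gradient compensation sketched as error feedback: the data holder keeps
    # a residual of the quantization-induced gradient error and sends a coarsely
    # quantized correction alongside the quantized samples (an assumption here).
    Xb, yb = select_samples(X, y, batch_size, rng)
    Xq = quantize(Xb)                                   # what gets communicated
    grad_q = Xq.T @ (Xq @ w - yb) / len(yb)             # learner-side gradient
    grad_raw = Xb.T @ (Xb @ w - yb) / len(yb)           # assumed computable at the data holder
    err = residual + (grad_raw - grad_q)                # accumulated gradient error
    correction = quantize(err, num_levels=4)            # cheap compensation message
    residual = err - correction                         # error carried to later rounds
    w = w - lr * (grad_q + correction)
    return w, residual

# Toy usage: recover a linear model from quantized, subsampled data.
rng = np.random.default_rng(0)
n, d = 1000, 10
X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
y = X @ w_true + 0.01 * rng.normal(size=n)
w, residual = np.zeros(d), np.zeros(d)
for _ in range(300):
    w, residual = train_round(w, X, y, lr=0.1, residual=residual, rng=rng)
print("parameter error:", np.linalg.norm(w - w_true))
```

Note that in this sketch the communicated objects are the quantized samples and a low-dimensional correction vector, so the per-round message size scales with the data dimension rather than the model dimension, which is the trade-off the abstract emphasizes.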