Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Distributed Methods with Compressed Communication for Solving Variational Inequalities, with Theoretical Guarantees

Oct 07, 2021

Aleksandr Beznosikov, Peter Richtárik, Michael Diskin, Max Ryabinin, Alexander Gasnikov

Figure 1 for Distributed Methods with Compressed Communication for Solving Variational Inequalities, with Theoretical Guarantees

Figure 2 for Distributed Methods with Compressed Communication for Solving Variational Inequalities, with Theoretical Guarantees

Figure 3 for Distributed Methods with Compressed Communication for Solving Variational Inequalities, with Theoretical Guarantees

Figure 4 for Distributed Methods with Compressed Communication for Solving Variational Inequalities, with Theoretical Guarantees

Share this with someone who'll enjoy it:

Abstract:Variational inequalities in general and saddle point problems in particular are increasingly relevant in machine learning applications, including adversarial learning, GANs, transport and robust optimization. With increasing data and problem sizes necessary to train high performing models across these and other applications, it is necessary to rely on parallel and distributed computing. However, in distributed training, communication among the compute nodes is a key bottleneck during training, and this problem is exacerbated for high dimensional and over-parameterized models models. Due to these considerations, it is important to equip existing methods with strategies that would allow to reduce the volume of transmitted information during training while obtaining a model of comparable quality. In this paper, we present the first theoretically grounded distributed methods for solving variational inequalities and saddle point problems using compressed communication: MASHA1 and MASHA2. Our theory and methods allow for the use of both unbiased (such as Rand$k$; MASHA1) and contractive (such as Top$k$; MASHA2) compressors. We empirically validate our conclusions using two experimental setups: a standard bilinear min-max problem, and large-scale distributed adversarial training of transformers.

* 30 pages, 2 algorithms (MASHA 1 and MASHA2), 2 theorems

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Distributed Methods with Compressed Communication for Solving Variational Inequalities, with Theoretical Guarantees

Paper and Code