Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training

Nov 20, 2019

Lei Guan, Wotao Yin, Dongsheng Li, Xicheng Lu

Figure 1 for XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training

Figure 2 for XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training

Figure 3 for XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training

Figure 4 for XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training

Share this with someone who'll enjoy it:

Abstract:We propose XPipe, an efficient asynchronous pipeline model parallelism approach for multi-GPU DNN training. XPipe is designed to make use of multiple GPUs to concurrently and continuously train different parts of a DNN model. To improve GPU utilization and achieve high throughput, it splits a mini-batch into a set of micro-batches and allows the overlapping of the pipelines of multiple micro-batches, including those belonging to different mini-batches. Most importantly, the novel weight prediction strategy adopted by XPipe enables it to effectively address the weight inconsistency and staleness issues incurred by the asynchronous pipeline parallelism. As a result, XPipe incorporates the advantages of both synchronous and asynchronous pipeline model parallelism approaches. Concretely, it can achieve very comparable (even slightly better) model accuracy as its synchronous counterpart, while obtaining higher throughput than it. Experimental results show that XPipe outperforms other state-of-the-art synchronous and asynchronous model parallelism approaches.

* 9 pages

View paper on

Share this with someone who'll enjoy it:

Title:XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training

Paper and Code