Anar Yusifov

Data-parallel distributed training of very large models beyond GPU capacity

Nov 29, 2018
Figures 1-4 for Data-parallel distributed training of very large models beyond GPU capacity
Via arXiv