Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism

Oct 05, 2019


