Picture for Patrick LeGresley

Patrick LeGresley

Nemotron-4 340B Technical Report

Add code
Jun 17, 2024
Viaarxiv icon

Nemotron-4 15B Technical Report

Add code
Feb 27, 2024
Viaarxiv icon

Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model

Add code
Feb 04, 2022
Figure 1 for Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model
Figure 2 for Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model
Figure 3 for Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model
Figure 4 for Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model
Viaarxiv icon

Efficient Large-Scale Language Model Training on GPU Clusters

Add code
Apr 09, 2021
Figure 1 for Efficient Large-Scale Language Model Training on GPU Clusters
Figure 2 for Efficient Large-Scale Language Model Training on GPU Clusters
Figure 3 for Efficient Large-Scale Language Model Training on GPU Clusters
Figure 4 for Efficient Large-Scale Language Model Training on GPU Clusters
Viaarxiv icon

Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism

Add code
Oct 05, 2019
Figure 1 for Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
Figure 2 for Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
Figure 3 for Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
Figure 4 for Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
Viaarxiv icon

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

Add code
Dec 08, 2015
Figure 1 for Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Figure 2 for Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Figure 3 for Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Figure 4 for Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Viaarxiv icon