Picture for Abhinav Venigalla

Abhinav Venigalla

BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text

Add code
Mar 27, 2024
Viaarxiv icon

MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining

Add code
Jan 16, 2024
Viaarxiv icon

Representation range needs for 16-bit neural network training

Add code
Apr 06, 2021
Figure 1 for Representation range needs for 16-bit neural network training
Figure 2 for Representation range needs for 16-bit neural network training
Figure 3 for Representation range needs for 16-bit neural network training
Figure 4 for Representation range needs for 16-bit neural network training
Viaarxiv icon

Adaptive Braking for Mitigating Gradient Delay

Add code
Jul 10, 2020
Figure 1 for Adaptive Braking for Mitigating Gradient Delay
Figure 2 for Adaptive Braking for Mitigating Gradient Delay
Figure 3 for Adaptive Braking for Mitigating Gradient Delay
Figure 4 for Adaptive Braking for Mitigating Gradient Delay
Viaarxiv icon

Pipelined Backpropagation at Scale: Training Large Models without Batches

Add code
Mar 25, 2020
Figure 1 for Pipelined Backpropagation at Scale: Training Large Models without Batches
Figure 2 for Pipelined Backpropagation at Scale: Training Large Models without Batches
Figure 3 for Pipelined Backpropagation at Scale: Training Large Models without Batches
Figure 4 for Pipelined Backpropagation at Scale: Training Large Models without Batches
Viaarxiv icon