Jonathan Hseu

Reducing BERT Pre-Training Time from 3 Days to 76 Minutes

Apr 01, 2019
Via arXiv

Large-Batch Training for LSTM and Beyond

Jan 24, 2019
Via arXiv