Leo Phan

Scaling Studies for Efficient Parameter Search and Parallelism for Large Language Model Pre-training

Oct 11, 2023
Via arXiv