Picture for Andrew Gu

Andrew Gu

Jack

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

Meta Lattice: Model Space Redesign for Cost-Effective Industry-Scale Ads Recommendations

Add code
Dec 15, 2025
Viaarxiv icon

GaLore 2: Large-Scale LLM Pre-Training by Gradient Low-Rank Projection

Add code
Apr 29, 2025
Figure 1 for GaLore 2: Large-Scale LLM Pre-Training by Gradient Low-Rank Projection
Figure 2 for GaLore 2: Large-Scale LLM Pre-Training by Gradient Low-Rank Projection
Figure 3 for GaLore 2: Large-Scale LLM Pre-Training by Gradient Low-Rank Projection
Figure 4 for GaLore 2: Large-Scale LLM Pre-Training by Gradient Low-Rank Projection
Viaarxiv icon

SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compile

Add code
Nov 01, 2024
Figure 1 for SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compile
Figure 2 for SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compile
Figure 3 for SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compile
Figure 4 for SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compile
Viaarxiv icon

TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training

Add code
Oct 09, 2024
Figure 1 for TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training
Figure 2 for TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training
Figure 3 for TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training
Figure 4 for TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel

Add code
Apr 21, 2023
Viaarxiv icon

Deep Transfer Learning for Infectious Disease Case Detection Using Electronic Medical Records

Add code
Mar 08, 2021
Figure 1 for Deep Transfer Learning for Infectious Disease Case Detection Using Electronic Medical Records
Figure 2 for Deep Transfer Learning for Infectious Disease Case Detection Using Electronic Medical Records
Figure 3 for Deep Transfer Learning for Infectious Disease Case Detection Using Electronic Medical Records
Figure 4 for Deep Transfer Learning for Infectious Disease Case Detection Using Electronic Medical Records
Viaarxiv icon