Picture for Timothy Chou

Timothy Chou

Sid

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

Fast and Simplex: 2-Simplicial Attention in Triton

Add code
Jul 03, 2025
Viaarxiv icon

Accelerating Transformer Inference and Training with 2:4 Activation Sparsity

Add code
Mar 20, 2025
Figure 1 for Accelerating Transformer Inference and Training with 2:4 Activation Sparsity
Figure 2 for Accelerating Transformer Inference and Training with 2:4 Activation Sparsity
Figure 3 for Accelerating Transformer Inference and Training with 2:4 Activation Sparsity
Figure 4 for Accelerating Transformer Inference and Training with 2:4 Activation Sparsity
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon