Picture for Jacob Hatef

Jacob Hatef

DK

Scaling Large Language Model Training on Frontier with Low-Bandwidth Partitioning

Add code
Jan 08, 2025
Viaarxiv icon

Demystifying the Communication Characteristics for Distributed Transformer Models

Add code
Aug 19, 2024
Figure 1 for Demystifying the Communication Characteristics for Distributed Transformer Models
Figure 2 for Demystifying the Communication Characteristics for Distributed Transformer Models
Figure 3 for Demystifying the Communication Characteristics for Distributed Transformer Models
Figure 4 for Demystifying the Communication Characteristics for Distributed Transformer Models
Viaarxiv icon

The Case for Co-Designing Model Architectures with Hardware

Add code
Jan 30, 2024
Figure 1 for The Case for Co-Designing Model Architectures with Hardware
Figure 2 for The Case for Co-Designing Model Architectures with Hardware
Figure 3 for The Case for Co-Designing Model Architectures with Hardware
Figure 4 for The Case for Co-Designing Model Architectures with Hardware
Viaarxiv icon