Picture for Lang Xu

Lang Xu

Accelerating Large Language Model Training with Hybrid GPU-based Compression

Add code
Sep 04, 2024
Figure 1 for Accelerating Large Language Model Training with Hybrid GPU-based Compression
Figure 2 for Accelerating Large Language Model Training with Hybrid GPU-based Compression
Figure 3 for Accelerating Large Language Model Training with Hybrid GPU-based Compression
Figure 4 for Accelerating Large Language Model Training with Hybrid GPU-based Compression
Viaarxiv icon

Demystifying the Communication Characteristics for Distributed Transformer Models

Add code
Aug 19, 2024
Figure 1 for Demystifying the Communication Characteristics for Distributed Transformer Models
Figure 2 for Demystifying the Communication Characteristics for Distributed Transformer Models
Figure 3 for Demystifying the Communication Characteristics for Distributed Transformer Models
Figure 4 for Demystifying the Communication Characteristics for Distributed Transformer Models
Viaarxiv icon