Picture for Youhe Jiang

Youhe Jiang

Improving Automatic Parallel Training via Balanced Memory Workload Optimization

Add code
Jul 05, 2023
Figure 1 for Improving Automatic Parallel Training via Balanced Memory Workload Optimization
Figure 2 for Improving Automatic Parallel Training via Balanced Memory Workload Optimization
Figure 3 for Improving Automatic Parallel Training via Balanced Memory Workload Optimization
Figure 4 for Improving Automatic Parallel Training via Balanced Memory Workload Optimization
Viaarxiv icon

Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism

Add code
Nov 25, 2022
Viaarxiv icon