Yiyao Sheng

ByteCheckpoint: A Unified Checkpointing System for LLM Development

Jul 29, 2024
Via arXiv

MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs

Feb 23, 2024
Via arXiv