Picture for Zhuang Wang

Zhuang Wang

Marconi: Prefix Caching for the Era of Hybrid LLMs

Add code
Nov 28, 2024
Viaarxiv icon

Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement

Add code
Jul 05, 2024
Figure 1 for Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement
Figure 2 for Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement
Figure 3 for Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement
Figure 4 for Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement
Viaarxiv icon

Zen: Near-Optimal Sparse Tensor Synchronization for Distributed DNN Training

Add code
Sep 23, 2023
Viaarxiv icon

ByteComp: Revisiting Gradient Compression in Distributed Training

Add code
Jun 06, 2022
Figure 1 for ByteComp: Revisiting Gradient Compression in Distributed Training
Figure 2 for ByteComp: Revisiting Gradient Compression in Distributed Training
Figure 3 for ByteComp: Revisiting Gradient Compression in Distributed Training
Figure 4 for ByteComp: Revisiting Gradient Compression in Distributed Training
Viaarxiv icon

MergeComp: A Compression Scheduler for Scalable Communication-Efficient Distributed Training

Add code
Mar 28, 2021
Figure 1 for MergeComp: A Compression Scheduler for Scalable Communication-Efficient Distributed Training
Figure 2 for MergeComp: A Compression Scheduler for Scalable Communication-Efficient Distributed Training
Figure 3 for MergeComp: A Compression Scheduler for Scalable Communication-Efficient Distributed Training
Figure 4 for MergeComp: A Compression Scheduler for Scalable Communication-Efficient Distributed Training
Viaarxiv icon