Picture for Yixuan Mei

Yixuan Mei

Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs

Add code
Jun 03, 2024
Figure 1 for Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs
Figure 2 for Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs
Figure 3 for Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs
Figure 4 for Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs
Viaarxiv icon

Quarl: A Learning-Based Quantum Circuit Optimizer

Add code
Jul 17, 2023
Viaarxiv icon