Picture for Yongji Wu

Yongji Wu

Duke University

Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement

Add code
Jul 05, 2024
Figure 1 for Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement
Figure 2 for Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement
Figure 3 for Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement
Figure 4 for Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement
Viaarxiv icon

VcLLM: Video Codecs are Secretly Tensor Codecs

Add code
Jun 29, 2024
Figure 1 for VcLLM: Video Codecs are Secretly Tensor Codecs
Figure 2 for VcLLM: Video Codecs are Secretly Tensor Codecs
Figure 3 for VcLLM: Video Codecs are Secretly Tensor Codecs
Figure 4 for VcLLM: Video Codecs are Secretly Tensor Codecs
Viaarxiv icon

Adaptive Skeleton Graph Decoding

Add code
Feb 19, 2024
Viaarxiv icon

Computing in the Era of Large Generative Models: From Cloud-Native to AI-Native

Add code
Jan 17, 2024
Viaarxiv icon

Curator: Efficient Indexing for Multi-Tenant Vector Databases

Add code
Jan 13, 2024
Viaarxiv icon

AR Visualization System for Ship Detection and Recognition Based on AI

Add code
Nov 21, 2023
Viaarxiv icon

Punica: Multi-Tenant LoRA Serving

Add code
Oct 28, 2023
Viaarxiv icon

Serving and Optimizing Machine Learning Workflows on Heterogeneous Infrastructures

Add code
May 10, 2022
Figure 1 for Serving and Optimizing Machine Learning Workflows on Heterogeneous Infrastructures
Figure 2 for Serving and Optimizing Machine Learning Workflows on Heterogeneous Infrastructures
Figure 3 for Serving and Optimizing Machine Learning Workflows on Heterogeneous Infrastructures
Figure 4 for Serving and Optimizing Machine Learning Workflows on Heterogeneous Infrastructures
Viaarxiv icon

How Powerful is Graph Convolution for Recommendation?

Add code
Aug 17, 2021
Figure 1 for How Powerful is Graph Convolution for Recommendation?
Figure 2 for How Powerful is Graph Convolution for Recommendation?
Figure 3 for How Powerful is Graph Convolution for Recommendation?
Figure 4 for How Powerful is Graph Convolution for Recommendation?
Viaarxiv icon

Linear-Time Self Attention with Codeword Histogram for Efficient Recommendation

Add code
May 28, 2021
Figure 1 for Linear-Time Self Attention with Codeword Histogram for Efficient Recommendation
Figure 2 for Linear-Time Self Attention with Codeword Histogram for Efficient Recommendation
Figure 3 for Linear-Time Self Attention with Codeword Histogram for Efficient Recommendation
Figure 4 for Linear-Time Self Attention with Codeword Histogram for Efficient Recommendation
Viaarxiv icon