Picture for Jiuqiang Tang

Jiuqiang Tang

Scaling On-Device GPU Inference for Large Generative Models

Add code
May 01, 2025
Viaarxiv icon

StreamVC: Real-Time Low-Latency Voice Conversion

Add code
Jan 05, 2024
Viaarxiv icon

Speed Is All You Need: On-Device Acceleration of Large Diffusion Models via GPU-Aware Optimizations

Add code
Apr 21, 2023
Figure 1 for Speed Is All You Need: On-Device Acceleration of Large Diffusion Models via GPU-Aware Optimizations
Figure 2 for Speed Is All You Need: On-Device Acceleration of Large Diffusion Models via GPU-Aware Optimizations
Figure 3 for Speed Is All You Need: On-Device Acceleration of Large Diffusion Models via GPU-Aware Optimizations
Figure 4 for Speed Is All You Need: On-Device Acceleration of Large Diffusion Models via GPU-Aware Optimizations
Viaarxiv icon