Picture for Yujun Lin

Yujun Lin

Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity

Add code
Feb 03, 2025
Viaarxiv icon

SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer

Add code
Jan 30, 2025
Figure 1 for SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
Figure 2 for SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
Figure 3 for SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
Figure 4 for SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
Viaarxiv icon

SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Add code
Nov 07, 2024
Figure 1 for SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Figure 2 for SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Figure 3 for SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Figure 4 for SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Viaarxiv icon

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers

Add code
Oct 15, 2024
Viaarxiv icon

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

Add code
May 07, 2024
Viaarxiv icon

Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications

Add code
Apr 25, 2022
Figure 1 for Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Figure 2 for Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Figure 3 for Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Figure 4 for Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Viaarxiv icon

TorchSparse: Efficient Point Cloud Inference Engine

Add code
Apr 21, 2022
Figure 1 for TorchSparse: Efficient Point Cloud Inference Engine
Figure 2 for TorchSparse: Efficient Point Cloud Inference Engine
Figure 3 for TorchSparse: Efficient Point Cloud Inference Engine
Figure 4 for TorchSparse: Efficient Point Cloud Inference Engine
Viaarxiv icon

QuantumNAS: Noise-Adaptive Search for Robust Quantum Circuits

Add code
Aug 02, 2021
Figure 1 for QuantumNAS: Noise-Adaptive Search for Robust Quantum Circuits
Figure 2 for QuantumNAS: Noise-Adaptive Search for Robust Quantum Circuits
Figure 3 for QuantumNAS: Noise-Adaptive Search for Robust Quantum Circuits
Figure 4 for QuantumNAS: Noise-Adaptive Search for Robust Quantum Circuits
Viaarxiv icon

NAAS: Neural Accelerator Architecture Search

Add code
May 27, 2021
Figure 1 for NAAS: Neural Accelerator Architecture Search
Figure 2 for NAAS: Neural Accelerator Architecture Search
Figure 3 for NAAS: Neural Accelerator Architecture Search
Figure 4 for NAAS: Neural Accelerator Architecture Search
Viaarxiv icon

Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution

Add code
Aug 13, 2020
Figure 1 for Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution
Figure 2 for Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution
Figure 3 for Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution
Figure 4 for Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution
Viaarxiv icon