Picture for Song Han

Song Han

University of Connecticut

Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

Add code
Feb 03, 2026
Viaarxiv icon

Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery

Add code
Jan 27, 2026
Viaarxiv icon

Scaling Test-time Inference for Visual Grounding

Add code
Jan 20, 2026
Viaarxiv icon

Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow

Add code
Jan 20, 2026
Viaarxiv icon

Pretraining Frame Preservation in Autoregressive Video Memory Compression

Add code
Dec 29, 2025
Viaarxiv icon

Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed

Add code
Dec 16, 2025
Figure 1 for Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed
Figure 2 for Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed
Figure 3 for Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed
Figure 4 for Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed
Viaarxiv icon

BLASST: Dynamic BLocked Attention Sparsity via Softmax Thresholding

Add code
Dec 12, 2025
Viaarxiv icon

FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos

Add code
Dec 11, 2025
Figure 1 for FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos
Figure 2 for FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos
Figure 3 for FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos
Figure 4 for FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos
Viaarxiv icon

Optimizing Mixture of Block Attention

Add code
Nov 14, 2025
Viaarxiv icon

ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

Add code
Nov 13, 2025
Viaarxiv icon