Picture for Mu Li

Mu Li

Back to Basics: Revisiting ASR in the Age of Voice Agents

Add code
Mar 26, 2026
Viaarxiv icon

Multimodal Priors-Augmented Text-Driven 3D Human-Object Interaction Generation

Add code
Feb 11, 2026
Viaarxiv icon

Dynamic Worlds, Dynamic Humans: Generating Virtual Human-Scene Interaction Motion in Dynamic Scenes

Add code
Jan 27, 2026
Viaarxiv icon

DynaQuant: Dynamic Mixed-Precision Quantization for Learned Image Compression

Add code
Nov 11, 2025
Viaarxiv icon

Dataset Distillation as Data Compression: A Rate-Utility Perspective

Add code
Jul 23, 2025
Viaarxiv icon

EmergentTTS-Eval: Evaluating TTS Models on Complex Prosodic, Expressiveness, and Linguistic Challenges Using Model-as-a-Judge

Add code
May 29, 2025
Viaarxiv icon

Learned Image Compression with Dictionary-based Entropy Model

Add code
Apr 01, 2025
Figure 1 for Learned Image Compression with Dictionary-based Entropy Model
Figure 2 for Learned Image Compression with Dictionary-based Entropy Model
Figure 3 for Learned Image Compression with Dictionary-based Entropy Model
Figure 4 for Learned Image Compression with Dictionary-based Entropy Model
Viaarxiv icon

ShiftLIC: Lightweight Learned Image Compression with Spatial-Channel Shift Operations

Add code
Mar 29, 2025
Viaarxiv icon

Fg-T2M++: LLMs-Augmented Fine-Grained Text Driven Human Motion Generation

Add code
Feb 08, 2025
Viaarxiv icon

Learned Scanpaths Aid Blind Panoramic Video Quality Assessment

Add code
Mar 30, 2024
Viaarxiv icon