Picture for Qiuqiang Kong

Qiuqiang Kong

SemanticAudio: Audio Generation and Editing in Semantic Space

Add code
Jan 29, 2026
Viaarxiv icon

ImmersiveFlow: Stereo-to-7.1.4 spatial audio generation with flow matching

Add code
Jan 19, 2026
Viaarxiv icon

Summary of The Inaugural Music Source Restoration Challenge

Add code
Jan 07, 2026
Viaarxiv icon

MelCap: A Unified Single-Codebook Neural Codec for High-Fidelity Audio Compression

Add code
Oct 02, 2025
Viaarxiv icon

PodEval: A Multimodal Evaluation Framework for Podcast Audio Generation

Add code
Oct 01, 2025
Viaarxiv icon

Region-Specific Audio Tagging for Spatial Sound

Add code
Sep 11, 2025
Viaarxiv icon

CLEAR: Continuous Latent Autoregressive Modeling for High-quality and Low-latency Speech Synthesis

Add code
Aug 26, 2025
Viaarxiv icon

Music Source Restoration

Add code
May 27, 2025
Viaarxiv icon

Training-Free Multi-Step Audio Source Separation

Add code
May 26, 2025
Figure 1 for Training-Free Multi-Step Audio Source Separation
Figure 2 for Training-Free Multi-Step Audio Source Separation
Figure 3 for Training-Free Multi-Step Audio Source Separation
Figure 4 for Training-Free Multi-Step Audio Source Separation
Viaarxiv icon

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Add code
Feb 06, 2025
Viaarxiv icon