Picture for Qiuqiang Kong

Qiuqiang Kong

DRCap: Decoding CLAP Latents with Retrieval-augmented Generation for Zero-shot Audio Captioning

Add code
Oct 12, 2024
Viaarxiv icon

Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement

Add code
Sep 15, 2024
Viaarxiv icon

Language-Queried Target Sound Extraction Without Parallel Training Data

Add code
Sep 14, 2024
Figure 1 for Language-Queried Target Sound Extraction Without Parallel Training Data
Figure 2 for Language-Queried Target Sound Extraction Without Parallel Training Data
Figure 3 for Language-Queried Target Sound Extraction Without Parallel Training Data
Figure 4 for Language-Queried Target Sound Extraction Without Parallel Training Data
Viaarxiv icon

SymPAC: Scalable Symbolic Music Generation With Prompts And Constraints

Add code
Sep 04, 2024
Figure 1 for SymPAC: Scalable Symbolic Music Generation With Prompts And Constraints
Figure 2 for SymPAC: Scalable Symbolic Music Generation With Prompts And Constraints
Figure 3 for SymPAC: Scalable Symbolic Music Generation With Prompts And Constraints
Figure 4 for SymPAC: Scalable Symbolic Music Generation With Prompts And Constraints
Viaarxiv icon

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Add code
Aug 30, 2024
Figure 1 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Figure 2 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Figure 3 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Figure 4 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Viaarxiv icon

Foundation Models for Music: A Survey

Add code
Aug 27, 2024
Figure 1 for Foundation Models for Music: A Survey
Figure 2 for Foundation Models for Music: A Survey
Figure 3 for Foundation Models for Music: A Survey
Figure 4 for Foundation Models for Music: A Survey
Viaarxiv icon

Universal Sound Separation with Self-Supervised Audio Masked Autoencoder

Add code
Jul 16, 2024
Viaarxiv icon

MusicScore: A Dataset for Music Score Modeling and Generation

Add code
Jun 17, 2024
Viaarxiv icon

Towards Out-of-Distribution Detection in Vocoder Recognition via Latent Feature Reconstruction

Add code
Jun 04, 2024
Viaarxiv icon

WavCraft: Audio Editing and Generation with Natural Language Prompts

Add code
Mar 15, 2024
Viaarxiv icon