Hangting Chen

MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization

Jan 03, 2025
Via arXiv

SongEditor: Adapting Zero-Shot Song Generation Language Model as a Multi-Task Editor

Dec 18, 2024
Via arXiv

Gull: A Generative Multifunctional Audio Codec

Apr 07, 2024
Via arXiv

Continuous Target Speech Extraction: Enhancing Personalized Diarization and Extraction on Complex Recordings

Jan 29, 2024
Via arXiv

Consistent and Relevant: Rethink the Query Embedding in General Sound Separation

Dec 24, 2023
Via arXiv

SECap: Speech Emotion Captioning with Large Language Model

Dec 23, 2023
Via arXiv

AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data

Sep 25, 2023
Via arXiv

Complexity Scaling for Speech Denoising

Sep 14, 2023
Via arXiv

Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression

Aug 21, 2023
Via arXiv

Bayes Risk Transducer: Transducer with Controllable Alignment Prediction

Aug 19, 2023
Via arXiv