Picture for Shansong Liu

Shansong Liu

Editing Music with Melody and Text: Using ControlNet for Diffusion Transformer

Add code
Oct 07, 2024
Viaarxiv icon

M$^{2}$UGen: Multi-modal Music Understanding and Generation with the Power of Large Language Models

Add code
Nov 28, 2023
Viaarxiv icon

HumTrans: A Novel Open-Source Dataset for Humming Melody Transcription and Beyond

Add code
Sep 18, 2023
Viaarxiv icon

Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning

Add code
Aug 22, 2023
Viaarxiv icon

A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion

Add code
Jul 06, 2022
Figure 1 for A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion
Figure 2 for A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion
Figure 3 for A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion
Figure 4 for A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion
Viaarxiv icon

Exploiting Cross Domain Acoustic-to-articulatory Inverted Features For Disordered Speech Recognition

Add code
Mar 19, 2022
Figure 1 for Exploiting Cross Domain Acoustic-to-articulatory Inverted Features For Disordered Speech Recognition
Figure 2 for Exploiting Cross Domain Acoustic-to-articulatory Inverted Features For Disordered Speech Recognition
Figure 3 for Exploiting Cross Domain Acoustic-to-articulatory Inverted Features For Disordered Speech Recognition
Figure 4 for Exploiting Cross Domain Acoustic-to-articulatory Inverted Features For Disordered Speech Recognition
Viaarxiv icon

Recent Progress in the CUHK Dysarthric Speech Recognition System

Add code
Jan 15, 2022
Figure 1 for Recent Progress in the CUHK Dysarthric Speech Recognition System
Figure 2 for Recent Progress in the CUHK Dysarthric Speech Recognition System
Figure 3 for Recent Progress in the CUHK Dysarthric Speech Recognition System
Figure 4 for Recent Progress in the CUHK Dysarthric Speech Recognition System
Viaarxiv icon

Investigation of Data Augmentation Techniques for Disordered Speech Recognition

Add code
Jan 14, 2022
Figure 1 for Investigation of Data Augmentation Techniques for Disordered Speech Recognition
Figure 2 for Investigation of Data Augmentation Techniques for Disordered Speech Recognition
Figure 3 for Investigation of Data Augmentation Techniques for Disordered Speech Recognition
Figure 4 for Investigation of Data Augmentation Techniques for Disordered Speech Recognition
Viaarxiv icon

Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition

Add code
Jan 14, 2022
Figure 1 for Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition
Figure 2 for Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition
Figure 3 for Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition
Figure 4 for Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition
Viaarxiv icon

Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks

Add code
Jan 12, 2022
Figure 1 for Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks
Figure 2 for Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks
Figure 3 for Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks
Figure 4 for Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks
Viaarxiv icon