Picture for Liqun Deng

Liqun Deng

SA-WavLM: Speaker-Aware Self-Supervised Pre-training for Mixture Speech

Add code
Jul 03, 2024
Figure 1 for SA-WavLM: Speaker-Aware Self-Supervised Pre-training for Mixture Speech
Figure 2 for SA-WavLM: Speaker-Aware Self-Supervised Pre-training for Mixture Speech
Figure 3 for SA-WavLM: Speaker-Aware Self-Supervised Pre-training for Mixture Speech
Viaarxiv icon

Music-PAW: Learning Music Representations via Hierarchical Part-whole Interaction and Contrast

Add code
Dec 11, 2023
Viaarxiv icon

Prompt-driven Target Speech Diarization

Add code
Oct 23, 2023
Viaarxiv icon

USED: Universal Speaker Extraction and Diarization

Add code
Sep 19, 2023
Viaarxiv icon

DisCover: Disentangled Music Representation Learning for Cover Song Identification

Add code
Jul 19, 2023
Figure 1 for DisCover: Disentangled Music Representation Learning for Cover Song Identification
Figure 2 for DisCover: Disentangled Music Representation Learning for Cover Song Identification
Figure 3 for DisCover: Disentangled Music Representation Learning for Cover Song Identification
Figure 4 for DisCover: Disentangled Music Representation Learning for Cover Song Identification
Viaarxiv icon

CorrectSpeech: A Fully Automated System for Speech Correction and Accent Reduction

Add code
Apr 12, 2022
Figure 1 for CorrectSpeech: A Fully Automated System for Speech Correction and Accent Reduction
Figure 2 for CorrectSpeech: A Fully Automated System for Speech Correction and Accent Reduction
Figure 3 for CorrectSpeech: A Fully Automated System for Speech Correction and Accent Reduction
Figure 4 for CorrectSpeech: A Fully Automated System for Speech Correction and Accent Reduction
Viaarxiv icon

Reducing language context confusion for end-to-end code-switching automatic speech recognition

Add code
Jan 28, 2022
Figure 1 for Reducing language context confusion for end-to-end code-switching automatic speech recognition
Figure 2 for Reducing language context confusion for end-to-end code-switching automatic speech recognition
Figure 3 for Reducing language context confusion for end-to-end code-switching automatic speech recognition
Figure 4 for Reducing language context confusion for end-to-end code-switching automatic speech recognition
Viaarxiv icon

CCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation detection and diagnosis

Add code
Nov 16, 2021
Figure 1 for CCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation detection and diagnosis
Figure 2 for CCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation detection and diagnosis
Figure 3 for CCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation detection and diagnosis
Figure 4 for CCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation detection and diagnosis
Viaarxiv icon

EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion

Add code
Jul 04, 2021
Figure 1 for EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion
Figure 2 for EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion
Figure 3 for EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion
Figure 4 for EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion
Viaarxiv icon

VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion

Add code
Jun 18, 2021
Figure 1 for VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion
Figure 2 for VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion
Figure 3 for VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion
Figure 4 for VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion
Viaarxiv icon