Picture for Thomas Fang Zheng

Thomas Fang Zheng

Whisper-PMFA: Partial Multi-Scale Feature Aggregation for Speaker Verification using Whisper Models

Add code
Aug 28, 2024
Viaarxiv icon

A Joint Noise Disentanglement and Adversarial Training Framework for Robust Speaker Verification

Add code
Aug 22, 2024
Figure 1 for A Joint Noise Disentanglement and Adversarial Training Framework for Robust Speaker Verification
Figure 2 for A Joint Noise Disentanglement and Adversarial Training Framework for Robust Speaker Verification
Figure 3 for A Joint Noise Disentanglement and Adversarial Training Framework for Robust Speaker Verification
Figure 4 for A Joint Noise Disentanglement and Adversarial Training Framework for Robust Speaker Verification
Viaarxiv icon

Speaker Adaptation for Quantised End-to-End ASR Models

Add code
Aug 07, 2024
Figure 1 for Speaker Adaptation for Quantised End-to-End ASR Models
Figure 2 for Speaker Adaptation for Quantised End-to-End ASR Models
Viaarxiv icon

SAML: Speaker Adaptive Mixture of LoRA Experts for End-to-End ASR

Add code
Jun 28, 2024
Figure 1 for SAML: Speaker Adaptive Mixture of LoRA Experts for End-to-End ASR
Figure 2 for SAML: Speaker Adaptive Mixture of LoRA Experts for End-to-End ASR
Figure 3 for SAML: Speaker Adaptive Mixture of LoRA Experts for End-to-End ASR
Figure 4 for SAML: Speaker Adaptive Mixture of LoRA Experts for End-to-End ASR
Viaarxiv icon

Enhancing Quantised End-to-End ASR Models via Personalisation

Add code
Sep 17, 2023
Viaarxiv icon

How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition

Add code
Nov 24, 2021
Figure 1 for How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition
Figure 2 for How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition
Figure 3 for How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition
Figure 4 for How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition
Viaarxiv icon

A Multi-Resolution Front-End for End-to-End Speech Anti-Spoofing

Add code
Oct 11, 2021
Figure 1 for A Multi-Resolution Front-End for End-to-End Speech Anti-Spoofing
Figure 2 for A Multi-Resolution Front-End for End-to-End Speech Anti-Spoofing
Figure 3 for A Multi-Resolution Front-End for End-to-End Speech Anti-Spoofing
Figure 4 for A Multi-Resolution Front-End for End-to-End Speech Anti-Spoofing
Viaarxiv icon

Attack on practical speaker verification system using universal adversarial perturbations

Add code
May 19, 2021
Figure 1 for Attack on practical speaker verification system using universal adversarial perturbations
Figure 2 for Attack on practical speaker verification system using universal adversarial perturbations
Figure 3 for Attack on practical speaker verification system using universal adversarial perturbations
Figure 4 for Attack on practical speaker verification system using universal adversarial perturbations
Viaarxiv icon

CN-Celeb: multi-genre speaker recognition

Add code
Dec 23, 2020
Figure 1 for CN-Celeb: multi-genre speaker recognition
Figure 2 for CN-Celeb: multi-genre speaker recognition
Figure 3 for CN-Celeb: multi-genre speaker recognition
Figure 4 for CN-Celeb: multi-genre speaker recognition
Viaarxiv icon

Squeezing value of cross-domain labels: a decoupled scoring approach for speaker verification

Add code
Oct 27, 2020
Figure 1 for Squeezing value of cross-domain labels: a decoupled scoring approach for speaker verification
Figure 2 for Squeezing value of cross-domain labels: a decoupled scoring approach for speaker verification
Figure 3 for Squeezing value of cross-domain labels: a decoupled scoring approach for speaker verification
Viaarxiv icon