Picture for Zehai Tu

Zehai Tu

Enhancing Expressive Voice Conversion with Discrete Pitch-Conditioned Flow Matching Model

Add code
Feb 08, 2025
Viaarxiv icon

CueTip: An Interactive and Explainable Physics-aware Pool Assistant

Add code
Jan 30, 2025
Viaarxiv icon

Enabling Beam Search for Language Model-Based Text-to-Speech Synthesis

Add code
Aug 29, 2024
Viaarxiv icon

Intelligibility prediction with a pretrained noise-robust automatic speech recognition model

Add code
Oct 20, 2023
Viaarxiv icon

Energy-Based Models For Speech Synthesis

Add code
Oct 19, 2023
Viaarxiv icon

Unsupervised Uncertainty Measures of Automatic Speech Recognition for Non-intrusive Speech Intelligibility Prediction

Add code
Apr 08, 2022
Figure 1 for Unsupervised Uncertainty Measures of Automatic Speech Recognition for Non-intrusive Speech Intelligibility Prediction
Figure 2 for Unsupervised Uncertainty Measures of Automatic Speech Recognition for Non-intrusive Speech Intelligibility Prediction
Figure 3 for Unsupervised Uncertainty Measures of Automatic Speech Recognition for Non-intrusive Speech Intelligibility Prediction
Figure 4 for Unsupervised Uncertainty Measures of Automatic Speech Recognition for Non-intrusive Speech Intelligibility Prediction
Viaarxiv icon

Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners

Add code
Apr 08, 2022
Figure 1 for Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners
Figure 2 for Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners
Figure 3 for Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners
Figure 4 for Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners
Viaarxiv icon

Auditory-Based Data Augmentation for End-to-End Automatic Speech Recognition

Add code
Apr 08, 2022
Figure 1 for Auditory-Based Data Augmentation for End-to-End Automatic Speech Recognition
Figure 2 for Auditory-Based Data Augmentation for End-to-End Automatic Speech Recognition
Figure 3 for Auditory-Based Data Augmentation for End-to-End Automatic Speech Recognition
Figure 4 for Auditory-Based Data Augmentation for End-to-End Automatic Speech Recognition
Viaarxiv icon

Optimising Hearing Aid Fittings for Speech in Noise with a Differentiable Hearing Loss Model

Add code
Jun 08, 2021
Figure 1 for Optimising Hearing Aid Fittings for Speech in Noise with a Differentiable Hearing Loss Model
Figure 2 for Optimising Hearing Aid Fittings for Speech in Noise with a Differentiable Hearing Loss Model
Figure 3 for Optimising Hearing Aid Fittings for Speech in Noise with a Differentiable Hearing Loss Model
Figure 4 for Optimising Hearing Aid Fittings for Speech in Noise with a Differentiable Hearing Loss Model
Viaarxiv icon

DHASP: Differentiable Hearing Aid Speech Processing

Add code
Mar 15, 2021
Figure 1 for DHASP: Differentiable Hearing Aid Speech Processing
Figure 2 for DHASP: Differentiable Hearing Aid Speech Processing
Figure 3 for DHASP: Differentiable Hearing Aid Speech Processing
Figure 4 for DHASP: Differentiable Hearing Aid Speech Processing
Viaarxiv icon