Zhen-Hua Ling

Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis

Dec 22, 2024
Via arXiv

On the Generation and Removal of Speaker Adversarial Perturbation for Voice-Privacy Protection

Dec 12, 2024
Via arXiv

Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection

Dec 09, 2024
Via arXiv

A Neural Denoising Vocoder for Clean Waveform Generation from Noisy Mel-Spectrogram based on Amplitude and Phase Predictions

Nov 19, 2024
Via arXiv

ESTVocoder: An Excitation-Spectral-Transformed Neural Vocoder Conditioned on Mel Spectrogram

Nov 18, 2024
Via arXiv

SAMOS: A Neural MOS Prediction Model Leveraging Semantic Representations and Acoustic Features

Nov 18, 2024
Via arXiv

Pitch-and-Spectrum-Aware Singing Quality Assessment with Bias Correction and Model Fusion

Nov 17, 2024
Via arXiv

MDCTCodec: A Lightweight MDCT-based Neural Audio Codec towards High Sampling Rate and Low Bitrate Scenarios

Nov 01, 2024
Via arXiv

APCodec+: A Spectrum-Coding-Based High-Fidelity and High-Compression-Rate Neural Audio Codec with Staged Training Paradigm

Oct 30, 2024
Via arXiv

ERVQ: Enhanced Residual Vector Quantization with Intra-and-Inter-Codebook Optimization for Neural Audio Codecs

Oct 16, 2024
Via arXiv