Picture for Zhen-Hua Ling

Zhen-Hua Ling

A Neural Denoising Vocoder for Clean Waveform Generation from Noisy Mel-Spectrogram based on Amplitude and Phase Predictions

Add code
Nov 19, 2024
Viaarxiv icon

ESTVocoder: An Excitation-Spectral-Transformed Neural Vocoder Conditioned on Mel Spectrogram

Add code
Nov 18, 2024
Viaarxiv icon

SAMOS: A Neural MOS Prediction Model Leveraging Semantic Representations and Acoustic Features

Add code
Nov 18, 2024
Viaarxiv icon

MDCTCodec: A Lightweight MDCT-based Neural Audio Codec towards High Sampling Rate and Low Bitrate Scenarios

Add code
Nov 01, 2024
Viaarxiv icon

APCodec+: A Spectrum-Coding-Based High-Fidelity and High-Compression-Rate Neural Audio Codec with Staged Training Paradigm

Add code
Oct 30, 2024
Viaarxiv icon

ERVQ: Enhanced Residual Vector Quantization with Intra-and-Inter-Codebook Optimization for Neural Audio Codecs

Add code
Oct 16, 2024
Viaarxiv icon

Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation

Add code
Oct 08, 2024
Figure 1 for Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation
Figure 2 for Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation
Figure 3 for Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation
Figure 4 for Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation
Viaarxiv icon

Stage-Wise and Prior-Aware Neural Speech Phase Prediction

Add code
Oct 07, 2024
Viaarxiv icon

Asynchronous Voice Anonymization Using Adversarial Perturbation On Speaker Embedding

Add code
Jun 12, 2024
Viaarxiv icon

Clever Hans Effect Found in Automatic Detection of Alzheimer's Disease through Speech

Add code
Jun 11, 2024
Viaarxiv icon