Picture for Yang Ai

Yang Ai

A Neural Denoising Vocoder for Clean Waveform Generation from Noisy Mel-Spectrogram based on Amplitude and Phase Predictions

Add code
Nov 19, 2024
Viaarxiv icon

SAMOS: A Neural MOS Prediction Model Leveraging Semantic Representations and Acoustic Features

Add code
Nov 18, 2024
Viaarxiv icon

ESTVocoder: An Excitation-Spectral-Transformed Neural Vocoder Conditioned on Mel Spectrogram

Add code
Nov 18, 2024
Viaarxiv icon

Pitch-and-Spectrum-Aware Singing Quality Assessment with Bias Correction and Model Fusion

Add code
Nov 17, 2024
Viaarxiv icon

MDCTCodec: A Lightweight MDCT-based Neural Audio Codec towards High Sampling Rate and Low Bitrate Scenarios

Add code
Nov 01, 2024
Viaarxiv icon

APCodec+: A Spectrum-Coding-Based High-Fidelity and High-Compression-Rate Neural Audio Codec with Staged Training Paradigm

Add code
Oct 30, 2024
Viaarxiv icon

ERVQ: Enhanced Residual Vector Quantization with Intra-and-Inter-Codebook Optimization for Neural Audio Codecs

Add code
Oct 16, 2024
Viaarxiv icon

Stage-Wise and Prior-Aware Neural Speech Phase Prediction

Add code
Oct 07, 2024
Viaarxiv icon

Refining Self-Supervised Learnt Speech Representation using Brain Activations

Add code
Jun 12, 2024
Viaarxiv icon

Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control

Add code
Jun 04, 2024
Viaarxiv icon