Picture for Zhizheng Wu

Zhizheng Wu

NV-Bench: Benchmark of Nonverbal Vocalization Synthesis for Expressive Text-to-Speech Generation

Add code
Mar 16, 2026
Viaarxiv icon

WhispEar: A Bi-directional Framework for Scaling Whispered Speech Conversion via Pseudo-Parallel Whisper Generation

Add code
Mar 09, 2026
Viaarxiv icon

Anatomy of the Modality Gap: Dissecting the Internal States of End-to-End Speech LLMs

Add code
Mar 02, 2026
Viaarxiv icon

VoxPrivacy: A Benchmark for Evaluating Interactional Privacy of Speech Language Models

Add code
Jan 27, 2026
Viaarxiv icon

Linear Script Representations in Speech Foundation Models Enable Zero-Shot Transliteration

Add code
Jan 06, 2026
Viaarxiv icon

Aliasing-Free Neural Audio Synthesis

Add code
Dec 23, 2025
Figure 1 for Aliasing-Free Neural Audio Synthesis
Figure 2 for Aliasing-Free Neural Audio Synthesis
Figure 3 for Aliasing-Free Neural Audio Synthesis
Figure 4 for Aliasing-Free Neural Audio Synthesis
Viaarxiv icon

SpeechJudge: Towards Human-Level Judgment for Speech Naturalness

Add code
Nov 11, 2025
Figure 1 for SpeechJudge: Towards Human-Level Judgment for Speech Naturalness
Figure 2 for SpeechJudge: Towards Human-Level Judgment for Speech Naturalness
Figure 3 for SpeechJudge: Towards Human-Level Judgment for Speech Naturalness
Figure 4 for SpeechJudge: Towards Human-Level Judgment for Speech Naturalness
Viaarxiv icon

SP-MCQA: Evaluating Intelligibility of TTS Beyond the Word Level

Add code
Oct 30, 2025
Viaarxiv icon

The Singing Voice Conversion Challenge 2025: From Singer Identity Conversion To Singing Style Conversion

Add code
Sep 19, 2025
Viaarxiv icon

AnyAccomp: Generalizable Accompaniment Generation via Quantized Melodic Bottleneck

Add code
Sep 17, 2025
Figure 1 for AnyAccomp: Generalizable Accompaniment Generation via Quantized Melodic Bottleneck
Figure 2 for AnyAccomp: Generalizable Accompaniment Generation via Quantized Melodic Bottleneck
Figure 3 for AnyAccomp: Generalizable Accompaniment Generation via Quantized Melodic Bottleneck
Figure 4 for AnyAccomp: Generalizable Accompaniment Generation via Quantized Melodic Bottleneck
Viaarxiv icon