Picture for Chenpeng Du

Chenpeng Du

LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec

Add code
Oct 21, 2024
Viaarxiv icon

vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders

Add code
Sep 03, 2024
Figure 1 for vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders
Figure 2 for vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders
Figure 3 for vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders
Figure 4 for vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders
Viaarxiv icon

Language Model Can Listen While Speaking

Add code
Aug 05, 2024
Viaarxiv icon

AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding

Add code
May 06, 2024
Viaarxiv icon

Attention-Constrained Inference for Robust Decoder-Only Text-to-Speech

Add code
Apr 30, 2024
Viaarxiv icon

GSTalker: Real-time Audio-Driven Talking Face Generation via Deformable Gaussian Splatting

Add code
Apr 29, 2024
Viaarxiv icon

The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge

Add code
Apr 10, 2024
Viaarxiv icon

VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech

Add code
Jan 30, 2024
Viaarxiv icon

DiffDub: Person-generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-encoder

Add code
Nov 03, 2023
Viaarxiv icon

Acoustic BPE for Speech Generation with Discrete Tokens

Add code
Oct 23, 2023
Viaarxiv icon