Picture for Gongping Huang

Gongping Huang

CoCoEmo: Composable and Controllable Human-Like Emotional TTS via Activation Steering

Add code
Feb 03, 2026
Viaarxiv icon

A Unified Neural Codec Language Model for Selective Editable Text to Speech Generation

Add code
Jan 18, 2026
Viaarxiv icon

Robust Online Overdetermined Independent Vector Analysis Based on Bilinear Decomposition

Add code
Jan 18, 2026
Viaarxiv icon

TTMBA: Towards Text To Multiple Sources Binaural Audio Generation

Add code
Jul 22, 2025
Figure 1 for TTMBA: Towards Text To Multiple Sources Binaural Audio Generation
Figure 2 for TTMBA: Towards Text To Multiple Sources Binaural Audio Generation
Figure 3 for TTMBA: Towards Text To Multiple Sources Binaural Audio Generation
Figure 4 for TTMBA: Towards Text To Multiple Sources Binaural Audio Generation
Viaarxiv icon

Spatial-Filter-Bank-Based Neural Method for Multichannel Speech Enhancement

Add code
Apr 02, 2025
Viaarxiv icon

LMFCA-Net: A Lightweight Model for Multi-Channel Speech Enhancement with Efficient Narrow-Band and Cross-Band Attention

Add code
Feb 17, 2025
Viaarxiv icon

Advances in Microphone Array Processing and Multichannel Speech Enhancement

Add code
Feb 13, 2025
Viaarxiv icon

Noro: A Noise-Robust One-shot Voice Conversion System with Hidden Speaker Representation Capabilities

Add code
Nov 29, 2024
Figure 1 for Noro: A Noise-Robust One-shot Voice Conversion System with Hidden Speaker Representation Capabilities
Figure 2 for Noro: A Noise-Robust One-shot Voice Conversion System with Hidden Speaker Representation Capabilities
Figure 3 for Noro: A Noise-Robust One-shot Voice Conversion System with Hidden Speaker Representation Capabilities
Figure 4 for Noro: A Noise-Robust One-shot Voice Conversion System with Hidden Speaker Representation Capabilities
Viaarxiv icon