Chang Zeng

SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios

Oct 02, 2024

Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches

Sep 10, 2024

InstructSing: High-Fidelity Singing Voice Generation via Instructing Yourself

Sep 10, 2024

A Benchmark for Multi-speaker Anonymization

Jul 08, 2024

HAM-TTS: Hierarchical Acoustic Modeling for Token-Based Zero-Shot Text-to-Speech with Model and Data Scaling

Mar 09, 2024

CrossSinger: A Cross-Lingual Multi-Singer High-Fidelity Singing Voice Synthesizer Trained on Monolingual Singers

Sep 22, 2023

Improving Generalization Ability of Countermeasures for New Mismatch Scenario by Combining Multiple Advanced Regularization Terms

May 18, 2023

Beyond Universal Transformer: block reusing with adaptor in Transformer for automatic speech recognition

Mar 23, 2023

Cross-modal Audio-visual Co-learning for Text-independent Speaker Verification

Feb 22, 2023

Xiaoicesing 2: A High-Fidelity Singing Voice Synthesizer Based on Generative Adversarial Network

Oct 28, 2022