Picture for Meng Ge

Meng Ge

Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models

Add code
Jan 24, 2025
Viaarxiv icon

Reducing the Gap Between Pretrained Speech Enhancement and Recognition Models Using a Real Speech-Trained Bridging Module

Add code
Jan 05, 2025
Viaarxiv icon

Time-Graph Frequency Representation with Singular Value Decomposition for Neural Speech Enhancement

Add code
Dec 24, 2024
Viaarxiv icon

Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement

Add code
Dec 21, 2024
Viaarxiv icon

WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction

Add code
Sep 24, 2024
Figure 1 for WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction
Figure 2 for WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction
Figure 3 for WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction
Figure 4 for WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction
Viaarxiv icon

Progressive Residual Extraction based Pre-training for Speech Representation Learning

Add code
Aug 31, 2024
Viaarxiv icon

SA-WavLM: Speaker-Aware Self-Supervised Pre-training for Mixture Speech

Add code
Jul 03, 2024
Figure 1 for SA-WavLM: Speaker-Aware Self-Supervised Pre-training for Mixture Speech
Figure 2 for SA-WavLM: Speaker-Aware Self-Supervised Pre-training for Mixture Speech
Figure 3 for SA-WavLM: Speaker-Aware Self-Supervised Pre-training for Mixture Speech
Viaarxiv icon

sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks

Add code
Mar 09, 2024
Viaarxiv icon

An Empirical Study on the Impact of Positional Encoding in Transformer-based Monaural Speech Enhancement

Add code
Jan 18, 2024
Viaarxiv icon

Gradient weighting for speaker verification in extremely low Signal-to-Noise Ratio

Add code
Jan 05, 2024
Viaarxiv icon