Picture for Longbiao Wang

Longbiao Wang

Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models

Add code
Jan 24, 2025
Viaarxiv icon

Reducing the Gap Between Pretrained Speech Enhancement and Recognition Models Using a Real Speech-Trained Bridging Module

Add code
Jan 05, 2025
Viaarxiv icon

Adapting Whisper for Code-Switching through Encoding Refining and Language-Aware Decoding

Add code
Dec 24, 2024
Figure 1 for Adapting Whisper for Code-Switching through Encoding Refining and Language-Aware Decoding
Figure 2 for Adapting Whisper for Code-Switching through Encoding Refining and Language-Aware Decoding
Figure 3 for Adapting Whisper for Code-Switching through Encoding Refining and Language-Aware Decoding
Viaarxiv icon

Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement

Add code
Dec 21, 2024
Viaarxiv icon

Enriching Multimodal Sentiment Analysis through Textual Emotional Descriptions of Visual-Audio Content

Add code
Dec 12, 2024
Viaarxiv icon

Progressive Residual Extraction based Pre-training for Speech Representation Learning

Add code
Aug 31, 2024
Viaarxiv icon

VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing

Add code
Aug 11, 2024
Viaarxiv icon

An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios

Add code
Jun 13, 2024
Viaarxiv icon

ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge

Add code
Jan 07, 2024
Figure 1 for ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge
Figure 2 for ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge
Viaarxiv icon

A Refining Underlying Information Framework for Monaural Speech Enhancement

Add code
Dec 24, 2023
Viaarxiv icon