Picture for Tianrui Wang

Tianrui Wang

Reducing the Gap Between Pretrained Speech Enhancement and Recognition Models Using a Real Speech-Trained Bridging Module

Add code
Jan 05, 2025
Viaarxiv icon

Adapting Whisper for Code-Switching through Encoding Refining and Language-Aware Decoding

Add code
Dec 24, 2024
Viaarxiv icon

Time-Graph Frequency Representation with Singular Value Decomposition for Neural Speech Enhancement

Add code
Dec 24, 2024
Viaarxiv icon

Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement

Add code
Dec 21, 2024
Viaarxiv icon

EmoPro: A Prompt Selection Strategy for Emotional Expression in LM-based Speech Synthesis

Add code
Sep 27, 2024
Viaarxiv icon

Progressive Residual Extraction based Pre-training for Speech Representation Learning

Add code
Aug 31, 2024
Viaarxiv icon

VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing

Add code
Aug 11, 2024
Viaarxiv icon

A Refining Underlying Information Framework for Monaural Speech Enhancement

Add code
Dec 24, 2023
Viaarxiv icon

On decoder-only architecture for speech-to-text and large language model integration

Add code
Jul 14, 2023
Viaarxiv icon

VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation

Add code
May 25, 2023
Viaarxiv icon