Picture for Yuejiao Wang

Yuejiao Wang

Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions

Add code
Sep 13, 2024
Figure 1 for Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions
Figure 2 for Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions
Figure 3 for Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions
Figure 4 for Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions
Viaarxiv icon

Large Language Model-based FMRI Encoding of Language Functions for Subjects with Neurocognitive Disorder

Add code
Jul 15, 2024
Viaarxiv icon

Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System

Add code
Jul 13, 2024
Viaarxiv icon

Exploiting Audio-Visual Features with Pretrained AV-HuBERT for Multi-Modal Dysarthric Speech Reconstruction

Add code
Jan 31, 2024
Viaarxiv icon

UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization

Add code
Jan 26, 2024
Viaarxiv icon

A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One

Add code
Mar 05, 2023
Viaarxiv icon