Xianrui Zheng

SOT Triggered Neural Clustering for Speaker Attributed ASR

Jul 02, 2024

Conditional Diffusion Model for Target Speaker Extraction

Oct 07, 2023

Can Contextual Biasing Remain Effective with Whisper and GPT-2?

Jun 02, 2023

Self-Supervised Learning-Based Source Separation for Meeting Data

Apr 03, 2023

Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription

Jul 08, 2022

Multi-turn RNN-T for streaming recognition of multi-party speech

Dec 19, 2021

Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition

Jul 29, 2021