Zhiyun Fan

SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR

Mar 04, 2024

Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire

Nov 17, 2022

Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fire

Jun 27, 2022

Exploring wav2vec 2.0 on speaker verification and language identification

Jan 14, 2021

Speaker-aware speech-transformer

Jan 02, 2020

Unsupervised pre-traing for sequence to sequence speech recognition

Oct 28, 2019