Picture for Junyi Ao

Junyi Ao

SA-WavLM: Speaker-Aware Self-Supervised Pre-training for Mixture Speech

Add code
Jul 03, 2024
Viaarxiv icon

SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words

Add code
Jun 19, 2024
Viaarxiv icon

Text-guided HuBERT: Self-Supervised Speech Pre-training via Generative Adversarial Networks

Add code
Feb 28, 2024
Figure 1 for Text-guided HuBERT: Self-Supervised Speech Pre-training via Generative Adversarial Networks
Figure 2 for Text-guided HuBERT: Self-Supervised Speech Pre-training via Generative Adversarial Networks
Figure 3 for Text-guided HuBERT: Self-Supervised Speech Pre-training via Generative Adversarial Networks
Figure 4 for Text-guided HuBERT: Self-Supervised Speech Pre-training via Generative Adversarial Networks
Viaarxiv icon

The NUS-HLT System for ICASSP2024 ICMC-ASR Grand Challenge

Add code
Dec 26, 2023
Viaarxiv icon

USED: Universal Speaker Extraction and Diarization

Add code
Sep 19, 2023
Viaarxiv icon

Self-Supervised Acoustic Word Embedding Learning via Correspondence Transformer Encoder

Add code
Jul 19, 2023
Viaarxiv icon

token2vec: A Joint Self-Supervised Pre-training Framework Using Unpaired Speech and Text

Add code
Oct 30, 2022
Viaarxiv icon

CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning

Add code
Oct 08, 2022
Figure 1 for CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
Figure 2 for CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
Figure 3 for CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
Figure 4 for CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
Viaarxiv icon

SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training

Add code
Oct 07, 2022
Figure 1 for SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training
Figure 2 for SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training
Figure 3 for SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training
Figure 4 for SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training
Viaarxiv icon

The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task

Add code
Jun 14, 2022
Figure 1 for The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task
Figure 2 for The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task
Figure 3 for The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task
Figure 4 for The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task
Viaarxiv icon