Picture for Jing-Xuan Zhang

Jing-Xuan Zhang

Self-Supervised Audio-Visual Speech Representations Learning By Multimodal Self-Distillation

Add code
Dec 06, 2022
Viaarxiv icon

Is Lip Region-of-Interest Sufficient for Lipreading?

Add code
Jun 02, 2022
Figure 1 for Is Lip Region-of-Interest Sufficient for Lipreading?
Figure 2 for Is Lip Region-of-Interest Sufficient for Lipreading?
Figure 3 for Is Lip Region-of-Interest Sufficient for Lipreading?
Figure 4 for Is Lip Region-of-Interest Sufficient for Lipreading?
Viaarxiv icon

TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos

Add code
Nov 19, 2020
Figure 1 for TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos
Figure 2 for TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos
Figure 3 for TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos
Figure 4 for TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos
Viaarxiv icon

Forward Attention in Sequence-to-sequence Acoustic Modelling for Speech Synthesis

Add code
Jul 18, 2018
Figure 1 for Forward Attention in Sequence-to-sequence Acoustic Modelling for Speech Synthesis
Figure 2 for Forward Attention in Sequence-to-sequence Acoustic Modelling for Speech Synthesis
Figure 3 for Forward Attention in Sequence-to-sequence Acoustic Modelling for Speech Synthesis
Figure 4 for Forward Attention in Sequence-to-sequence Acoustic Modelling for Speech Synthesis
Viaarxiv icon