Jian-Shu Zhang

Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition

Feb 15, 2022
Via arXiv