Picture for Xiaolou Li

Xiaolou Li

AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition

Add code
Oct 21, 2024
Figure 1 for AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition
Figure 2 for AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition
Figure 3 for AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition
Figure 4 for AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition
Viaarxiv icon

Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective

Add code
Sep 29, 2024
Figure 1 for Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective
Figure 2 for Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective
Figure 3 for Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective
Figure 4 for Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective
Viaarxiv icon

CNVSRC 2023: The First Chinese Continuous Visual Speech Recognition Challenge

Add code
Jun 14, 2024
Viaarxiv icon

Zero-Shot Fake Video Detection by Audio-Visual Consistency

Add code
Jun 12, 2024
Viaarxiv icon

CN-Celeb-AV: A Multi-Genre Audio-Visual Dataset for Person Recognition

Add code
May 25, 2023
Viaarxiv icon