Picture for Lantian Li

Lantian Li

Neural Scoring, Not Embedding: A Novel Framework for Robust Speaker Verification

Add code
Oct 21, 2024
Viaarxiv icon

AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition

Add code
Oct 21, 2024
Figure 1 for AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition
Figure 2 for AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition
Figure 3 for AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition
Figure 4 for AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition
Viaarxiv icon

Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective

Add code
Sep 29, 2024
Figure 1 for Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective
Figure 2 for Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective
Figure 3 for Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective
Figure 4 for Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective
Viaarxiv icon

Serialized Output Training by Learned Dominance

Add code
Jul 04, 2024
Viaarxiv icon

CNVSRC 2023: The First Chinese Continuous Visual Speech Recognition Challenge

Add code
Jun 14, 2024
Viaarxiv icon

SE/BN Adapter: Parametric Efficient Domain Adaptation for Speaker Recognition

Add code
Jun 12, 2024
Figure 1 for SE/BN Adapter: Parametric Efficient Domain Adaptation for Speaker Recognition
Figure 2 for SE/BN Adapter: Parametric Efficient Domain Adaptation for Speaker Recognition
Figure 3 for SE/BN Adapter: Parametric Efficient Domain Adaptation for Speaker Recognition
Figure 4 for SE/BN Adapter: Parametric Efficient Domain Adaptation for Speaker Recognition
Viaarxiv icon

Zero-Shot Fake Video Detection by Audio-Visual Consistency

Add code
Jun 12, 2024
Figure 1 for Zero-Shot Fake Video Detection by Audio-Visual Consistency
Figure 2 for Zero-Shot Fake Video Detection by Audio-Visual Consistency
Figure 3 for Zero-Shot Fake Video Detection by Audio-Visual Consistency
Figure 4 for Zero-Shot Fake Video Detection by Audio-Visual Consistency
Viaarxiv icon

A Comprehensive Investigation on Speaker Augmentation for Speaker Recognition

Add code
Jun 11, 2024
Figure 1 for A Comprehensive Investigation on Speaker Augmentation for Speaker Recognition
Figure 2 for A Comprehensive Investigation on Speaker Augmentation for Speaker Recognition
Figure 3 for A Comprehensive Investigation on Speaker Augmentation for Speaker Recognition
Figure 4 for A Comprehensive Investigation on Speaker Augmentation for Speaker Recognition
Viaarxiv icon

How phonemes contribute to deep speaker models?

Add code
Feb 05, 2024
Viaarxiv icon

Adversarial Data Augmentation for Robust Speaker Verification

Add code
Feb 05, 2024
Viaarxiv icon