Picture for Yufei Zha

Yufei Zha

DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction

Add code
Mar 02, 2024
Figure 1 for DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction
Figure 2 for DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction
Figure 3 for DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction
Figure 4 for DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction
Viaarxiv icon

UniST: Towards Unifying Saliency Transformer for Video Saliency Prediction and Detection

Add code
Sep 15, 2023
Viaarxiv icon

Induction Network: Audio-Visual Modality Gap-Bridging for Self-Supervised Sound Source Localization

Add code
Aug 09, 2023
Viaarxiv icon

FTFDNet: Learning to Detect Talking Face Video Manipulation with Tri-Modality Interaction

Add code
Jul 08, 2023
Viaarxiv icon

CASP-Net: Rethinking Video Saliency Prediction from an Audio-VisualConsistency Perceptual Perspective

Add code
Mar 11, 2023
Figure 1 for CASP-Net: Rethinking Video Saliency Prediction from an Audio-VisualConsistency Perceptual Perspective
Figure 2 for CASP-Net: Rethinking Video Saliency Prediction from an Audio-VisualConsistency Perceptual Perspective
Figure 3 for CASP-Net: Rethinking Video Saliency Prediction from an Audio-VisualConsistency Perceptual Perspective
Figure 4 for CASP-Net: Rethinking Video Saliency Prediction from an Audio-VisualConsistency Perceptual Perspective
Viaarxiv icon

An Audio-Visual Attention Based Multimodal Network for Fake Talking Face Videos Detection

Add code
Mar 10, 2022
Figure 1 for An Audio-Visual Attention Based Multimodal Network for Fake Talking Face Videos Detection
Figure 2 for An Audio-Visual Attention Based Multimodal Network for Fake Talking Face Videos Detection
Figure 3 for An Audio-Visual Attention Based Multimodal Network for Fake Talking Face Videos Detection
Figure 4 for An Audio-Visual Attention Based Multimodal Network for Fake Talking Face Videos Detection
Viaarxiv icon

Attention-Based Lip Audio-Visual Synthesis for Talking Face Generation in the Wild

Add code
Mar 08, 2022
Figure 1 for Attention-Based Lip Audio-Visual Synthesis for Talking Face Generation in the Wild
Figure 2 for Attention-Based Lip Audio-Visual Synthesis for Talking Face Generation in the Wild
Figure 3 for Attention-Based Lip Audio-Visual Synthesis for Talking Face Generation in the Wild
Figure 4 for Attention-Based Lip Audio-Visual Synthesis for Talking Face Generation in the Wild
Viaarxiv icon

Audio-visual speech separation based on joint feature representation with cross-modal attention

Add code
Mar 05, 2022
Figure 1 for Audio-visual speech separation based on joint feature representation with cross-modal attention
Figure 2 for Audio-visual speech separation based on joint feature representation with cross-modal attention
Figure 3 for Audio-visual speech separation based on joint feature representation with cross-modal attention
Figure 4 for Audio-visual speech separation based on joint feature representation with cross-modal attention
Viaarxiv icon

Look\&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement

Add code
Mar 04, 2022
Figure 1 for Look\&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement
Figure 2 for Look\&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement
Figure 3 for Look\&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement
Figure 4 for Look\&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement
Viaarxiv icon

Unsupervised Cross-Modal Distillation for Thermal Infrared Tracking

Add code
Jul 31, 2021
Figure 1 for Unsupervised Cross-Modal Distillation for Thermal Infrared Tracking
Figure 2 for Unsupervised Cross-Modal Distillation for Thermal Infrared Tracking
Figure 3 for Unsupervised Cross-Modal Distillation for Thermal Infrared Tracking
Figure 4 for Unsupervised Cross-Modal Distillation for Thermal Infrared Tracking
Viaarxiv icon