Xiang-Dong Zhou

Efficient End-to-End Video Question Answering with Pyramidal Multimodal Transformer

Feb 04, 2023

Multilevel Hierarchical Network with Multiscale Sampling for Video Question Answering

May 09, 2022

Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering

Sep 10, 2021

STA-VPR: Spatio-temporal Alignment for Visual Place Recognition

Apr 09, 2021

Recognizing Micro-expression in Video Clip with Adaptive Key-frame Mining

Sep 19, 2020