Shinya Wada

xMTrans: Temporal Attentive Cross-Modality Fusion Transformer for Long-Term Traffic Prediction

May 08, 2024
Via arXiv

VideoAdviser: Video Knowledge Distillation for Multimodal Transfer Learning

Sep 27, 2023
Via arXiv

VQA-GNN: Reasoning with Multimodal Semantic Graph for Visual Question Answering

May 23, 2022
Via arXiv