Picture for Moyuru Yamada

Moyuru Yamada

Semantic Graph Consistency: Going Beyond Patches for Regularizing Self-Supervised Vision Transformers

Add code
Jun 18, 2024
Viaarxiv icon

GLoD: Composing Global Contexts and Local Details in Image Generation

Add code
Apr 23, 2024
Viaarxiv icon

D3: Data Diversity Design for Systematic Generalization in Visual Question Answering

Add code
Sep 15, 2023
Viaarxiv icon

HICO-DET-SG and V-COCO-SG: New Data Splits to Evaluate Systematic Generalization in Human-Object Interaction Detection

Add code
May 17, 2023
Viaarxiv icon

Detect Only What You Specify : Object Detection with Linguistic Target

Add code
Nov 18, 2022
Figure 1 for Detect Only What You Specify : Object Detection with Linguistic Target
Figure 2 for Detect Only What You Specify : Object Detection with Linguistic Target
Figure 3 for Detect Only What You Specify : Object Detection with Linguistic Target
Figure 4 for Detect Only What You Specify : Object Detection with Linguistic Target
Viaarxiv icon

Transformer Module Networks for Systematic Generalization in Visual Question Answering

Add code
Jan 27, 2022
Figure 1 for Transformer Module Networks for Systematic Generalization in Visual Question Answering
Figure 2 for Transformer Module Networks for Systematic Generalization in Visual Question Answering
Figure 3 for Transformer Module Networks for Systematic Generalization in Visual Question Answering
Figure 4 for Transformer Module Networks for Systematic Generalization in Visual Question Answering
Viaarxiv icon