Hammad A. Ayyubi

Video Summarization: Towards Entity-Aware Captions

Dec 01, 2023

Learning from Children: Improving Image-Caption Pretraining via Curriculum

May 30, 2023

IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models

May 24, 2023

Multimodal Event Graphs: Towards Event Centric Understanding of Multimodal World

Jun 14, 2022

Generating Rationales in Visual Question Answering

Apr 04, 2020

Progressive Growing of Neural ODEs

Mar 08, 2020

GANspection

Oct 21, 2019

Enforcing Reasoning in Visual Commonsense Reasoning

Oct 21, 2019