Picture for Alireza Zareian

Alireza Zareian

Learning from Children: Improving Image-Caption Pretraining via Curriculum

Add code
May 30, 2023
Viaarxiv icon

GOCA: Guided Online Cluster Assignment for Self-Supervised Video Representation Learning

Add code
Jul 20, 2022
Figure 1 for GOCA: Guided Online Cluster Assignment for Self-Supervised Video Representation Learning
Figure 2 for GOCA: Guided Online Cluster Assignment for Self-Supervised Video Representation Learning
Figure 3 for GOCA: Guided Online Cluster Assignment for Self-Supervised Video Representation Learning
Figure 4 for GOCA: Guided Online Cluster Assignment for Self-Supervised Video Representation Learning
Viaarxiv icon

SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning

Add code
Dec 16, 2021
Figure 1 for SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning
Figure 2 for SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning
Figure 3 for SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning
Figure 4 for SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning
Viaarxiv icon

Open-Vocabulary Object Detection Using Captions

Add code
Nov 20, 2020
Figure 1 for Open-Vocabulary Object Detection Using Captions
Figure 2 for Open-Vocabulary Object Detection Using Captions
Figure 3 for Open-Vocabulary Object Detection Using Captions
Figure 4 for Open-Vocabulary Object Detection Using Captions
Viaarxiv icon

Weakly-supervised VisualBERT: Pre-training without Parallel Images and Captions

Add code
Oct 24, 2020
Figure 1 for Weakly-supervised VisualBERT: Pre-training without Parallel Images and Captions
Figure 2 for Weakly-supervised VisualBERT: Pre-training without Parallel Images and Captions
Figure 3 for Weakly-supervised VisualBERT: Pre-training without Parallel Images and Captions
Figure 4 for Weakly-supervised VisualBERT: Pre-training without Parallel Images and Captions
Viaarxiv icon

Analogical Reasoning for Visually Grounded Language Acquisition

Add code
Jul 22, 2020
Figure 1 for Analogical Reasoning for Visually Grounded Language Acquisition
Figure 2 for Analogical Reasoning for Visually Grounded Language Acquisition
Figure 3 for Analogical Reasoning for Visually Grounded Language Acquisition
Figure 4 for Analogical Reasoning for Visually Grounded Language Acquisition
Viaarxiv icon

Learning Visual Commonsense for Robust Scene Graph Generation

Add code
Jun 17, 2020
Figure 1 for Learning Visual Commonsense for Robust Scene Graph Generation
Figure 2 for Learning Visual Commonsense for Robust Scene Graph Generation
Figure 3 for Learning Visual Commonsense for Robust Scene Graph Generation
Figure 4 for Learning Visual Commonsense for Robust Scene Graph Generation
Viaarxiv icon

Cross-media Structured Common Space for Multimedia Event Extraction

Add code
May 05, 2020
Figure 1 for Cross-media Structured Common Space for Multimedia Event Extraction
Figure 2 for Cross-media Structured Common Space for Multimedia Event Extraction
Figure 3 for Cross-media Structured Common Space for Multimedia Event Extraction
Figure 4 for Cross-media Structured Common Space for Multimedia Event Extraction
Viaarxiv icon

Weakly Supervised Visual Semantic Parsing

Add code
Jan 08, 2020
Figure 1 for Weakly Supervised Visual Semantic Parsing
Figure 2 for Weakly Supervised Visual Semantic Parsing
Figure 3 for Weakly Supervised Visual Semantic Parsing
Figure 4 for Weakly Supervised Visual Semantic Parsing
Viaarxiv icon

Bridging Knowledge Graphs to Generate Scene Graphs

Add code
Jan 07, 2020
Figure 1 for Bridging Knowledge Graphs to Generate Scene Graphs
Figure 2 for Bridging Knowledge Graphs to Generate Scene Graphs
Figure 3 for Bridging Knowledge Graphs to Generate Scene Graphs
Figure 4 for Bridging Knowledge Graphs to Generate Scene Graphs
Viaarxiv icon