Picture for Afsaneh Fazly

Afsaneh Fazly

CIC-BART-SSA: Controllable Image Captioning with Structured Semantic Augmentation

Add code
Jul 16, 2024
Viaarxiv icon

Graph Guided Question Answer Generation for Procedural Question-Answering

Add code
Jan 24, 2024
Figure 1 for Graph Guided Question Answer Generation for Procedural Question-Answering
Figure 2 for Graph Guided Question Answer Generation for Procedural Question-Answering
Figure 3 for Graph Guided Question Answer Generation for Procedural Question-Answering
Figure 4 for Graph Guided Question Answer Generation for Procedural Question-Answering
Viaarxiv icon

GePSAn: Generative Procedure Step Anticipation in Cooking Videos

Add code
Oct 12, 2023
Figure 1 for GePSAn: Generative Procedure Step Anticipation in Cooking Videos
Figure 2 for GePSAn: Generative Procedure Step Anticipation in Cooking Videos
Figure 3 for GePSAn: Generative Procedure Step Anticipation in Cooking Videos
Figure 4 for GePSAn: Generative Procedure Step Anticipation in Cooking Videos
Viaarxiv icon

SAGE: Saliency-Guided Mixup with Optimal Rearrangements

Add code
Oct 31, 2022
Viaarxiv icon

Visual Semantic Parsing: From Images to Abstract Meaning Representation

Add code
Oct 27, 2022
Viaarxiv icon

Graph2Vid: Flow graph to Video Grounding forWeakly-supervised Multi-Step Localization

Add code
Oct 10, 2022
Figure 1 for Graph2Vid: Flow graph to Video Grounding forWeakly-supervised Multi-Step Localization
Figure 2 for Graph2Vid: Flow graph to Video Grounding forWeakly-supervised Multi-Step Localization
Figure 3 for Graph2Vid: Flow graph to Video Grounding forWeakly-supervised Multi-Step Localization
Figure 4 for Graph2Vid: Flow graph to Video Grounding forWeakly-supervised Multi-Step Localization
Viaarxiv icon

Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations

Add code
Apr 20, 2022
Figure 1 for Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations
Figure 2 for Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations
Figure 3 for Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations
Figure 4 for Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations
Viaarxiv icon

VASTA: A Vision and Language-assisted Smartphone Task Automation System

Add code
Nov 04, 2019
Figure 1 for VASTA: A Vision and Language-assisted Smartphone Task Automation System
Figure 2 for VASTA: A Vision and Language-assisted Smartphone Task Automation System
Figure 3 for VASTA: A Vision and Language-assisted Smartphone Task Automation System
Figure 4 for VASTA: A Vision and Language-assisted Smartphone Task Automation System
Viaarxiv icon