Picture for Kohei Uehara

Kohei Uehara

Memory-Maze: Scenario Driven Benchmark and Visual Language Navigation Model for Guiding Blind People

Add code
May 11, 2024
Viaarxiv icon

Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation

Add code
Jan 18, 2024
Figure 1 for Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation
Figure 2 for Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation
Figure 3 for Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation
Figure 4 for Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation
Viaarxiv icon

Learning by Asking Questions for Knowledge-based Novel Object Recognition

Add code
Oct 12, 2022
Figure 1 for Learning by Asking Questions for Knowledge-based Novel Object Recognition
Figure 2 for Learning by Asking Questions for Knowledge-based Novel Object Recognition
Figure 3 for Learning by Asking Questions for Knowledge-based Novel Object Recognition
Figure 4 for Learning by Asking Questions for Knowledge-based Novel Object Recognition
Viaarxiv icon

K-VQG: Knowledge-aware Visual Question Generation for Common-sense Acquisition

Add code
Mar 15, 2022
Figure 1 for K-VQG: Knowledge-aware Visual Question Generation for Common-sense Acquisition
Figure 2 for K-VQG: Knowledge-aware Visual Question Generation for Common-sense Acquisition
Figure 3 for K-VQG: Knowledge-aware Visual Question Generation for Common-sense Acquisition
Figure 4 for K-VQG: Knowledge-aware Visual Question Generation for Common-sense Acquisition
Viaarxiv icon

ViNTER: Image Narrative Generation with Emotion-Arc-Aware Transformer

Add code
Feb 15, 2022
Figure 1 for ViNTER: Image Narrative Generation with Emotion-Arc-Aware Transformer
Figure 2 for ViNTER: Image Narrative Generation with Emotion-Arc-Aware Transformer
Figure 3 for ViNTER: Image Narrative Generation with Emotion-Arc-Aware Transformer
Figure 4 for ViNTER: Image Narrative Generation with Emotion-Arc-Aware Transformer
Viaarxiv icon

Unsupervised Keyword Extraction for Full-sentence VQA

Add code
Nov 23, 2019
Figure 1 for Unsupervised Keyword Extraction for Full-sentence VQA
Figure 2 for Unsupervised Keyword Extraction for Full-sentence VQA
Figure 3 for Unsupervised Keyword Extraction for Full-sentence VQA
Figure 4 for Unsupervised Keyword Extraction for Full-sentence VQA
Viaarxiv icon

Interactive Video Retrieval with Dialog

Add code
May 07, 2019
Figure 1 for Interactive Video Retrieval with Dialog
Figure 2 for Interactive Video Retrieval with Dialog
Figure 3 for Interactive Video Retrieval with Dialog
Figure 4 for Interactive Video Retrieval with Dialog
Viaarxiv icon

Visual Question Generation for Class Acquisition of Unknown Objects

Add code
Aug 06, 2018
Figure 1 for Visual Question Generation for Class Acquisition of Unknown Objects
Figure 2 for Visual Question Generation for Class Acquisition of Unknown Objects
Figure 3 for Visual Question Generation for Class Acquisition of Unknown Objects
Figure 4 for Visual Question Generation for Class Acquisition of Unknown Objects
Viaarxiv icon