Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering

Apr 03, 2016

Kan Chen, Jiang Wang, Liang-Chieh Chen, Haoyuan Gao, Wei Xu, Ram Nevatia

Figure 1 for ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering

Figure 2 for ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering

Figure 3 for ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering

Figure 4 for ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering

Share this with someone who'll enjoy it:

Abstract:We propose a novel attention based deep learning architecture for visual question answering task (VQA). Given an image and an image related natural language question, VQA generates the natural language answer for the question. Generating the correct answers requires the model's attention to focus on the regions corresponding to the question, because different questions inquire about the attributes of different image regions. We introduce an attention based configurable convolutional neural network (ABC-CNN) to learn such question-guided attention. ABC-CNN determines an attention map for an image-question pair by convolving the image feature map with configurable convolutional kernels derived from the question's semantics. We evaluate the ABC-CNN architecture on three benchmark VQA datasets: Toronto COCO-QA, DAQUAR, and VQA dataset. ABC-CNN model achieves significant improvements over state-of-the-art methods on these datasets. The question-guided attention generated by ABC-CNN is also shown to reflect the regions that are highly relevant to the questions.

View paper on

Share this with someone who'll enjoy it:

Title:ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering

Paper and Code