Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Transfer Learning via Unsupervised Task Discovery for Visual Question Answering

Oct 03, 2018

Hyeonwoo Noh, Taehoon Kim, Jonghwan Mun, Bohyung Han

Figure 1 for Transfer Learning via Unsupervised Task Discovery for Visual Question Answering

Figure 2 for Transfer Learning via Unsupervised Task Discovery for Visual Question Answering

Figure 3 for Transfer Learning via Unsupervised Task Discovery for Visual Question Answering

Figure 4 for Transfer Learning via Unsupervised Task Discovery for Visual Question Answering

Share this with someone who'll enjoy it:

Abstract:We study how to leverage off-the-shelf visual and linguistic data to cope with out-of-vocabulary answers in visual question answering. Existing large-scale visual data with annotations such as image class labels, bounding boxes and region descriptions are good sources for learning rich and diverse visual concepts. However, it is not straightforward how the visual concepts should be captured and transferred to visual question answering models due to missing link between question dependent answering models and visual data without question or task specification. We tackle this problem in two steps: 1) learning a task conditional visual classifier based on unsupervised task discovery and 2) transferring and adapting the task conditional visual classifier to visual question answering models. Specifically, we employ linguistic knowledge sources such as structured lexical database (e.g. Wordnet) and visual descriptions for unsupervised task discovery, and adapt a learned task conditional visual classifier to answering unit in a visual question answering model. We empirically show that the proposed algorithm generalizes to unseen answers successfully using the knowledge transferred from the visual data.

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Transfer Learning via Unsupervised Task Discovery for Visual Question Answering

Paper and Code