
Title: Dealing with Missing Modalities in the Visual Question Answer-Difference Prediction Task through Knowledge Distillation

Apr 13, 2021

Jae Won Cho, Dong-Jin Kim, Jinsoo Choi, Yunjae Jung, In So Kweon



Abstract: In this work, we address the missing-modality problem that arises in the Visual Question Answer-Difference prediction task and propose a novel method to solve it. The missing modality is the ground-truth answers, which are not available at test time, and we handle it with a privileged knowledge distillation scheme. To do so efficiently, we first introduce a model, the "Big" Teacher, that takes the image/question/answer triplet as its input and outperforms the baseline. We then use a combination of models to distill knowledge to a target network (the student) that takes only the image/question pair as its input. We evaluate our models on the VizWiz and VQA-V2 Answer Difference datasets and, through extensive experiments and ablations, show the performance of our method and a range of possibilities for future research.
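
The teacher-to-student transfer described in the abstract can be illustrated with a generic distillation objective. The sketch below is not the authors' loss: it assumes answer-difference prediction is treated as multi-label classification, and the function names, the `alpha` weighting, and the `temperature` parameter are illustrative assumptions. The teacher logits stand in for a model that saw the image/question/answer triplet, while the student logits come from a model that saw only the image/question pair.

```python
import math


def sigmoid(x):
    """Logistic function mapping a logit to a probability."""
    return 1.0 / (1.0 + math.exp(-x))


def bce(probs, targets, eps=1e-7):
    """Mean binary cross-entropy; targets may be hard (0/1) or soft (in [0, 1])."""
    total = 0.0
    for p, t in zip(probs, targets):
        p = min(max(p, eps), 1.0 - eps)  # clip to avoid log(0)
        total += -(t * math.log(p) + (1.0 - t) * math.log(1.0 - p))
    return total / len(probs)


def distillation_loss(student_logits, teacher_logits, labels,
                      alpha=0.5, temperature=2.0):
    """Hypothetical combined loss: supervised term plus teacher-matching term.

    alpha weights the ground-truth term; (1 - alpha) weights the term that
    pulls the student's softened outputs toward the teacher's.
    """
    # Supervised term: student predictions against ground-truth labels.
    hard = bce([sigmoid(z) for z in student_logits], labels)
    # Distillation term: student against the teacher's softened probabilities.
    soft_student = [sigmoid(z / temperature) for z in student_logits]
    soft_teacher = [sigmoid(z / temperature) for z in teacher_logits]
    soft = bce(soft_student, soft_teacher)
    return alpha * hard + (1.0 - alpha) * soft


# Toy logits for three answer-difference labels (made-up values).
teacher_logits = [2.0, -1.0, 0.5]   # teacher saw image + question + answers
student_logits = [1.0, -0.5, 0.0]   # student saw image + question only
labels = [1.0, 0.0, 1.0]

loss = distillation_loss(student_logits, teacher_logits, labels)
print(f"combined loss: {loss:.4f}")
```

At test time only the student is run, so the ground-truth answers are never required as input; the teacher influences the student solely through the second loss term during training.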

* To appear in CVPR MULA Workshop
