Title: Leveraging Video Descriptions to Learn Video Question Answering

Dec 19, 2016

Kuo-Hao Zeng, Tseng-Hung Chen, Ching-Yao Chuang, Yuan-Hong Liao, Juan Carlos Niebles, Min Sun

Abstract: We propose a scalable approach to learning video-based question answering (QA): answering a "free-form natural language question" about video content. Our approach automatically harvests a large number of videos and descriptions freely available online. Then, a large number of candidate QA pairs are automatically generated from the descriptions rather than manually annotated. Next, we use these candidate QA pairs to train a number of video-based QA methods extended from MN (Sukhbaatar et al. 2015), VQA (Antol et al. 2015), SA (Yao et al. 2015), and SS (Venugopalan et al. 2015). To handle imperfect candidate QA pairs, we propose a self-paced learning procedure that iteratively identifies them and mitigates their effects during training. Finally, we evaluate performance on manually generated video-based QA pairs. The results show that our self-paced learning procedure is effective, and that the extended SS model outperforms various baselines.
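
The general idea behind self-paced learning can be illustrated with a toy example. The sketch below is not the procedure from the paper: it assumes a one-parameter linear model, a squared-error loss, and a fixed loss threshold, and the names `fit_slope`, `self_paced_fit`, and `threshold` are hypothetical. It only shows the loop of fitting, scoring each training sample, and refitting on the samples that look reliable.

```python
def fit_slope(xs, ys, weights):
    """Weighted least-squares slope for the model y = w * x (no intercept)."""
    num = sum(v * x * y for x, y, v in zip(xs, ys, weights))
    den = sum(v * x * x for x, v in zip(xs, weights))
    return num / den if den else 0.0


def self_paced_fit(xs, ys, threshold, rounds=5):
    """Alternate between fitting the model and re-selecting 'easy' samples.

    A sample is kept (weight 1.0) when its squared error under the current
    model is at most `threshold`; otherwise it is ignored (weight 0.0).
    This hard 0/1 weighting is an assumption made for illustration.
    """
    weights = [1.0] * len(xs)          # start by trusting every sample
    w = fit_slope(xs, ys, weights)
    for _ in range(rounds):
        losses = [(y - w * x) ** 2 for x, y in zip(xs, ys)]
        weights = [1.0 if loss <= threshold else 0.0 for loss in losses]
        if not any(weights):           # nothing left to train on
            break
        w = fit_slope(xs, ys, weights)
    return w, weights


# Toy data: y = 2 * x, except the third sample, which is corrupted.
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
ys = [2.0, 4.0, -6.0, 8.0, 10.0, 12.0, 14.0, 16.0]
slope, kept = self_paced_fit(xs, ys, threshold=4.0)
print(slope, kept)
```

On this toy data the corrupted sample receives weight 0.0 after the first round, and the slope is then refit on the remaining seven samples. A fixed threshold is the simplest choice; schedules that change the threshold over rounds, or soft weights in place of 0/1, are common variations.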

* 7 pages, 5 figures. Accepted to AAAI 2017. Camera-ready version
