Assma Boughoula

DeepQAMVS: Query-Aware Hierarchical Pointer Networks for Multi-Video Summarization

May 13, 2021

Safa Messaoud, Ismini Lourentzou, Assma Boughoula, Mona Zehni, Zhizhen Zhao, Chengxiang Zhai, Alexander G. Schwing

Figures 1 to 4 for DeepQAMVS: Query-Aware Hierarchical Pointer Networks for Multi-Video Summarization

Abstract: The recent growth of web video sharing platforms has increased the demand for systems that can efficiently browse, retrieve and summarize video content. Query-aware multi-video summarization is a promising technique that caters to this demand. In this work, we introduce a novel Query-Aware Hierarchical Pointer Network for Multi-Video Summarization, termed DeepQAMVS, that jointly optimizes multiple criteria: (1) conciseness, (2) representativeness of important query-relevant events and (3) chronological soundness. We design a hierarchical attention model that factorizes over three distributions, each collecting evidence from a different modality, followed by a pointer network that selects frames to include in the summary. DeepQAMVS is trained with reinforcement learning, incorporating rewards that capture representativeness, diversity, query-adaptability and temporal coherence. We achieve state-of-the-art results on the MVS1K dataset, with inference time scaling linearly with the number of input video frames.
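
The abstract describes a pointer network that selects frames and a diversity reward used during training, but gives no formulas. The sketch below is only an illustration of the general idea, not the authors' model: the names `pointer_select` and `diversity_reward`, the dot-product scoring, the greedy choice, the state update, and the cosine-based reward are all assumptions made for this example.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    # Numerically stable softmax; masked entries (-inf) get probability 0.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def pointer_select(frames, query, budget):
    """Greedily pick up to `budget` distinct frame indices (hypothetical helper).

    Assumption: frames and query are plain feature vectors of equal length,
    and relevance is a dot product with a running decoder state.
    """
    selected = []
    state = list(query)  # assumed: decoder state starts from the query embedding
    for _ in range(min(budget, len(frames))):
        # Score every frame; mask frames that were already chosen.
        scores = [
            dot(f, state) if i not in selected else -math.inf
            for i, f in enumerate(frames)
        ]
        probs = softmax(scores)
        idx = max(range(len(frames)), key=probs.__getitem__)
        selected.append(idx)
        # Assumed state update: average the state with the chosen frame.
        state = [(s + f) / 2 for s, f in zip(state, frames[idx])]
    return selected


def diversity_reward(frames, selected):
    """Mean pairwise (1 - cosine similarity) over selected frames (assumed form)."""
    pairs = [(i, j) for i in selected for j in selected if i < j]
    if not pairs:
        return 0.0

    def cosine(a, b):
        return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)))

    return sum(1 - cosine(frames[i], frames[j]) for i, j in pairs) / len(pairs)


frames = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1]]
summary = pointer_select(frames, query=[1.0, 0.0], budget=2)
reward = diversity_reward(frames, summary)
```

Each selection step here scores all frames once, so the cost of building a fixed-length summary grows linearly with the number of input frames. In a reinforcement learning setup, a reward like the one above would be combined with other terms and used to update the selection policy; that training loop is omitted here.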
