Title: Towards Diverse Paragraph Captioning for Untrimmed Videos

May 30, 2021

Yuqing Song, Shizhe Chen, Qin Jin

Abstract: Video paragraph captioning aims to describe multiple events in untrimmed videos with descriptive paragraphs. Existing approaches mainly solve the problem in two steps: event detection followed by event captioning. This two-step approach makes the quality of the generated paragraphs highly dependent on the accuracy of event proposal detection, which is itself a challenging task. In this paper, we propose a paragraph captioning model that eschews the problematic event detection stage and directly generates paragraphs for untrimmed videos. To describe coherent and diverse events, we propose to enhance conventional temporal attention with dynamic video memories, which progressively expose new video features and suppress over-accessed video content to control the visual focus of the model. In addition, a diversity-driven training strategy is proposed to improve the diversity of paragraphs from the language perspective. Considering that untrimmed videos generally contain massive but redundant frames, we further augment the video encoder with keyframe awareness to improve efficiency. Experimental results on the ActivityNet and Charades datasets show that our proposed model significantly outperforms state-of-the-art methods on both accuracy and diversity metrics without using any event boundary annotations. Code will be released at https://github.com/syuqings/video-paragraph.
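The idea of suppressing over-accessed video content can be illustrated with a small sketch. The code below is not the authors' dynamic video memory; it substitutes a simple coverage-style penalty, where each frame's attention score is reduced in proportion to the attention it has already received. The function names, the `penalty` parameter, and the toy feature vectors are all assumptions made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend_with_coverage(query, features, coverage, penalty=1.0):
    """One temporal attention step with a coverage penalty (illustrative only).

    query:    decoder state as a list of floats
    features: one feature vector (list of floats) per video frame
    coverage: attention mass each frame has accumulated so far
    Returns (weights, context, new_coverage).
    """
    # Dot-product relevance, minus a penalty for frames already attended to.
    scores = [
        sum(q * f for q, f in zip(query, feat)) - penalty * cov
        for feat, cov in zip(features, coverage)
    ]
    weights = softmax(scores)
    dim = len(features[0])
    # Context vector: attention-weighted sum of the frame features.
    context = [
        sum(w * feat[d] for w, feat in zip(weights, features))
        for d in range(dim)
    ]
    # Accumulate attention so later steps favor less-visited frames.
    new_coverage = [c + w for c, w in zip(coverage, weights)]
    return weights, context, new_coverage


# Toy example: three frames, the first two look alike to the query.
features = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
query = [1.0, 0.0]
coverage = [0.0, 0.0, 0.0]
w1, _, coverage = attend_with_coverage(query, features, coverage)
w2, _, coverage = attend_with_coverage(query, features, coverage)
```

With the same query at both steps, the third frame receives more weight in `w2` than in `w1`, because the first two frames have already absorbed most of the attention.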

* Accepted by CVPR 2021
