Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Alexandra Fenster

Audiovisual Highlight Detection in Videos

Feb 11, 2021

Karel Mundnich, Alexandra Fenster, Aparna Khare, Shiva Sundaram

Figure 1 for Audiovisual Highlight Detection in Videos

Figure 2 for Audiovisual Highlight Detection in Videos

Figure 3 for Audiovisual Highlight Detection in Videos

Figure 4 for Audiovisual Highlight Detection in Videos

Abstract:In this paper, we test the hypothesis that interesting events in unstructured videos are inherently audiovisual. We combine deep image representations for object recognition and scene understanding with representations from an audiovisual affect recognition model. To this set, we include content agnostic audio-visual synchrony representations and mel-frequency cepstral coefficients to capture other intrinsic properties of audio. These features are used in a modular supervised model. We present results from two experiments: efficacy study of single features on the task, and an ablation study where we leave one feature out at a time. For the video summarization task, our results indicate that the visual features carry most information, and including audiovisual features improves over visual-only information. To better study the task of highlight detection, we run a pilot experiment with highlights annotations for a small subset of video clips and fine-tune our best model on it. Results indicate that we can transfer knowledge from the video summarization task to a model trained specifically for the task of highlight detection.

* 5 pages, 2 figures, conference paper

Via

Access Paper or Ask Questions