Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding

Jul 14, 2017

Fu Li, Chuang Gan, Xiao Liu, Yunlong Bian, Xiang Long, Yandong Li, Zhichao Li, Jie Zhou, Shilei Wen

Figure 1 for Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding

Figure 2 for Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding

Figure 3 for Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding

Figure 4 for Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding

Share this with someone who'll enjoy it:

Abstract:This paper describes our solution for the video recognition task of the Google Cloud and YouTube-8M Video Understanding Challenge that ranked the 3rd place. Because the challenge provides pre-extracted visual and audio features instead of the raw videos, we mainly investigate various temporal modeling approaches to aggregate the frame-level features for multi-label video recognition. Our system contains three major components: two-stream sequence model, fast-forward sequence model and temporal residual neural networks. Experiment results on the challenging Youtube-8M dataset demonstrate that our proposed temporal modeling approaches can significantly improve existing temporal modeling approaches in the large-scale video recognition tasks. To be noted, our fast-forward LSTM with a depth of 7 layers achieves 82.75% in term of GAP@20 on the Kaggle Public test set.

* To appear on CVPR 2017 YouTube-8M Workshop(Rank 3rd out of 650 teams)

View paper on

Share this with someone who'll enjoy it:

Title:Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding

Paper and Code