Title: Effectively leveraging Multi-modal Features for Movie Genre Classification

Mar 24, 2022

Zhongping Zhang, Yiwen Gu, Bryan A. Plummer, Xin Miao, Jiayi Liu, Huayan Wang


Abstract: Movie genre classification has been widely studied in recent years due to its applications in video editing, summarization, and recommendation. Prior work has typically addressed this task by predicting genres based solely on visual content. As a result, these methods often perform poorly on genres such as documentary or musical, where non-visual modalities like audio or language play an important role in correct classification. In addition, analyzing long videos at the frame level incurs high computational cost, which makes prediction less efficient. To address these two issues, we propose MMShot, a multi-modal approach that leverages shot information to classify video genres efficiently and effectively. We evaluate our method on MovieNet and Condensed Movies for genre classification, achieving a 17% to 21% improvement in mean Average Precision (mAP) over the state of the art. Extensive experiments demonstrate the ability of MMShot to analyze long videos and uncover the correlations between genres and multiple movie elements. We also demonstrate our approach's ability to generalize by evaluating it on the scene boundary detection task, achieving a 1.1% improvement in Average Precision (AP) over the state of the art.
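For readers unfamiliar with the metric, the sketch below shows one common way to compute mean Average Precision for multi-label genre prediction: Average Precision is computed per genre from the ranked scores, then averaged across genres. The abstract does not specify the exact evaluation protocol, so the macro-averaging, the tie handling, and the function names `average_precision` and `mean_average_precision` are assumptions made for illustration, not the authors' implementation.

```python
from typing import List, Sequence


def average_precision(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AP for a single genre: mean of precision@k over the ranks k
    at which a positive (label == 1) movie appears."""
    # Rank movies by descending score; ties keep their original order.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    hits = 0
    precision_sum = 0.0
    for rank, idx in enumerate(order, start=1):
        if labels[idx] == 1:
            hits += 1
            precision_sum += hits / rank
    # Assumption: a genre with no positives contributes an AP of 0.0.
    return precision_sum / hits if hits else 0.0


def mean_average_precision(
    label_matrix: List[List[int]], score_matrix: List[List[float]]
) -> float:
    """Macro-averaged AP across genres.

    label_matrix[i][g] is 1 if movie i has genre g, else 0.
    score_matrix[i][g] is the predicted score of genre g for movie i.
    Genres with no positive movie are skipped (an assumption).
    """
    num_genres = len(label_matrix[0])
    aps = []
    for g in range(num_genres):
        genre_labels = [row[g] for row in label_matrix]
        genre_scores = [row[g] for row in score_matrix]
        if any(genre_labels):
            aps.append(average_precision(genre_labels, genre_scores))
    return sum(aps) / len(aps) if aps else 0.0


if __name__ == "__main__":
    # Toy data: three movies, two genres (made up for illustration).
    labels = [[1, 0], [0, 1], [1, 1]]
    scores = [[0.9, 0.2], [0.8, 0.7], [0.3, 0.6]]
    print(f"mAP = {mean_average_precision(labels, scores):.4f}")
```

Macro-averaging weights every genre equally, so rare genres such as documentary or musical count as much as frequent ones. A micro-averaged variant would instead pool all movie-genre pairs before ranking.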
