Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Miriam Sánchez-Manzano

Universitat Pompeu Fabra, Barcelona, Spain

Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods

Oct 21, 2024

Adam Phillips, Daniel Grandes Rodriguez, Miriam Sánchez-Manzano, Alan Salvadó, Manuel Garin, Gloria Haro, Coloma Ballester

Figure 1 for Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods

Figure 2 for Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods

Figure 3 for Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods

Figure 4 for Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods

Abstract:In cinema, visual motifs are recurrent iconographic compositions that carry artistic or aesthetic significance. Their use throughout the history of visual arts and media is interesting to researchers and filmmakers alike. Our goal in this work is to recognise and classify these motifs by proposing a new machine learning model that uses a custom dataset to that end. We show how features extracted from a CLIP model can be leveraged by using a shallow network and an appropriate loss to classify images into 20 different motifs, with surprisingly good results: an $F_1$-score of 0.91 on our test set. We also present several ablation studies justifying the input features, architecture and hyperparameters used.

* 17 pages, 11 figures, one table, to be published in the conference proceedings of ECCV 2024

Via

Access Paper or Ask Questions