Jesús Armenta-Segura

Anime Popularity Prediction Before Huge Investments: a Multimodal Approach Using Deep Learning

Jun 21, 2024

Jesús Armenta-Segura, Grigori Sidorov

Abstract: In the Japanese anime industry, predicting whether an upcoming product will be popular is crucial. This paper presents a dataset and methods for predicting anime popularity using a multimodal text-image dataset constructed exclusively from freely available internet sources. The dataset was built following rigorous standards based on real-life investment experiences. A deep neural network architecture that uses GPT-2 and ResNet-50 to embed the data was employed to investigate the correlation between the multimodal text-image input and a popularity score, revealing relevant strengths and weaknesses in the dataset. Model accuracy was measured with mean squared error (MSE). The best result, an MSE of 0.011, was obtained when considering all inputs and the full version of the deep neural network, compared to a benchmark MSE of 0.412 obtained with traditional TF-IDF and PILToTensor vectorizations. This is the first proposal to address this task with multimodal datasets, and it reveals the substantial benefit of incorporating image information, even when a relatively small model (ResNet-50) is used to embed the images.

* 13 pages, 6 figures, 11 tables
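
The abstract reports model quality as mean squared error (MSE) between predicted and reference popularity scores. As a minimal sketch of how that metric is computed, the snippet below uses made-up scores; the function name `mean_squared_error`, the sample values, and the assumption that scores lie in [0, 1] are illustrative and not taken from the paper.

```python
def mean_squared_error(y_true, y_pred):
    """Average of squared differences between reference and predicted scores."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one score is required")
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical popularity scores, assumed here to be normalized to [0, 1].
true_scores = [0.8, 0.5, 0.3, 0.9]
predicted_scores = [0.7, 0.6, 0.3, 0.8]

mse = mean_squared_error(true_scores, predicted_scores)
print(f"MSE = {mse:.4f}")  # prints "MSE = 0.0075"
```

Because the errors are squared before averaging, lower values mean predictions closer to the reference scores, which is the sense in which the reported 0.011 improves on the 0.412 baseline.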
