Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Bofan Xue

A Dataset and Benchmarks for Multimedia Social Analysis

Jun 05, 2020

Bofan Xue, David Chan, John Canny

Figure 1 for A Dataset and Benchmarks for Multimedia Social Analysis

Figure 2 for A Dataset and Benchmarks for Multimedia Social Analysis

Figure 3 for A Dataset and Benchmarks for Multimedia Social Analysis

Figure 4 for A Dataset and Benchmarks for Multimedia Social Analysis

Abstract:We present a new publicly available dataset with the goal of advancing multi-modality learning by offering vision and language data within the same context. This is achieved by obtaining data from a social media website with posts containing multiple paired images/videos and text, along with comment trees containing images/videos and/or text. With a total of 677k posts, 2.9 million post images, 488k post videos, 1.4 million comment images, 4.6 million comment videos, and 96.9 million comments, data from different modalities can be jointly used to improve performances for a variety of tasks such as image captioning, image classification, next frame prediction, sentiment analysis, and language modeling. We present a wide range of statistics for our dataset. Finally, we provide baseline performance analysis for one of the regression tasks using pre-trained models and several fully connected networks.

* Published as a workshop paper at "Multimodality Learning" (CVPR 2020)

Via

Access Paper or Ask Questions