Title: Vision-Language Pre-Training for Multimodal Aspect-Based Sentiment Analysis

Apr 21, 2022

Yan Ling, Jianfei Yu, Rui Xia

Abstract: As an important task in sentiment analysis, Multimodal Aspect-Based Sentiment Analysis (MABSA) has attracted increasing attention in recent years. However, previous approaches either (i) use separately pre-trained visual and textual models, which ignore cross-modal alignment, or (ii) use vision-language models pre-trained with general pre-training tasks, which are inadequate for identifying fine-grained aspects, opinions, and their alignments across modalities. To tackle these limitations, we propose a task-specific Vision-Language Pre-training framework for MABSA (VLP-MABSA), which is a unified multimodal encoder-decoder architecture for all the pre-training and downstream tasks. We further design three types of task-specific pre-training tasks from the language, vision, and multimodal modalities, respectively. Experimental results show that our approach generally outperforms the state-of-the-art approaches on three MABSA subtasks. Further analysis demonstrates the effectiveness of each pre-training task. The source code is publicly released at https://github.com/NUSTM/VLP-MABSA.

* Accepted by ACL 2022 (long paper)
