Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation

Nov 09, 2022

Bin Shan, Yaqian Han, Weichong Yin, Shuohuan Wang, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

Figure 1 for ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation

Figure 2 for ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation

Figure 3 for ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation

Figure 4 for ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation

Share this with someone who'll enjoy it:

Abstract:Recent cross-lingual cross-modal works attempt to extend Vision-Language Pre-training (VLP) models to non-English inputs and achieve impressive performance. However, these models focus only on understanding tasks utilizing encoder-only architecture. In this paper, we propose ERNIE-UniX2, a unified cross-lingual cross-modal pre-training framework for both generation and understanding tasks. ERNIE-UniX2 integrates multiple pre-training paradigms (e.g., contrastive learning and language modeling) based on encoder-decoder architecture and attempts to learn a better joint representation across languages and modalities. Furthermore, ERNIE-UniX2 can be seamlessly fine-tuned for varieties of generation and understanding downstream tasks. Pre-trained on both multilingual text-only and image-text datasets, ERNIE-UniX2 achieves SOTA results on various cross-lingual cross-modal generation and understanding tasks such as multimodal machine translation and multilingual visual question answering.

* 13 pages, 2 figures

View paper on

Share this with someone who'll enjoy it:

Title:ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation

Paper and Code