Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Align after Pre-train: Improving Multilingual Generative Models with Cross-lingual Alignment

Nov 14, 2023

Chong Li, Shaonan Wang, Jiajun Zhang, Chengqing Zong

Share this with someone who'll enjoy it:

Abstract:Multilingual generative models obtain remarkable cross-lingual capabilities through pre-training on large-scale corpora. However, they still exhibit a performance bias toward high-resource languages, and learn isolated distributions of sentence representations across languages. To bridge this gap, we propose a simple yet effective alignment framework exploiting pairs of translation sentences. It aligns the internal sentence representations across different languages via multilingual contrastive learning and aligns model outputs by answering prompts in different languages. Experimental results demonstrate that even with less than 0.1 {\textperthousand} of pre-training tokens, our alignment framework significantly boosts the cross-lingual abilities of generative models and mitigates the performance gap. Further analysis reveals that it results in a better internal multilingual representation distribution of multilingual models.

* Work in progress

View paper on

Share this with someone who'll enjoy it:

Title:Align after Pre-train: Improving Multilingual Generative Models with Cross-lingual Alignment

Paper and Code