Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:SimAlign: High Quality Word Alignments without Parallel Training Data using Static and Contextualized Embeddings

Apr 27, 2020

Masoud Jalili Sabet, Philipp Dufter, Hinrich Schütze

Figure 1 for SimAlign: High Quality Word Alignments without Parallel Training Data using Static and Contextualized Embeddings

Figure 2 for SimAlign: High Quality Word Alignments without Parallel Training Data using Static and Contextualized Embeddings

Figure 3 for SimAlign: High Quality Word Alignments without Parallel Training Data using Static and Contextualized Embeddings

Figure 4 for SimAlign: High Quality Word Alignments without Parallel Training Data using Static and Contextualized Embeddings

Share this with someone who'll enjoy it:

Abstract:Word alignments are useful for tasks like statistical and neural machine translation (NMT) and annotation projection. Statistical word aligners perform well, as do methods that extract alignments jointly with translations in NMT. However, most approaches require parallel training data and quality decreases as less training data is available. We propose word alignment methods that require no parallel data. The key idea is to leverage multilingual word embeddings, both static and contextualized, for word alignment. Our multilingual embeddings are created from monolingual data only without relying on any parallel data or dictionaries. We find that alignments created from embeddings are competitive and mostly superior to traditional statistical aligners, even in scenarios with abundant parallel data. For example, for a set of 100k parallel sentences, contextualized embeddings achieve a word alignment F1 for English-German that is more than 5% higher (absolute) than eflomal, a high quality alignment model.

View paper on

Share this with someone who'll enjoy it:

Title:SimAlign: High Quality Word Alignments without Parallel Training Data using Static and Contextualized Embeddings

Paper and Code