Title: Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation

Jun 09, 2021

Cunxiao Du, Zhaopeng Tu, Jing Jiang

Abstract: We propose a new training objective named order-agnostic cross entropy (OaXE) for fully non-autoregressive translation (NAT) models. OaXE improves the standard cross-entropy loss to ameliorate the effect of word reordering, which is a common source of the critical multimodality problem in NAT. Concretely, OaXE removes the penalty for word order errors and computes the cross-entropy loss based on the best possible alignment between model predictions and target tokens. Since the log loss is very sensitive to invalid references, we leverage cross-entropy initialization and loss truncation to ensure the model focuses on a good part of the search space. Extensive experiments on major WMT benchmarks show that OaXE substantially improves translation performance, setting a new state of the art for fully NAT models. Further analyses show that OaXE alleviates the multimodality problem by reducing token repetitions and increasing prediction confidence. Our code, data, and trained models are available at https://github.com/tencent-ailab/ICML21_OAXE.
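
The idea of scoring predictions against the best possible alignment can be illustrated with a small sketch. This is not the authors' implementation: it assumes a toy setting where the output length equals the target length, and it finds the best ordering by brute force over permutations, which is only feasible for very short sequences (a practical version would presumably use an assignment solver instead). The function names `cross_entropy` and `order_agnostic_cross_entropy` are hypothetical, chosen for illustration.

```python
import math
from itertools import permutations


def cross_entropy(log_probs, targets):
    """Standard position-wise cross entropy: position i must predict targets[i]."""
    return -sum(log_probs[i][tok] for i, tok in enumerate(targets))


def order_agnostic_cross_entropy(log_probs, targets):
    """Lowest cross entropy over all orderings of the target tokens.

    Brute force for illustration only (assumption: tiny sequences).
    """
    best_loss, best_order = math.inf, None
    for order in permutations(targets):
        loss = cross_entropy(log_probs, order)
        if loss < best_loss:
            best_loss, best_order = loss, order
    return best_loss, list(best_order)


# Toy example: a 3-token vocabulary (ids 0..2) and two output positions.
# The model predicts the right tokens but in swapped order.
probs = [[0.1, 0.8, 0.1],
         [0.7, 0.2, 0.1]]
log_probs = [[math.log(p) for p in row] for row in probs]
targets = [0, 1]

xe = cross_entropy(log_probs, targets)
oaxe, order = order_agnostic_cross_entropy(log_probs, targets)
print(f"XE = {xe:.3f}, OaXE-style loss = {oaxe:.3f}, best order = {order}")
```

In this toy case the position-wise loss penalizes the swapped output heavily, while the order-agnostic variant matches each prediction to the target token it fits best and so assigns a much lower loss.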

* ICML 2021 (Oral), Code at https://github.com/tencent-ailab/ICML21_OAXE
