Abstract:Based on large-scale pre-trained multilingual representations, recent cross-lingual transfer methods have achieved impressive transfer performance. However, the performance on target languages still lags far behind that on the source language. In this paper, our analyses indicate that such a performance gap is strongly associated with the cross-lingual representation discrepancy. To achieve better cross-lingual transfer performance, we propose the cross-lingual manifold mixup (X-Mixup) method, which adaptively calibrates the representation discrepancy and gives target languages a compromised representation. Experiments on the XTREME benchmark show that X-Mixup achieves 1.8% performance gains on multiple text understanding tasks compared with strong baselines, and significantly reduces the cross-lingual representation discrepancy.
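As a rough illustration of the mixup idea described in this abstract, the sketch below mixes aligned source and target hidden states with a ratio driven by their discrepancy. The function name, the sigmoid-based ratio, and the tensor shapes are assumptions for illustration, not the paper's exact calibration rule.

```python
import torch

def x_mixup(h_src, h_tgt, discrepancy):
    """Mix source and target hidden states (illustrative sketch, not the paper's exact rule).

    h_src, h_tgt: (batch, seq_len, dim) hidden states from a multilingual encoder,
                  assumed to be aligned across languages (e.g., via parallel inputs).
    discrepancy:  (batch,) distance between the two representations; used here to
                  set the mixing ratio adaptively (an assumed calibration scheme).
    """
    lam = torch.sigmoid(discrepancy).view(-1, 1, 1)   # larger gap -> lean more on source
    return lam * h_src + (1.0 - lam) * h_tgt
```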
Abstract:Pairwise ranking methods are the basis of many widely used discriminative training approaches for structure prediction problems in natural language processing (NLP). Decomposing the problem of ranking hypotheses into pairwise comparisons enables simple and efficient solutions. However, neglecting the global ordering of the hypothesis list may hinder learning. We propose a listwise learning framework for structure prediction problems such as machine translation. Our framework directly models the ordering of the entire translation list to learn parameters that better fit the given listwise samples. Furthermore, we propose top-rank enhanced loss functions, which are more sensitive to ranking errors at higher positions. Experiments on a large-scale Chinese-English translation task show that both our listwise learning framework and top-rank enhanced listwise losses lead to significant improvements in translation quality.
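A minimal sketch of a listwise, top-rank-weighted loss in the spirit this abstract describes, here based on a ListMLE-style (Plackett-Luce) formulation with an exponential position decay. The specific loss form and the `alpha` weighting are assumptions for illustration, not the paper's exact objective.

```python
import torch

def top_rank_listwise_loss(scores, gold_ranking, alpha=1.0):
    """Listwise loss over a hypothesis list with extra weight on top positions (sketch).

    scores:       (n,) model scores for n translation hypotheses of one sentence.
    gold_ranking: (n,) indices of hypotheses ordered from best to worst
                  (e.g., by sentence-level BLEU).
    alpha:        decay controlling how strongly top ranks are emphasized
                  (an illustrative weighting scheme).
    """
    ordered = scores[gold_ranking]                              # scores in gold order
    # Plackett-Luce: log-probability that each item outranks everything ranked below it
    log_suffix = torch.logcumsumexp(ordered.flip(0), dim=0).flip(0)
    log_probs = ordered - log_suffix
    positions = torch.arange(len(scores), dtype=scores.dtype, device=scores.device)
    weights = torch.exp(-alpha * positions)                     # heavier weight on top ranks
    return -(weights * log_probs).sum() / weights.sum()
```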
Abstract:Most neural machine translation (NMT) models are based on the sequential encoder-decoder framework, which makes no use of syntactic information. In this paper, we improve this model by explicitly incorporating source-side syntactic trees. More specifically, we propose (1) a bidirectional tree encoder which learns both sequential and tree-structured representations; (2) a tree-coverage model that lets the attention depend on the source-side syntax. Experiments on Chinese-English translation demonstrate that our proposed models outperform the sequential attentional model as well as a stronger baseline with a bottom-up tree encoder and word coverage.
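The sketch below illustrates the general shape of a bidirectional tree encoder: a bottom-up pass composes child states into phrase states, and a top-down pass propagates parent context back to the children. The binary-tree input format, the `tanh` compositions, and the class name are illustrative assumptions rather than the paper's exact architecture, and the tree-coverage attention is omitted.

```python
import torch
import torch.nn as nn

class BiTreeEncoder(nn.Module):
    """Sketch of a bidirectional tree encoder over a source parse (assumed architecture).

    `tree` lists internal nodes as (node_id, (left_child_id, right_child_id)) in
    bottom-up topological order; leaf ids 0..n-1 index the word-level states in
    `seq_states` produced by a sequential encoder.
    """
    def __init__(self, dim):
        super().__init__()
        self.up = nn.Linear(2 * dim, dim)    # compose two child states bottom-up
        self.down = nn.Linear(2 * dim, dim)  # propagate parent context top-down

    def forward(self, seq_states, tree):
        nodes = {i: h for i, h in enumerate(seq_states)}        # leaves = word states
        for node_id, (left, right) in tree:                     # bottom-up pass
            nodes[node_id] = torch.tanh(
                self.up(torch.cat([nodes[left], nodes[right]], dim=-1)))
        for node_id, (left, right) in reversed(tree):           # top-down pass
            for child in (left, right):
                nodes[child] = torch.tanh(
                    self.down(torch.cat([nodes[child], nodes[node_id]], dim=-1)))
        return nodes  # both word-level and tree-structured representations
```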
Abstract:Modern statistical machine translation (SMT) systems usually use a linear combination of features to model the quality of each translation hypothesis. The linear combination assumes that all the features are in a linear relationship and constrains each feature to interact with the other features in a linear manner, which may limit the expressive power of the model and lead to an under-fit model on the current data. In this paper, we propose non-linear modeling of translation hypothesis quality based on neural networks, which allows more complex interactions between features. We present a learning framework for training the non-linear models, and we discuss possible heuristics for designing the network structure which may improve the non-linear learning performance. Experimental results show that, with the basic features of a hierarchical phrase-based machine translation system, our method produces translations better than those of a linear model.
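For concreteness, here is a minimal sketch of replacing the linear feature combination with a small feed-forward scorer, assuming a fixed-length vector of standard SMT features per hypothesis; the layer sizes, activation, and class name are illustrative, not the paper's configuration.

```python
import torch
import torch.nn as nn

class NonLinearScorer(nn.Module):
    """Non-linear hypothesis scorer replacing the usual linear feature combination (sketch).

    `feats` holds the standard SMT features of one hypothesis (translation-model
    probabilities, language-model score, word penalty, ...).
    """
    def __init__(self, num_feats, hidden=16):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(num_feats, hidden),  # features interact non-linearly via the hidden layer
            nn.Tanh(),
            nn.Linear(hidden, 1),          # scalar quality score of the hypothesis
        )

    def forward(self, feats):
        return self.net(feats).squeeze(-1)
```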