Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jianze Liang

Finding Sparse Structure for Domain Specific Neural Machine Translation

Dec 19, 2020

Jianze Liang, Chengqi Zhao, Mingxuan Wang, Xipeng Qiu, Lei Li

Figure 1 for Finding Sparse Structure for Domain Specific Neural Machine Translation

Figure 2 for Finding Sparse Structure for Domain Specific Neural Machine Translation

Figure 3 for Finding Sparse Structure for Domain Specific Neural Machine Translation

Figure 4 for Finding Sparse Structure for Domain Specific Neural Machine Translation

Abstract:Fine-tuning is a major approach for domain adaptation in Neural Machine Translation (NMT). However, unconstrained fine-tuning requires very careful hyper-parameter tuning otherwise it is easy to fall into over-fitting on the target domain and degradation on the general domain. To mitigate it, we propose PRUNE-TUNE, a novel domain adaptation method via gradual pruning. It learns tiny domain-specific subnetworks for tuning. During adaptation to a new domain, we only tune its corresponding subnetwork. PRUNE-TUNE alleviates the over-fitting and the degradation problem without model modification. Additionally, with no overlapping between domain-specific subnetworks, PRUNE-TUNE is also capable of sequential multi-domain learning. Empirical experiment results show that PRUNE-TUNE outperforms several strong competitors in the target domain test set without the quality degradation of the general domain in both single and multiple domain settings.

* Accepted to AAAI 2021

Via

Access Paper or Ask Questions

Style Transformer: Unpaired Text Style Transfer without Disentangled Latent Representation

May 14, 2019

Ning Dai, Jianze Liang, Xipeng Qiu, Xuanjing Huang

Figure 1 for Style Transformer: Unpaired Text Style Transfer without Disentangled Latent Representation

Figure 2 for Style Transformer: Unpaired Text Style Transfer without Disentangled Latent Representation

Figure 3 for Style Transformer: Unpaired Text Style Transfer without Disentangled Latent Representation

Figure 4 for Style Transformer: Unpaired Text Style Transfer without Disentangled Latent Representation

Abstract:Disentangling the content and style in the latent space is prevalent in unpaired text style transfer. However, two major issues exist in most of the current neural models. 1) It is difficult to completely strip the style information from the semantics for a sentence. 2) The recurrent neural network (RNN) based encoder and decoder, mediated by the latent representation, cannot well deal with the issue of the long-term dependency, resulting in poor preservation of non-stylistic semantic content.In this paper, we propose the Style Transformer, which makes no assumption about the latent representation of source sentence and equips the power of attention mechanism in Transformer to achieve better style transfer and better content preservation.

* ACL 2019

Via

Access Paper or Ask Questions