Abstract: Semi-supervised learning that leverages synthetic training data has been widely adopted in Automatic Post-Editing (APE) to overcome the lack of human-annotated training data. In that context, data-synthesis methods for creating high-quality synthetic data have also received much attention. Considering that APE takes machine-translation outputs containing translation errors as input, we propose a noising-based data-synthesis method that uses a masked language model to create noisy texts by substituting masked tokens with erroneous tokens, while following the error-quantity statistics observed in genuine APE data. In addition, we propose corpus interleaving, which combines two separately synthesized datasets by taking only the advantageous samples from each, to further enhance the quality of the synthetic data created with our noising method. Experimental results reveal that the synthetic data created with our approach yields significant improvements in APE performance over synthetic data created with existing data-synthesis methods.
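A minimal sketch of what such MLM-based noising could look like, assuming a HuggingFace fill-mask pipeline and a placeholder error count; the model name, the candidate-selection rule, and the `noise_sentence` helper are illustrative assumptions, not the paper's exact procedure (in particular, the error quantity would be sampled from statistics of genuine APE data rather than fixed):

```python
# Sketch of masked-LM noising: mask a few tokens and substitute them with
# plausible but incorrect candidates to imitate machine-translation errors.
import random
from transformers import pipeline

# Any masked language model can serve for illustration; this choice is an assumption.
fill_mask = pipeline("fill-mask", model="bert-base-multilingual-cased", top_k=10)

def noise_sentence(sentence: str, n_errors: int) -> str:
    """Replace n_errors randomly chosen tokens with MLM candidates that differ
    from the original token, yielding an mt-like noisy sentence."""
    tokens = sentence.split()
    positions = random.sample(range(len(tokens)), k=min(n_errors, len(tokens)))
    for pos in positions:
        original = tokens[pos]
        masked = tokens.copy()
        masked[pos] = fill_mask.tokenizer.mask_token
        candidates = fill_mask(" ".join(masked))
        # Take the first candidate that is not the original word: a fluent
        # yet erroneous substitution.
        for cand in candidates:
            if cand["token_str"].strip() != original:
                tokens[pos] = cand["token_str"].strip()
                break
    return " ".join(tokens)

# n_errors is fixed here only for demonstration; in the paper's setting it would
# follow per-sentence error-quantity statistics from genuine APE triples.
print(noise_sentence("the quick brown fox jumps over the lazy dog", n_errors=2))
```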
Abstract: Zero-shot slot filling has received considerable attention as a way to cope with the limited training data available for the target domain. One of the important factors in zero-shot learning is having the model learn generalized and reliable representations. To this end, we present mcBERT, which stands for momentum contrastive learning with BERT, to build a robust zero-shot slot-filling model. mcBERT uses BERT to initialize two encoders, a query encoder and a key encoder, and is trained by momentum contrastive learning. Our experimental results on the SNIPS benchmark show that mcBERT substantially outperforms previous models, setting a new state of the art. We also show that each component of mcBERT contributes to the performance improvement.
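A minimal sketch of the query/key encoder setup with a MoCo-style momentum update, assuming a standard BERT checkpoint and a typical momentum value; the exact contrastive objective and hyperparameters used by mcBERT are not reproduced here:

```python
# Sketch of momentum contrastive learning machinery: two BERT-initialized
# encoders, where the key encoder is updated as an exponential moving average
# of the query encoder rather than by backpropagation.
import copy
import torch
from transformers import BertModel

query_encoder = BertModel.from_pretrained("bert-base-uncased")  # model name assumed
key_encoder = copy.deepcopy(query_encoder)                      # same initialization
for p in key_encoder.parameters():
    p.requires_grad = False                                     # no gradient updates

@torch.no_grad()
def momentum_update(m: float = 0.999) -> None:
    """key = m * key + (1 - m) * query (MoCo-style EMA update)."""
    for q_p, k_p in zip(query_encoder.parameters(), key_encoder.parameters()):
        k_p.data.mul_(m).add_(q_p.data, alpha=1.0 - m)

# In training, only the query encoder receives gradients from the contrastive
# loss; momentum_update() is called after each optimizer step.
```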
Abstract: Recent approaches in Automatic Post-Editing (APE) research have shown that better results are obtained by multi-source models, which jointly encode both the source sentence (src) and the machine-translation output (mt) to produce the post-edited sentence (pe). Following this trend, we present a new multi-source APE model based on the Transformer. To construct effective joint representations, our model internally learns to incorporate src context into the mt representation. With this approach, we achieve significant improvements over baseline systems as well as the state-of-the-art multi-source APE model. Moreover, to demonstrate our model's capability to incorporate src context, we show that the word alignment of the unknown MT system is successfully captured in our encoding results.
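A minimal sketch of one way src context could be folded into the mt representation, using cross-attention from mt states to src states; the layer sizes, the fusion scheme, and the `SrcMtJointEncoderLayer` class are assumptions for illustration, not the paper's exact architecture:

```python
# Sketch of a joint-encoding layer: mt tokens attend to the encoded src
# sentence so the resulting representation is src-aware.
import torch
import torch.nn as nn

class SrcMtJointEncoderLayer(nn.Module):
    def __init__(self, d_model: int = 512, n_heads: int = 8):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.cross_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)

    def forward(self, mt_states: torch.Tensor, src_states: torch.Tensor) -> torch.Tensor:
        # Standard self-attention over the mt sequence.
        h, _ = self.self_attn(mt_states, mt_states, mt_states)
        mt_states = self.norm1(mt_states + h)
        # Cross-attention: mt tokens query the src encoding, injecting src
        # context (and, implicitly, src-mt alignment) into the joint representation.
        h, _ = self.cross_attn(mt_states, src_states, src_states)
        return self.norm2(mt_states + h)

layer = SrcMtJointEncoderLayer()
src = torch.randn(2, 20, 512)   # encoded source sentence
mt = torch.randn(2, 18, 512)    # encoded machine-translation output
joint = layer(mt, src)          # src-aware mt representation passed to the decoder
```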