Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Vitaliy Atrasevych

GECToR -- Grammatical Error Correction: Tag, Not Rewrite

May 29, 2020

Kostiantyn Omelianchuk, Vitaliy Atrasevych, Artem Chernodub, Oleksandr Skurzhanskyi

Figure 1 for GECToR -- Grammatical Error Correction: Tag, Not Rewrite

Figure 2 for GECToR -- Grammatical Error Correction: Tag, Not Rewrite

Figure 3 for GECToR -- Grammatical Error Correction: Tag, Not Rewrite

Figure 4 for GECToR -- Grammatical Error Correction: Tag, Not Rewrite

Abstract:In this paper, we present a simple and efficient GEC sequence tagger using a Transformer encoder. Our system is pre-trained on synthetic data and then fine-tuned in two stages: first on errorful corpora, and second on a combination of errorful and error-free parallel corpora. We design custom token-level transformations to map input tokens to target corrections. Our best single-model/ensemble GEC tagger achieves an $F_{0.5}$ of 65.3/66.5 on CoNLL-2014 (test) and $F_{0.5}$ of 72.4/73.6 on BEA-2019 (test). Its inference speed is up to 10 times as fast as a Transformer-based seq2seq GEC system. The code and trained models are publicly available.

* Accepted for publication in BEA workshop (15th Workshop on Innovative Use of NLP for Building Educational Applications; co-located with ACL)

Via

Access Paper or Ask Questions