Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Gabor Szolnok

A Three Step Training Approach with Data Augmentation for Morphological Inflection

Sep 14, 2021

Gabor Szolnok, Botond Barta, Dorina Lakatos, Judit Acs

Figure 1 for A Three Step Training Approach with Data Augmentation for Morphological Inflection

Figure 2 for A Three Step Training Approach with Data Augmentation for Morphological Inflection

Figure 3 for A Three Step Training Approach with Data Augmentation for Morphological Inflection

Figure 4 for A Three Step Training Approach with Data Augmentation for Morphological Inflection

Abstract:We present the BME submission for the SIGMORPHON 2021 Task 0 Part 1, Generalization Across Typologically Diverse Languages shared task. We use an LSTM encoder-decoder model with three step training that is first trained on all languages, then fine-tuned on each language families and finally finetuned on individual languages. We use a different type of data augmentation technique in the first two steps. Our system outperformed the only other submission. Although it remains worse than the Transformer baseline released by the organizers, our model is simpler and our data augmentation techniques are easily applicable to new languages. We perform ablation studies and show that the augmentation techniques and the three training steps often help but sometimes have a negative effect.

Via

Access Paper or Ask Questions