Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Encodings of Source Syntax: Similarities in NMT Representations Across Target Languages

May 17, 2020

Tyler A. Chang, Anna N. Rafferty

Figure 1 for Encodings of Source Syntax: Similarities in NMT Representations Across Target Languages

Figure 2 for Encodings of Source Syntax: Similarities in NMT Representations Across Target Languages

Figure 3 for Encodings of Source Syntax: Similarities in NMT Representations Across Target Languages

Figure 4 for Encodings of Source Syntax: Similarities in NMT Representations Across Target Languages

Share this with someone who'll enjoy it:

Abstract:We train neural machine translation (NMT) models from English to six target languages, using NMT encoder representations to predict ancestor constituent labels of source language words. We find that NMT encoders learn similar source syntax regardless of NMT target language, relying on explicit morphosyntactic cues to extract syntactic features from source sentences. Furthermore, the NMT encoders outperform RNNs trained directly on several of the constituent label prediction tasks, suggesting that NMT encoder representations can be used effectively for natural language tasks involving syntax. However, both the NMT encoders and the directly-trained RNNs learn substantially different syntactic information from a probabilistic context-free grammar (PCFG) parser. Despite lower overall accuracy scores, the PCFG often performs well on sentences for which the RNN-based models perform poorly, suggesting that RNN architectures are constrained in the types of syntax they can learn.

* To appear at the 5th Workshop on Representation Learning for NLP

View paper on

Share this with someone who'll enjoy it:

Title:Encodings of Source Syntax: Similarities in NMT Representations Across Target Languages

Paper and Code