Standard deep learning methods for natural language processing fail to capture the compositional structure of human language that allows for systematic generalization outside of the training distribution. Human learners, in contrast, readily generalize in this way, e.g., by applying known grammatical rules to novel words. Inspired by work in neuroscience suggesting separate brain systems for syntactic and semantic processing, we implement a modification to standard approaches in neural machine translation that imposes an analogous separation. The resulting model, which we call Syntactic Attention, substantially outperforms standard deep learning methods on SCAN, a compositional generalization benchmark, without any hand-engineered features or additional supervision. Our work suggests that separating syntactic from semantic learning may be a useful heuristic for capturing compositional structure.
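To make the idea of such a separation concrete, the following is a minimal, hypothetical sketch (in PyTorch) of one way an attention mechanism could be split into two streams: attention weights computed from a context-sensitive "syntactic" encoding of the input, and attended values drawn from context-free "semantic" word embeddings. All class, dimension, and parameter names here are illustrative assumptions for exposition, not the paper's actual implementation.

```python
import torch
import torch.nn as nn


class SyntacticAttentionSketch(nn.Module):
    """Illustrative sketch only: attention weights come from a sequence-aware
    "syntactic" stream, while the values being attended over come from
    context-free "semantic" embeddings."""

    def __init__(self, vocab_size, sem_dim=64, syn_dim=64):
        super().__init__()
        # Semantic stream: context-free word embeddings (what each word means).
        self.semantic_emb = nn.Embedding(vocab_size, sem_dim)
        # Syntactic stream: a recurrent encoder that sees word order (how words relate).
        self.syntactic_emb = nn.Embedding(vocab_size, syn_dim)
        self.syntactic_rnn = nn.LSTM(syn_dim, syn_dim, batch_first=True, bidirectional=True)
        # Stand-in projection for a decoder-side query (assumed, for illustration).
        self.query = nn.Linear(2 * syn_dim, 2 * syn_dim)

    def forward(self, tokens, decoder_state):
        # tokens: (batch, seq_len); decoder_state: (batch, 2 * syn_dim)
        semantics = self.semantic_emb(tokens)                         # (batch, seq, sem_dim)
        syntax, _ = self.syntactic_rnn(self.syntactic_emb(tokens))    # (batch, seq, 2*syn_dim)

        # Attention weights depend only on the syntactic stream ...
        scores = torch.bmm(syntax, self.query(decoder_state).unsqueeze(2))  # (batch, seq, 1)
        weights = torch.softmax(scores, dim=1)

        # ... while the attended values come only from the semantic stream.
        context = (weights * semantics).sum(dim=1)                    # (batch, sem_dim)
        return context, weights.squeeze(2)


if __name__ == "__main__":
    # Toy usage with random tokens and a zero decoder state.
    model = SyntacticAttentionSketch(vocab_size=20)
    tokens = torch.randint(0, 20, (2, 5))          # batch of 2 sequences, length 5
    decoder_state = torch.zeros(2, 2 * 64)
    context, weights = model(tokens, decoder_state)
    print(context.shape, weights.shape)            # torch.Size([2, 64]) torch.Size([2, 5])
```

The design choice this sketch highlights is that information about word order never mixes into the attended values themselves, so "how to attend" (syntax) and "what is retrieved" (semantics) are learned by separate pathways.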