Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Learning synchronous context-free grammars with multiple specialised non-terminals for hierarchical phrase-based translation

Apr 03, 2020

Felipe Sánchez-Martínez, Juan Antonio Pérez-Ortiz, Rafael C. Carrasco

Figure 1 for Learning synchronous context-free grammars with multiple specialised non-terminals for hierarchical phrase-based translation

Figure 2 for Learning synchronous context-free grammars with multiple specialised non-terminals for hierarchical phrase-based translation

Figure 3 for Learning synchronous context-free grammars with multiple specialised non-terminals for hierarchical phrase-based translation

Figure 4 for Learning synchronous context-free grammars with multiple specialised non-terminals for hierarchical phrase-based translation

Share this with someone who'll enjoy it:

Abstract:Translation models based on hierarchical phrase-based statistical machine translation (HSMT) have shown better performances than the non-hierarchical phrase-based counterparts for some language pairs. The standard approach to HSMT learns and apply a synchronous context-free grammar with a single non-terminal. The hypothesis behind the grammar refinement algorithm presented in this work is that this single non-terminal is overloaded, and insufficiently discriminative, and therefore, an adequate split of it into more specialised symbols could lead to improved models. This paper presents a method to learn synchronous context-free grammars with a huge number of initial non-terminals, which are then grouped via a clustering algorithm. Our experiments show that the resulting smaller set of non-terminals correctly capture the contextual information that makes it possible to statistically significantly improve the BLEU score of the standard HSMT approach.

View paper on

Share this with someone who'll enjoy it:

Title:Learning synchronous context-free grammars with multiple specialised non-terminals for hierarchical phrase-based translation

Paper and Code