Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Giancarlo D. Salton

Exploring the Use of Attention within an Neural Machine Translation Decoder States to Translate Idioms

Oct 10, 2018

Giancarlo D. Salton, Robert J. Ross, John D. Kelleher

Figure 1 for Exploring the Use of Attention within an Neural Machine Translation Decoder States to Translate Idioms

Figure 2 for Exploring the Use of Attention within an Neural Machine Translation Decoder States to Translate Idioms

Figure 3 for Exploring the Use of Attention within an Neural Machine Translation Decoder States to Translate Idioms

Abstract:Idioms pose problems to almost all Machine Translation systems. This type of language is very frequent in day-to-day language use and cannot be simply ignored. The recent interest in memory augmented models in the field of Language Modelling has aided the systems to achieve good results by bridging long-distance dependencies. In this paper we explore the use of such techniques into a Neural Machine Translation system to help in translation of idiomatic language.

Via

Access Paper or Ask Questions

Persistence pays off: Paying Attention to What the LSTM Gating Mechanism Persists

Oct 10, 2018

Giancarlo D. Salton, John D. Kelleher

Figure 1 for Persistence pays off: Paying Attention to What the LSTM Gating Mechanism Persists

Figure 2 for Persistence pays off: Paying Attention to What the LSTM Gating Mechanism Persists

Figure 3 for Persistence pays off: Paying Attention to What the LSTM Gating Mechanism Persists

Figure 4 for Persistence pays off: Paying Attention to What the LSTM Gating Mechanism Persists

Abstract:Language Models (LMs) are important components in several Natural Language Processing systems. Recurrent Neural Network LMs composed of LSTM units, especially those augmented with an external memory, have achieved state-of-the-art results. However, these models still struggle to process long sequences which are more likely to contain long-distance dependencies because of information fading and a bias towards more recent information. In this paper we demonstrate an effective mechanism for retrieving information in a memory augmented LSTM LM based on attending to information in memory in proportion to the number of timesteps the LSTM gating mechanism persisted the information.

Via

Access Paper or Ask Questions

Is it worth it? Budget-related evaluation metrics for model selection

Jul 18, 2018

Filip Klubička, Giancarlo D. Salton, John D. Kelleher

Figure 1 for Is it worth it? Budget-related evaluation metrics for model selection

Figure 2 for Is it worth it? Budget-related evaluation metrics for model selection

Figure 3 for Is it worth it? Budget-related evaluation metrics for model selection

Figure 4 for Is it worth it? Budget-related evaluation metrics for model selection

Abstract:Creating a linguistic resource is often done by using a machine learning model that filters the content that goes through to a human annotator, before going into the final resource. However, budgets are often limited, and the amount of available data exceeds the amount of affordable annotation. In order to optimize the benefit from the invested human work, we argue that deciding on which model one should employ depends not only on generalized evaluation metrics such as F-score, but also on the gain metric. Because the model with the highest F-score may not necessarily have the best sequencing of predicted classes, this may lead to wasting funds on annotating false positives, yielding zero improvement of the linguistic resource. We exemplify our point with a case study, using real data from a task of building a verb-noun idiom dictionary. We show that, given the choice of three systems with varying F-scores, the system with the highest F-score does not yield the highest profits. In other words, in our case the cost-benefit trade off is more favorable for a system with a lower F-score.

* 7 pages, 1 figure, 5 tables, In proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

Via

Access Paper or Ask Questions