Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Cost-Sensitive Training for Autoregressive Models

Dec 08, 2019

Irina Saparina, Anton Osokin

Figure 1 for Cost-Sensitive Training for Autoregressive Models

Figure 2 for Cost-Sensitive Training for Autoregressive Models

Figure 3 for Cost-Sensitive Training for Autoregressive Models

Figure 4 for Cost-Sensitive Training for Autoregressive Models

Share this with someone who'll enjoy it:

Abstract:Training autoregressive models to better predict under the test metric, instead of maximizing the likelihood, has been reported to be beneficial in several use cases but brings additional complications, which prevent wider adoption. In this paper, we follow the learning-to-search approach (Daum\'e III et al., 2009; Leblond et al., 2018) and investigate its several components. First, we propose a way to construct a reference policy based on an alignment between the model output and ground truth. Our reference policy is optimal when applied to the Kendall-tau distance between permutations (appear in the task of word ordering) and helps when working with the METEOR score for machine translation. Second, we observe that the learning-to-search approach benefits from choosing the costs related to the test metrics. Finally, we study the effect of different learning objectives and find that the standard KL loss only learns several high-probability tokens and can be replaced with ranking objectives that target these tokens explicitly.

View paper on

Share this with someone who'll enjoy it:

Title:Cost-Sensitive Training for Autoregressive Models

Paper and Code