Cost-Sensitive Training for Autoregressive Models

Saparina, Irina; Osokin, Anton

Computer Science > Machine Learning

arXiv:1912.03771 (cs)

[Submitted on 8 Dec 2019]

Title:Cost-Sensitive Training for Autoregressive Models

Authors:Irina Saparina, Anton Osokin

View PDF

Abstract:Training autoregressive models to better predict under the test metric, instead of maximizing the likelihood, has been reported to be beneficial in several use cases but brings additional complications, which prevent wider adoption. In this paper, we follow the learning-to-search approach (Daumé III et al., 2009; Leblond et al., 2018) and investigate its several components. First, we propose a way to construct a reference policy based on an alignment between the model output and ground truth. Our reference policy is optimal when applied to the Kendall-tau distance between permutations (appear in the task of word ordering) and helps when working with the METEOR score for machine translation. Second, we observe that the learning-to-search approach benefits from choosing the costs related to the test metrics. Finally, we study the effect of different learning objectives and find that the standard KL loss only learns several high-probability tokens and can be replaced with ranking objectives that target these tokens explicitly.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1912.03771 [cs.LG]
	(or arXiv:1912.03771v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1912.03771

Submission history

From: Anton Osokin [view email]
[v1] Sun, 8 Dec 2019 21:57:56 UTC (100 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2019-12

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Anton Osokin

export BibTeX citation

Computer Science > Machine Learning

Title:Cost-Sensitive Training for Autoregressive Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Cost-Sensitive Training for Autoregressive Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators