Large Margin Neural Language Model

Huang, Jiaji; Li, Yi; Ping, Wei; Huang, Liang

arXiv:1808.08987 (cs)

[Submitted on 27 Aug 2018]

Abstract: We propose a large margin criterion for training neural language models. Conventionally, neural language models are trained by minimizing perplexity (PPL) on grammatical sentences. However, we demonstrate that PPL may not be the best metric to optimize in some tasks, and we therefore propose a large margin formulation. The proposed method aims to enlarge the margin between "good" and "bad" sentences in a task-specific sense. It is trained end-to-end and can be widely applied to tasks that involve re-scoring of generated text. Compared with minimum-PPL training, our method achieves up to a 1.1 reduction in word error rate (WER) for speech recognition and up to a 1.0 increase in BLEU for machine translation.
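To make the idea of a margin between "good" and "bad" sentences concrete, here is a minimal Python sketch. The abstract does not state the exact loss, so the pairwise hinge form below is an assumption chosen for illustration, not the authors' formulation. The function names (`sentence_score`, `pairwise_margin_loss`, `rescore`), the default margin of 1.0, and the use of summed token log-probabilities as the sentence score are all hypothetical.

```python
from typing import List, Sequence


def sentence_score(token_log_probs: Sequence[float]) -> float:
    """Score a sentence as the sum of its per-token log-probabilities.

    Assumption: the language model exposes one log-probability per token.
    """
    return sum(token_log_probs)


def pairwise_margin_loss(good_score: float, bad_score: float,
                         margin: float = 1.0) -> float:
    """Hinge penalty on one (good, bad) sentence pair.

    The loss is zero once the good sentence outscores the bad one by at
    least `margin`, and grows linearly otherwise. The margin value is an
    illustrative assumption.
    """
    return max(0.0, margin - (good_score - bad_score))


def rescore(candidates: List[str], scores: List[float]) -> str:
    """Return the candidate with the highest language-model score.

    This mimics re-scoring an n-best list from a recognizer or translator.
    """
    best_index = max(range(len(candidates)), key=lambda i: scores[i])
    return candidates[best_index]


if __name__ == "__main__":
    # Hypothetical scores: the good hypothesis is currently ranked too low.
    good = sentence_score([-1.0, -1.5, -1.5])   # -4.0
    bad = sentence_score([-1.0, -1.0, -1.5])    # -3.5
    print(pairwise_margin_loss(good, bad))
    print(rescore(["good hypothesis", "bad hypothesis"], [good, bad]))
```

Under these assumptions, a perplexity-only objective would raise the score of grammatical sentences without looking at their competitors, whereas the hinge term is nonzero only when a bad candidate scores too close to, or above, a good one, which is the situation that matters when the model is used to re-rank candidates.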

Comments:	9 pages. Accepted as a long paper at EMNLP 2018
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1808.08987 [cs.CL]
	(or arXiv:1808.08987v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1808.08987

Submission history

From: Jiaji Huang Dr.
[v1] Mon, 27 Aug 2018 18:31:33 UTC (126 KB)
