Norm-Based Curriculum Learning for Neural Machine Translation

Liu, Xuebo; Lai, Houtim; Wong, Derek F.; Chao, Lidia S.

arXiv:2006.02014 (cs)

[Submitted on 3 Jun 2020]

Abstract: A neural machine translation (NMT) system is expensive to train, especially in high-resource settings. As NMT architectures become deeper and wider, this cost grows. In this paper, we aim to improve the efficiency of training an NMT system by introducing a novel norm-based curriculum learning method. We use the norm (aka length or modulus) of a word embedding as a measure of 1) the difficulty of the sentence, 2) the competence of the model, and 3) the weight of the sentence. The norm-based sentence difficulty combines the advantages of linguistically motivated and model-based sentence difficulties: it is easy to determine and contains learning-dependent features. The norm-based model competence lets the NMT system learn the curriculum in a fully automated way, while the norm-based sentence weight further enhances the learning of the vector representations of the NMT system. Experimental results on the WMT'14 English-German and WMT'17 Chinese-English translation tasks demonstrate that the proposed method outperforms strong baselines in terms of BLEU score (+1.17/+1.56) and training speedup (2.22x/3.33x).
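The idea of scoring sentence difficulty by embedding norm can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how word-level norms are aggregated into a sentence score, so summing the word vectors and taking the L2 norm of the result is an assumption made here, and the embedding table, the helper names (`l2_norm`, `sentence_difficulty`, `order_easy_to_hard`), and the toy corpus are all hypothetical. Model competence and sentence weighting are not shown.

```python
import math


def l2_norm(vec):
    """Euclidean (L2) norm of a vector given as a list of floats."""
    return math.sqrt(sum(x * x for x in vec))


def sentence_difficulty(tokens, embeddings):
    """Difficulty score for one sentence.

    Assumption: the sentence vector is the sum of its word vectors,
    and the difficulty is the L2 norm of that sum.
    """
    dim = len(next(iter(embeddings.values())))
    total = [0.0] * dim
    for tok in tokens:
        for i, x in enumerate(embeddings[tok]):
            total[i] += x
    return l2_norm(total)


def order_easy_to_hard(corpus, embeddings):
    """Sort sentences by ascending difficulty, as a curriculum would."""
    return sorted(corpus, key=lambda s: sentence_difficulty(s, embeddings))


# Hypothetical 2-dimensional embeddings; in practice these would come
# from a trained model.
toy_embeddings = {
    "the": [0.1, 0.0],
    "cat": [0.6, 0.8],
    "sat": [0.3, 0.4],
    "quantum": [3.0, 4.0],
}

corpus = [
    ["the", "quantum", "cat"],
    ["the"],
    ["the", "cat", "sat"],
]

ordered = order_easy_to_hard(corpus, toy_embeddings)
for sentence in ordered:
    print(" ".join(sentence), round(sentence_difficulty(sentence, toy_embeddings), 3))
```

In this toy setup the sentence containing the large-norm word "quantum" is ranked last, so a curriculum built on this ordering would present it to the model after the two lower-norm sentences.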

Comments:	Accepted to ACL 2020
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2006.02014 [cs.CL]
	(or arXiv:2006.02014v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2006.02014

Submission history

From: Xuebo Liu
[v1] Wed, 3 Jun 2020 02:22:00 UTC (140 KB)
