Modeling Source Syntax for Neural Machine Translation

Li, Junhui; Xiong, Deyi; Tu, Zhaopeng; Zhu, Muhua; Zhang, Min; Zhou, Guodong

Computer Science > Computation and Language

arXiv:1705.01020 (cs)

[Submitted on 2 May 2017]

Title:Modeling Source Syntax for Neural Machine Translation

Authors:Junhui Li, Deyi Xiong, Zhaopeng Tu, Muhua Zhu, Min Zhang, Guodong Zhou

View PDF

Abstract:Even though a linguistics-free sequence to sequence model in neural machine translation (NMT) has certain capability of implicitly learning syntactic information of source sentences, this paper shows that source syntax can be explicitly incorporated into NMT effectively to provide further improvements. Specifically, we linearize parse trees of source sentences to obtain structural label sequences. On the basis, we propose three different sorts of encoders to incorporate source syntax into NMT: 1) Parallel RNN encoder that learns word and label annotation vectors parallelly; 2) Hierarchical RNN encoder that learns word and label annotation vectors in a two-level hierarchy; and 3) Mixed RNN encoder that stitchingly learns word and label annotation vectors over sequences where words and labels are mixed. Experimentation on Chinese-to-English translation demonstrates that all the three proposed syntactic encoders are able to improve translation accuracy. It is interesting to note that the simplest RNN encoder, i.e., Mixed RNN encoder yields the best performance with an significant improvement of 1.4 BLEU points. Moreover, an in-depth analysis from several perspectives is provided to reveal how source syntax benefits NMT.

Comments:	Accepted by ACL 2017
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1705.01020 [cs.CL]
	(or arXiv:1705.01020v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1705.01020

Submission history

From: Junhui Li [view email]
[v1] Tue, 2 May 2017 15:21:46 UTC (785 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2017-05

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Junhui Li
Deyi Xiong
Zhaopeng Tu
Muhua Zhu
Min Zhang

…

export BibTeX citation

Computer Science > Computation and Language

Title:Modeling Source Syntax for Neural Machine Translation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Modeling Source Syntax for Neural Machine Translation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators