Data-Driven Adaptive Simultaneous Machine Translation

Xun, Guangxu; Ma, Mingbo; Bian, Yuchen; Cai, Xingyu; Huang, Jiaji; Zheng, Renjie; Chen, Junkun; Yuan, Jiahong; Church, Kenneth; Huang, Liang

arXiv:2204.12672 (cs)

[Submitted on 27 Apr 2022]

Abstract: In simultaneous translation (SimulMT), the most widely used strategy is the wait-k policy, thanks to its simplicity and its effectiveness in balancing translation quality and latency. However, wait-k suffers from two major limitations: (a) it is a fixed policy that cannot adaptively adjust latency given the context, and (b) its training is much slower than that of full-sentence translation. To alleviate these issues, we propose a novel and efficient training scheme for adaptive SimulMT that augments the training corpus with adaptive prefix-to-prefix pairs, while the training complexity remains the same as that of training full-sentence translation models. Experiments on two language pairs show that our method outperforms all strong baselines in terms of translation quality and latency.
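For readers unfamiliar with the fixed wait-k baseline mentioned in the abstract, the following is a minimal sketch of its read/write schedule: the decoder waits for k source tokens, then emits one target token per additional source token read, and finishes the translation once the source is exhausted. This illustrates only the standard wait-k policy, not the adaptive scheme the paper proposes. The function names `wait_k_schedule` and `wait_k_prefix_pairs` are hypothetical and chosen for illustration.

```python
from typing import List, Sequence, Tuple


def wait_k_schedule(src_len: int, tgt_len: int, k: int) -> List[int]:
    """Return, for each target position t (1-indexed), how many source
    tokens have been read before the t-th target token is emitted.

    Standard wait-k: g(t) = min(k + t - 1, src_len).
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    return [min(k + t - 1, src_len) for t in range(1, tgt_len + 1)]


def wait_k_prefix_pairs(
    src: Sequence[str], tgt: Sequence[str], k: int
) -> List[Tuple[List[str], List[str], str]]:
    """Enumerate the fixed prefix-to-prefix steps implied by wait-k.

    Each item is (visible source prefix, target tokens emitted so far,
    next target token). Hypothetical helper, for illustration only.
    """
    schedule = wait_k_schedule(len(src), len(tgt), k)
    return [
        (list(src[:g]), list(tgt[:i]), tgt[i])
        for i, g in enumerate(schedule)
    ]


if __name__ == "__main__":
    source = ["a", "b", "c", "d", "e"]
    target = ["A", "B", "C", "D", "E", "F"]
    for src_prefix, tgt_prefix, next_token in wait_k_prefix_pairs(source, target, k=2):
        print(src_prefix, tgt_prefix, "->", next_token)
```

Because the schedule depends only on k and the sentence lengths, it cannot react to context, which is limitation (a) in the abstract.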

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2204.12672 [cs.CL]
	(or arXiv:2204.12672v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2204.12672

Submission history

From: Guangxu Xun
[v1] Wed, 27 Apr 2022 02:40:21 UTC (1,926 KB)
