Active Learning for Interactive Neural Machine Translation of Data Streams

Peris, Álvaro; Casacuberta, Francisco

Computer Science > Computation and Language

arXiv:1807.11243 (cs)

[Submitted on 30 Jul 2018 (v1), last revised 25 Oct 2018 (this version, v2)]

Title:Active Learning for Interactive Neural Machine Translation of Data Streams

Authors:Álvaro Peris, Francisco Casacuberta

View PDF

Abstract:We study the application of active learning techniques to the translation of unbounded data streams via interactive neural machine translation. The main idea is to select, from an unbounded stream of source sentences, those worth to be supervised by a human agent. The user will interactively translate those samples. Once validated, these data is useful for adapting the neural machine translation model.
We propose two novel methods for selecting the samples to be validated. We exploit the information from the attention mechanism of a neural machine translation system. Our experiments show that the inclusion of active learning techniques into this pipeline allows to reduce the effort required during the process, while increasing the quality of the translation system. Moreover, it enables to balance the human effort required for achieving a certain translation quality. Moreover, our neural system outperforms classical approaches by a large margin.

Comments:	Accepted at The SIGNLL Conference on Computational Natural Language Learning (CoNLL'18)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1807.11243 [cs.CL]
	(or arXiv:1807.11243v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1807.11243

Submission history

From: Álvaro Peris [view email]
[v1] Mon, 30 Jul 2018 09:11:26 UTC (37 KB)
[v2] Thu, 25 Oct 2018 08:54:52 UTC (37 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2018-07

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Álvaro Peris
Francisco Casacuberta

export BibTeX citation

Computer Science > Computation and Language

Title:Active Learning for Interactive Neural Machine Translation of Data Streams

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Active Learning for Interactive Neural Machine Translation of Data Streams

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators