Adversarial reconstruction for Multi-modal Machine Translation

Delbrouck, Jean-Benoit; Dupont, Stéphane

arXiv:1910.02766 (cs)

[Submitted on 7 Oct 2019]

Abstract: Despite growing interest in problems at the intersection of Computer Vision and Natural Language, grounding (i.e. identifying) the components of a structured description in an image remains a challenging task. This contribution proposes a model that learns grounding by reconstructing the visual features for the Multi-modal translation task. Previous works have partially investigated standard approaches, such as regression methods, to approximate the reconstruction of a visual input. In this paper, we propose a different and novel approach which learns grounding from adversarial feedback. To do so, we modulate our network following recent promising adversarial architectures and evaluate how the adversarial response from a visual reconstruction, used as an auxiliary task, helps the model in its learning. We report the highest scores in terms of BLEU and METEOR metrics on the different datasets.
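
The idea of using an adversarial response as an auxiliary signal can be illustrated with a minimal sketch. This is not the authors' formulation: it assumes the textbook GAN objective, in which a discriminator scores real visual features against reconstructed ones, and the translation model receives the non-saturating generator loss as an extra term. The function names, the scalar logits, and the `aux_weight` parameter are all hypothetical and chosen only for illustration.

```python
import math


def sigmoid(x):
    """Map a raw discriminator logit to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def discriminator_loss(real_logit, fake_logit):
    """Binary cross-entropy for the discriminator (assumed standard GAN loss).

    Real visual features should be scored as 1, reconstructions as 0.
    """
    return -math.log(sigmoid(real_logit)) - math.log(1.0 - sigmoid(fake_logit))


def adversarial_feedback(fake_logit):
    """Non-saturating generator loss on a reconstruction's logit.

    The loss is low when the discriminator accepts the reconstruction as real.
    """
    return -math.log(sigmoid(fake_logit))


def total_translation_loss(translation_nll, fake_logit, aux_weight=0.5):
    """Translation loss plus the weighted adversarial auxiliary term.

    aux_weight is a hypothetical hyperparameter, not a value from the paper.
    """
    return translation_nll + aux_weight * adversarial_feedback(fake_logit)
```

In this sketch, a reconstruction that the discriminator finds convincing (a high `fake_logit`) adds little to the total loss, while an unconvincing one adds a larger penalty, so the translation model is pushed toward representations from which plausible visual features can be recovered.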

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1910.02766 [cs.CL]
	(or arXiv:1910.02766v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1910.02766

Submission history

From: Jean-Benoit Delbrouck
[v1] Mon, 7 Oct 2019 13:08:07 UTC (976 KB)
