Distilling Translations with Visual Awareness

Ive, Julia; Madhyastha, Pranava; Specia, Lucia

arXiv:1906.07701 (cs)

[Submitted on 18 Jun 2019]

Abstract: Previous work on multimodal machine translation has shown that visual information is only needed in very specific cases, for example in the presence of ambiguous words where the textual context is not sufficient. As a consequence, models tend to learn to ignore this information. We propose a translate-and-refine approach to this problem in which images are used only by a second-stage decoder. The model is trained jointly to generate a good first-draft translation and to improve on this draft by (i) making better use of the target-language textual context (both left-side and right-side contexts) and (ii) making use of visual context. This approach leads to state-of-the-art results. Additionally, we show that it can recover from erroneous or missing words in the source language.
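
The two-stage data flow described in the abstract can be illustrated with a toy sketch. This is not the authors' model: the paper's decoders are learned neural networks, whereas everything below (the `LEXICON` and `CUES` tables, the `first_pass` and `refine` functions, and the cue-counting rule) is a hypothetical stand-in chosen only to show that the first stage sees text alone, while the second stage sees the whole draft (left and right context) plus image information.

```python
from typing import Dict, List, Set

# Hypothetical toy lexicon (English -> French); the first candidate is the default.
LEXICON: Dict[str, List[str]] = {
    "a": ["une"],
    "bat": ["chauve-souris", "batte"],  # animal vs. baseball bat
}

# Hypothetical cue words that lend support to each ambiguous candidate.
CUES: Dict[str, Set[str]] = {
    "chauve-souris": {"grotte", "animal"},
    "batte": {"baseball", "joueur"},
}


def first_pass(source: List[str]) -> List[str]:
    """Stage 1: text-only draft that always takes the default candidate."""
    return [LEXICON.get(word, [word])[0] for word in source]


def refine(source: List[str], draft: List[str], image_labels: Set[str]) -> List[str]:
    """Stage 2: revisit each draft token using the rest of the draft
    (left and right context) together with the image labels."""
    refined = []
    for i, (src, tok) in enumerate(zip(source, draft)):
        candidates = LEXICON.get(src, [tok])
        context = set(draft[:i]) | set(draft[i + 1:]) | image_labels

        def support(candidate: str) -> int:
            # Number of cue words for this candidate found in the context.
            return len(CUES.get(candidate, set()) & context)

        best = max(candidates, key=support)
        # Keep the draft token unless another candidate has strictly more support.
        refined.append(best if support(best) > support(tok) else tok)
    return refined


source = ["a", "bat"]
draft = first_pass(source)                      # text-only draft
with_image = refine(source, draft, {"baseball"})  # image suggests a baseball scene
without_image = refine(source, draft, set())      # no visual evidence
```

With the `{"baseball"}` label the ambiguous word is revised from the default candidate to `batte`; with no image labels the draft is left unchanged, mirroring the observation that visual input matters only when the text alone is insufficient.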

Comments: Accepted to ACL 2019
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:1906.07701 [cs.CL]
(or arXiv:1906.07701v1 [cs.CL] for this version)
https://doi.org/10.48550/arXiv.1906.07701

Submission history

From: Julia Ive
[v1] Tue, 18 Jun 2019 17:30:30 UTC (2,526 KB)
