Better Text Understanding Through Image-To-Text Transfer

Kurach, Karol; Gelly, Sylvain; Jastrzebski, Michal; Haeusser, Philip; Teytaud, Olivier; Vincent, Damien; Bousquet, Olivier

Computer Science > Computation and Language

arXiv:1705.08386 (cs)

[Submitted on 23 May 2017 (v1), last revised 26 May 2017 (this version, v2)]

Title:Better Text Understanding Through Image-To-Text Transfer

Authors:Karol Kurach, Sylvain Gelly, Michal Jastrzebski, Philip Haeusser, Olivier Teytaud, Damien Vincent, Olivier Bousquet

View PDF

Abstract:Generic text embeddings are successfully used in a variety of tasks. However, they are often learnt by capturing the co-occurrence structure from pure text corpora, resulting in limitations of their ability to generalize. In this paper, we explore models that incorporate visual information into the text representation. Based on comprehensive ablation studies, we propose a conceptually simple, yet well performing architecture. It outperforms previous multimodal approaches on a set of well established benchmarks. We also improve the state-of-the-art results for image-related text datasets, using orders of magnitude less data.

Subjects:	Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1705.08386 [cs.CL]
	(or arXiv:1705.08386v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1705.08386

Submission history

From: Karol Kurach [view email]
[v1] Tue, 23 May 2017 16:06:32 UTC (2,837 KB)
[v2] Fri, 26 May 2017 08:08:20 UTC (2,837 KB)

Computer Science > Computation and Language

Title:Better Text Understanding Through Image-To-Text Transfer

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Better Text Understanding Through Image-To-Text Transfer

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators