Multilingual Multi-modal Embeddings for Natural Language Processing

Calixto, Iacer; Liu, Qun; Campbell, Nick

Computer Science > Computation and Language

arXiv:1702.01101 (cs)

[Submitted on 3 Feb 2017]

Title:Multilingual Multi-modal Embeddings for Natural Language Processing

Authors:Iacer Calixto, Qun Liu, Nick Campbell

View PDF

Abstract:We propose a novel discriminative model that learns embeddings from multilingual and multi-modal data, meaning that our model can take advantage of images and descriptions in multiple languages to improve embedding quality. To that end, we introduce a modification of a pairwise contrastive estimation optimisation function as our training objective. We evaluate our embeddings on an image-sentence ranking (ISR), a semantic textual similarity (STS), and a neural machine translation (NMT) task. We find that the additional multilingual signals lead to improvements on both the ISR and STS tasks, and the discriminative cost can also be used in re-ranking $n$-best lists produced by NMT models, yielding strong improvements.

Comments:	4 pages (5 including references), no figures
Subjects:	Computation and Language (cs.CL)
ACM classes:	I.2.7
Cite as:	arXiv:1702.01101 [cs.CL]
	(or arXiv:1702.01101v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1702.01101

Submission history

From: Iacer Calixto [view email]
[v1] Fri, 3 Feb 2017 18:19:47 UTC (25 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2017-02

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Iacer Calixto
Qun Liu
Nick Campbell

export BibTeX citation

Computer Science > Computation and Language

Title:Multilingual Multi-modal Embeddings for Natural Language Processing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Multilingual Multi-modal Embeddings for Natural Language Processing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators