Image Captioning with Unseen Objects

Demirel, Berkan; Cinbis, Ramazan Gokberk; Ikizler-Cinbis, Nazli

Computer Science > Computer Vision and Pattern Recognition

arXiv:1908.00047 (cs)

[Submitted on 31 Jul 2019]

Title:Image Captioning with Unseen Objects

Authors:Berkan Demirel, Ramazan Gokberk Cinbis, Nazli Ikizler-Cinbis

View PDF

Abstract:Image caption generation is a long standing and challenging problem at the intersection of computer vision and natural language processing. A number of recently proposed approaches utilize a fully supervised object recognition model within the captioning approach. Such models, however, tend to generate sentences which only consist of objects predicted by the recognition models, excluding instances of the classes without labelled training examples. In this paper, we propose a new challenging scenario that targets the image captioning problem in a fully zero-shot learning setting, where the goal is to be able to generate captions of test images containing objects that are not seen during training. The proposed approach jointly uses a novel zero-shot object detection model and a template-based sentence generator. Our experiments show promising results on the COCO dataset.

Comments:	To appear in British Machine Vision Conference (BMVC) 2019
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1908.00047 [cs.CV]
	(or arXiv:1908.00047v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1908.00047

Submission history

From: Berkan Demirel [view email]
[v1] Wed, 31 Jul 2019 18:48:52 UTC (5,304 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2019-08

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Berkan Demirel
Ramazan Gokberk Cinbis
Nazli Ikizler-Cinbis

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Image Captioning with Unseen Objects

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Image Captioning with Unseen Objects

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators