Learning Canonical Representations for Scene Graph to Image Generation

Herzig, Roei; Bar, Amir; Xu, Huijuan; Chechik, Gal; Darrell, Trevor; Globerson, Amir

Computer Science > Computer Vision and Pattern Recognition

arXiv:1912.07414 (cs)

[Submitted on 16 Dec 2019 (v1), last revised 24 Aug 2020 (this version, v5)]

Title:Learning Canonical Representations for Scene Graph to Image Generation

Authors:Roei Herzig, Amir Bar, Huijuan Xu, Gal Chechik, Trevor Darrell, Amir Globerson

View PDF

Abstract:Generating realistic images of complex visual scenes becomes challenging when one wishes to control the structure of the generated images. Previous approaches showed that scenes with few entities can be controlled using scene graphs, but this approach struggles as the complexity of the graph (the number of objects and edges) increases. In this work, we show that one limitation of current methods is their inability to capture semantic equivalence in graphs. We present a novel model that addresses these issues by learning canonical graph representations from the data, resulting in improved image generation for complex visual scenes. Our model demonstrates improved empirical performance on large scene graphs, robustness to noise in the input scene graph, and generalization on semantically equivalent graphs. Finally, we show improved performance of the model on three different benchmarks: Visual Genome, COCO, and CLEVR.

Comments:	ECCV 2020
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1912.07414 [cs.CV]
	(or arXiv:1912.07414v5 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1912.07414

Submission history

From: Roei Herzig [view email]
[v1] Mon, 16 Dec 2019 14:39:45 UTC (4,981 KB)
[v2] Wed, 18 Mar 2020 10:46:32 UTC (7,205 KB)
[v3] Sat, 18 Jul 2020 10:54:01 UTC (9,331 KB)
[v4] Wed, 5 Aug 2020 11:36:06 UTC (9,331 KB)
[v5] Mon, 24 Aug 2020 12:29:45 UTC (9,331 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2019-12

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Roei Herzig
Amir Bar
Huijuan Xu
Gal Chechik
Trevor Darrell

…

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Learning Canonical Representations for Scene Graph to Image Generation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Learning Canonical Representations for Scene Graph to Image Generation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators