DepGAN: Leveraging Depth Maps for Handling Occlusions and Transparency in Image Composition

Ghoneim, Amr; Poovvancheri, Jiju; Akiyama, Yasushi; Chen, Dong

Computer Science > Computer Vision and Pattern Recognition

arXiv:2407.11890 (cs)

[Submitted on 16 Jul 2024]

Title:DepGAN: Leveraging Depth Maps for Handling Occlusions and Transparency in Image Composition

Authors:Amr Ghoneim, Jiju Poovvancheri, Yasushi Akiyama, Dong Chen

View PDF HTML (experimental)

Abstract:Image composition is a complex task which requires a lot of information about the scene for an accurate and realistic composition, such as perspective, lighting, shadows, occlusions, and object interactions. Previous methods have predominantly used 2D information for image composition, neglecting the potentials of 3D spatial information. In this work, we propose DepGAN, a Generative Adversarial Network that utilizes depth maps and alpha channels to rectify inaccurate occlusions and enhance transparency effects in image composition. Central to our network is a novel loss function called Depth Aware Loss which quantifies the pixel wise depth difference to accurately delineate occlusion boundaries while compositing objects at different depth levels. Furthermore, we enhance our network's learning process by utilizing opacity data, enabling it to effectively manage compositions involving transparent and semi-transparent objects. We tested our model against state-of-the-art image composition GANs on benchmark (both real and synthetic) datasets. The results reveal that DepGAN significantly outperforms existing methods in terms of accuracy of object placement semantics, transparency and occlusion handling, both visually and quantitatively. Our code is available at this https URL.

Comments:	10 pages, 13 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.11890 [cs.CV]
	(or arXiv:2407.11890v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.11890

Submission history

From: Jiju Peethambaran Poovvancheri [view email]
[v1] Tue, 16 Jul 2024 16:18:40 UTC (2,873 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:DepGAN: Leveraging Depth Maps for Handling Occlusions and Transparency in Image Composition

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:DepGAN: Leveraging Depth Maps for Handling Occlusions and Transparency in Image Composition

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators