ID-Unet: Iterative Soft and Hard Deformation for View Synthesis

Yin, Mingyu; Sun, Li; Li, Qingli

arXiv:2103.02264 (cs)

[Submitted on 3 Mar 2021 (v1), last revised 18 Mar 2021 (this version, v5)]

Abstract: View synthesis is usually done by an autoencoder, in which the encoder maps a source view image into a latent content code, and the decoder transforms it into a target view image according to the condition. However, the source contents are often not well preserved in this setting, which leads to unnecessary changes during the view translation. Adding skip connections, as in Unet, alleviates the problem but often causes failures in view conformity. This paper proposes a new architecture that performs the source-to-target deformation iteratively. Instead of simply incorporating the features from multiple layers of the encoder, we design soft and hard deformation modules, which warp the encoder features to the target view at different resolutions and pass the results to the decoder to complement the details. In particular, the current warping flow is used not only to align the feature of the same resolution, but also as an approximation to coarsely deform the high-resolution feature. The residual flow is then estimated and applied at the high resolution, so that the deformation is built up in a coarse-to-fine fashion. To better constrain the model, we synthesize a rough target view image based on the intermediate flows and their warped features. Extensive ablation studies and the final results on two different datasets show the effectiveness of the proposed model.
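
The coarse-to-fine step described in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function names (`warp_bilinear`, `upsample_flow`, `coarse_to_fine_warp`), the (dx, dy) pixel-offset flow convention, the nearest-neighbour flow upsampling, and the choice to apply the residual flow to the coarsely warped feature are all assumptions made for illustration. In the paper the residual flow is estimated by the model; here it is supplied by a hypothetical `estimate_residual` callable.

```python
import numpy as np


def warp_bilinear(feat, flow):
    """Backward-warp feat (H, W, C) by flow (H, W, 2), given in pixels as (dx, dy)."""
    h, w, _ = feat.shape
    ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    # Sampling positions in the source feature, clamped to the border.
    x = np.clip(xs + flow[..., 0], 0, w - 1)
    y = np.clip(ys + flow[..., 1], 0, h - 1)
    x0 = np.floor(x).astype(int)
    y0 = np.floor(y).astype(int)
    x1 = np.minimum(x0 + 1, w - 1)
    y1 = np.minimum(y0 + 1, h - 1)
    wx = (x - x0)[..., None]
    wy = (y - y0)[..., None]
    # Bilinear interpolation between the four neighbouring samples.
    top = (1 - wx) * feat[y0, x0] + wx * feat[y0, x1]
    bottom = (1 - wx) * feat[y1, x0] + wx * feat[y1, x1]
    return (1 - wy) * top + wy * bottom


def upsample_flow(flow, factor=2):
    """Nearest-neighbour upsampling (an assumption); offsets scale with resolution."""
    up = flow.repeat(factor, axis=0).repeat(factor, axis=1)
    return up * factor


def coarse_to_fine_warp(feat_hi, flow_lo, estimate_residual):
    """Coarsely deform feat_hi with the low-resolution flow, then refine it.

    estimate_residual is a hypothetical stand-in for the learned module that
    predicts the residual flow from the coarsely warped feature.
    """
    coarse_flow = upsample_flow(flow_lo)
    coarse = warp_bilinear(feat_hi, coarse_flow)
    residual = estimate_residual(coarse)  # shape (H, W, 2)
    fine = warp_bilinear(coarse, residual)
    return coarse, fine


if __name__ == "__main__":
    feat_hi = np.random.rand(8, 8, 3)
    flow_lo = np.zeros((4, 4, 2))
    flow_lo[..., 0] = 0.5  # half a pixel at low resolution = one pixel at high resolution
    zero_residual = lambda c: np.zeros(c.shape[:2] + (2,))
    coarse, fine = coarse_to_fine_warp(feat_hi, flow_lo, zero_residual)
    print(coarse.shape, fine.shape)
```

Splitting the deformation this way means the high-resolution stage only has to account for a small correction on top of the upsampled coarse flow, which is the sense in which the deformation is built up from coarse to fine.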

Comments:	CVPR 2021 (Oral)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2103.02264 [cs.CV]
	(or arXiv:2103.02264v5 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2103.02264

Submission history

From: Mingyu Yin
[v1] Wed, 3 Mar 2021 09:02:00 UTC (4,312 KB)
[v2] Thu, 4 Mar 2021 06:14:32 UTC (4,314 KB)
[v3] Mon, 8 Mar 2021 06:43:16 UTC (4,431 KB)
[v4] Sun, 14 Mar 2021 03:03:57 UTC (4,444 KB)
[v5] Thu, 18 Mar 2021 06:13:30 UTC (41,126 KB)
