HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting

Lu, Wenquan; Xu, Yufei; Zhang, Jing; Wang, Chaoyue; Tao, Dacheng

Computer Science > Computer Vision and Pattern Recognition

arXiv:2311.17957 (cs)

[Submitted on 29 Nov 2023 (v1), last revised 16 Aug 2024 (this version, v2)]

Title:HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting

Authors:Wenquan Lu, Yufei Xu, Jing Zhang, Chaoyue Wang, Dacheng Tao

View PDF HTML (experimental)

Abstract:Diffusion models have achieved remarkable success in generating realistic images but suffer from generating accurate human hands, such as incorrect finger counts or irregular shapes. This difficulty arises from the complex task of learning the physical structure and pose of hands from training images, which involves extensive deformations and occlusions. For correct hand generation, our paper introduces a lightweight post-processing solution called $\textbf{HandRefiner}$. HandRefiner employs a conditional inpainting approach to rectify malformed hands while leaving other parts of the image untouched. We leverage the hand mesh reconstruction model that consistently adheres to the correct number of fingers and hand shape, while also being capable of fitting the desired hand pose in the generated image. Given a generated failed image due to malformed hands, we utilize ControlNet modules to re-inject such correct hand information. Additionally, we uncover a phase transition phenomenon within ControlNet as we vary the control strength. It enables us to take advantage of more readily available synthetic data without suffering from the domain gap between realistic and synthetic hands. Experiments demonstrate that HandRefiner can significantly improve the generation quality quantitatively and qualitatively. The code is available at this https URL .

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2311.17957 [cs.CV]
	(or arXiv:2311.17957v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2311.17957

Submission history

From: Wenquan Lu [view email]
[v1] Wed, 29 Nov 2023 08:52:08 UTC (46,268 KB)
[v2] Fri, 16 Aug 2024 05:35:21 UTC (31,636 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators