Fancy123: One Image to High-Quality 3D Mesh Generation via Plug-and-Play Deformation

Yu, Qiao; Li, Xianzhi; Tang, Yuan; Han, Xu; Hu, Long; Hao, Yixue; Chen, Min

Computer Science > Computer Vision and Pattern Recognition

arXiv:2411.16185 (cs)

[Submitted on 25 Nov 2024]

Title:Fancy123: One Image to High-Quality 3D Mesh Generation via Plug-and-Play Deformation

Authors:Qiao Yu, Xianzhi Li, Yuan Tang, Xu Han, Long Hu, Yixue Hao, Min Chen

View PDF HTML (experimental)

Abstract:Generating 3D meshes from a single image is an important but ill-posed task. Existing methods mainly adopt 2D multiview diffusion models to generate intermediate multiview images, and use the Large Reconstruction Model (LRM) to create the final meshes. However, the multiview images exhibit local inconsistencies, and the meshes often lack fidelity to the input image or look blurry. We propose Fancy123, featuring two enhancement modules and an unprojection operation to address the above three issues, respectively. The appearance enhancement module deforms the 2D multiview images to realign misaligned pixels for better multiview consistency. The fidelity enhancement module deforms the 3D mesh to match the input image. The unprojection of the input image and deformed multiview images onto LRM's generated mesh ensures high clarity, discarding LRM's predicted blurry-looking mesh colors. Extensive qualitative and quantitative experiments verify Fancy123's SoTA performance with significant improvement. Also, the two enhancement modules are plug-and-play and work at inference time, allowing seamless integration into various existing single-image-to-3D methods.

Comments:	Project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2411.16185 [cs.CV]
	(or arXiv:2411.16185v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2411.16185

Submission history

From: Qiao Yu [view email]
[v1] Mon, 25 Nov 2024 08:31:55 UTC (8,920 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Fancy123: One Image to High-Quality 3D Mesh Generation via Plug-and-Play Deformation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Fancy123: One Image to High-Quality 3D Mesh Generation via Plug-and-Play Deformation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators