DiMeR: Disentangled Mesh Reconstruction Model

Jiang, Lutao; Lin, Jiantao; Chen, Kanghao; Ge, Wenhang; Yang, Xin; Jiang, Yifan; Lyu, Yuanhuiyi; Zheng, Xu; Chen, Yingcong

arXiv:2504.17670 (cs)

[Submitted on 24 Apr 2025]

Abstract: With the advent of large-scale 3D datasets, feed-forward 3D generative models, such as the Large Reconstruction Model (LRM), have gained significant attention and achieved remarkable success. However, we observe that RGB images often lead to conflicting training objectives and lack the necessary clarity for geometry reconstruction. In this paper, we revisit the inductive biases associated with mesh reconstruction and introduce DiMeR, a novel disentangled dual-stream feed-forward model for sparse-view mesh reconstruction. The key idea is to disentangle both the input and the framework into geometry and texture parts, thereby reducing the training difficulty of each part, in line with the principle of Occam's razor. Because normal maps are strictly consistent with geometry and accurately capture surface variations, we use normal maps as the exclusive input to the geometry branch, which reduces the complexity of the mapping between the network's input and output. Moreover, we improve the mesh extraction algorithm so that 3D ground-truth supervision can be introduced. For the texture branch, we use RGB images as input to obtain the textured mesh. Overall, DiMeR demonstrates robust capabilities across various tasks, including sparse-view reconstruction, single-image-to-3D, and text-to-3D. Extensive experiments show that DiMeR significantly outperforms previous methods, achieving over 30% improvement in Chamfer Distance on the GSO and OmniObject3D datasets.
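
The abstract reports its headline result as a relative improvement in Chamfer Distance. As a rough illustration of what that metric measures, the sketch below computes a symmetric Chamfer Distance between two small point sets with NumPy, plus the fractional reduction of an error metric. It is only a sketch under assumptions: the abstract does not state the exact convention used in the paper (squared or unsquared distances, sum or mean over directions, number of sampled surface points), so the unsquared, mean-based form here is one common variant, and the function names `chamfer_distance` and `relative_improvement` are hypothetical.

```python
import numpy as np


def chamfer_distance(p: np.ndarray, q: np.ndarray) -> float:
    """Symmetric Chamfer Distance between point sets p (N, 3) and q (M, 3).

    Assumed convention: unsquared Euclidean distances, averaged per direction
    and summed. The paper may use a different variant.
    """
    # Pairwise Euclidean distances, shape (N, M).
    diff = p[:, None, :] - q[None, :, :]
    dist = np.linalg.norm(diff, axis=-1)
    # Mean nearest-neighbour distance in each direction.
    p_to_q = dist.min(axis=1).mean()
    q_to_p = dist.min(axis=0).mean()
    return float(p_to_q + q_to_p)


def relative_improvement(baseline: float, ours: float) -> float:
    """Fractional reduction of an error metric where lower is better."""
    return (baseline - ours) / baseline


# Toy point sets for illustration only; these are not data from the paper.
gt = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]])
pred = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.5]])
cd = chamfer_distance(pred, gt)
print(f"Chamfer Distance: {cd:.3f}")
```

In practice the point sets would be sampled from the predicted and ground-truth mesh surfaces, and the dense pairwise matrix would be replaced by a nearest-neighbour structure such as a k-d tree once the sets contain many thousands of points.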

Comments:	Project Page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2504.17670 [cs.CV]
	(or arXiv:2504.17670v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2504.17670

Submission history

From: Lutao Jiang
[v1] Thu, 24 Apr 2025 15:39:20 UTC (15,445 KB)
