Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders

Chen, Rui; Zhang, Jianfeng; Liang, Yixun; Luo, Guan; Li, Weiyu; Liu, Jiarui; Li, Xiu; Long, Xiaoxiao; Feng, Jiashi; Tan, Ping

Computer Science > Computer Vision and Pattern Recognition

arXiv:2412.17808 (cs)

[Submitted on 23 Dec 2024 (v1), last revised 24 Dec 2024 (this version, v2)]

Title:Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders

Authors:Rui Chen, Jianfeng Zhang, Yixun Liang, Guan Luo, Weiyu Li, Jiarui Liu, Xiu Li, Xiaoxiao Long, Jiashi Feng, Ping Tan

View PDF HTML (experimental)

Abstract:Recent 3D content generation pipelines commonly employ Variational Autoencoders (VAEs) to encode shapes into compact latent representations for diffusion-based generation. However, the widely adopted uniform point sampling strategy in Shape VAE training often leads to a significant loss of geometric details, limiting the quality of shape reconstruction and downstream generation tasks. We present Dora-VAE, a novel approach that enhances VAE reconstruction through our proposed sharp edge sampling strategy and a dual cross-attention mechanism. By identifying and prioritizing regions with high geometric complexity during training, our method significantly improves the preservation of fine-grained shape features. Such sampling strategy and the dual attention mechanism enable the VAE to focus on crucial geometric details that are typically missed by uniform sampling approaches. To systematically evaluate VAE reconstruction quality, we additionally propose Dora-bench, a benchmark that quantifies shape complexity through the density of sharp edges, introducing a new metric focused on reconstruction accuracy at these salient geometric features. Extensive experiments on the Dora-bench demonstrate that Dora-VAE achieves comparable reconstruction quality to the state-of-the-art dense XCube-VAE while requiring a latent space at least 8$\times$ smaller (1,280 vs. > 10,000 codes). We will release our code and benchmark dataset to facilitate future research in 3D shape modeling.

Comments:	Project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2412.17808 [cs.CV]
	(or arXiv:2412.17808v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2412.17808

Submission history

From: Rui Chen [view email]
[v1] Mon, 23 Dec 2024 18:59:06 UTC (25,026 KB)
[v2] Tue, 24 Dec 2024 11:02:29 UTC (25,026 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators