GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text

Shim, Gyumin; Lee, Sangmin; Choo, Jaegul

Computer Science > Computer Vision and Pattern Recognition

arXiv:2502.11642 (cs)

[Submitted on 17 Feb 2025]

Title:GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text

Authors:Gyumin Shim, Sangmin Lee, Jaegul Choo

View PDF HTML (experimental)

Abstract:In this paper, we introduce GaussianMotion, a novel human rendering model that generates fully animatable scenes aligned with textual descriptions using Gaussian Splatting. Although existing methods achieve reasonable text-to-3D generation of human bodies using various 3D representations, they often face limitations in fidelity and efficiency, or primarily focus on static models with limited pose control. In contrast, our method generates fully animatable 3D avatars by combining deformable 3D Gaussian Splatting with text-to-3D score distillation, achieving high fidelity and efficient rendering for arbitrary poses. By densely generating diverse random poses during optimization, our deformable 3D human model learns to capture a wide range of natural motions distilled from a pose-conditioned diffusion model in an end-to-end manner. Furthermore, we propose Adaptive Score Distillation that effectively balances realistic detail and smoothness to achieve optimal 3D results. Experimental results demonstrate that our approach outperforms existing baselines by producing high-quality textures in both static and animated results, and by generating diverse 3D human models from various textual inputs.

Comments:	8 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2502.11642 [cs.CV]
	(or arXiv:2502.11642v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2502.11642

Submission history

From: GyuMin Shim [view email]
[v1] Mon, 17 Feb 2025 10:36:36 UTC (26,460 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators