SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations

Zhao, Shuting; Bai, Linxin; Shao, Liangjing; Zhang, Ye; Chen, Xinrong

Computer Science > Computer Vision and Pattern Recognition

arXiv:2504.18332 (cs)

[Submitted on 25 Apr 2025]

Title:SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations

Authors:Shuting Zhao, Linxin Bai, Liangjing Shao, Ye Zhang, Xinrong Chen

View PDF HTML (experimental)

Abstract:The growing applications of AR/VR increase the demand for real-time full-body pose estimation from Head-Mounted Displays (HMDs). Although HMDs provide joint signals from the head and hands, reconstructing a full-body pose remains challenging due to the unconstrained lower body. Recent advancements often rely on conventional neural networks and generative models to improve performance in this task, such as Transformers and diffusion models. However, these approaches struggle to strike a balance between achieving precise pose reconstruction and maintaining fast inference speed. To overcome these challenges, a lightweight and efficient model, SSD-Poser, is designed for robust full-body motion estimation from sparse observations. SSD-Poser incorporates a well-designed hybrid encoder, State Space Attention Encoders, to adapt the state space duality to complex motion poses and enable real-time realistic pose reconstruction. Moreover, a Frequency-Aware Decoder is introduced to mitigate jitter caused by variable-frequency motion signals, remarkably enhancing the motion smoothness. Comprehensive experiments on the AMASS dataset demonstrate that SSD-Poser achieves exceptional accuracy and computational efficiency, showing outstanding inference efficiency compared to state-of-the-art methods.

Comments:	9 pages, 6 figures, conference ICMR 2025
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
MSC classes:	68U05
Cite as:	arXiv:2504.18332 [cs.CV]
	(or arXiv:2504.18332v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2504.18332

Submission history

From: Shuting Zhao [view email]
[v1] Fri, 25 Apr 2025 13:18:06 UTC (1,904 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators