Expressive Forecasting of 3D Whole-body Human Motions

Ding, Pengxiang; Cui, Qiongjie; Zhang, Min; Liu, Mengyuan; Wang, Haofan; Wang, Donglin

Computer Science > Computer Vision and Pattern Recognition

arXiv:2312.11972 (cs)

[Submitted on 19 Dec 2023 (v1), last revised 4 Apr 2024 (this version, v2)]

Title:Expressive Forecasting of 3D Whole-body Human Motions

Authors:Pengxiang Ding, Qiongjie Cui, Min Zhang, Mengyuan Liu, Haofan Wang, Donglin Wang

View PDF HTML (experimental)

Abstract:Human motion forecasting, with the goal of estimating future human behavior over a period of time, is a fundamental task in many real-world applications. However, existing works typically concentrate on predicting the major joints of the human body without considering the delicate movements of the human hands. In practical applications, hand gesture plays an important role in human communication with the real world, and expresses the primary intention of human beings. In this work, we are the first to formulate a whole-body human pose forecasting task, which jointly predicts the future body and hand activities. Correspondingly, we propose a novel Encoding-Alignment-Interaction (EAI) framework that aims to predict both coarse (body joints) and fine-grained (gestures) activities collaboratively, enabling expressive and cross-facilitated forecasting of 3D whole-body human motions. Specifically, our model involves two key constituents: cross-context alignment (XCA) and cross-context interaction (XCI). Considering the heterogeneous information within the whole-body, XCA aims to align the latent features of various human components, while XCI focuses on effectively capturing the context interaction among the human components. We conduct extensive experiments on a newly-introduced large-scale benchmark and achieve state-of-the-art performance. The code is public for research purposes at this https URL.

Comments:	Accepted by AAAI24
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2312.11972 [cs.CV]
	(or arXiv:2312.11972v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2312.11972

Submission history

From: Pengxiang Ding [view email]
[v1] Tue, 19 Dec 2023 09:09:46 UTC (5,607 KB)
[v2] Thu, 4 Apr 2024 16:41:22 UTC (5,607 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Expressive Forecasting of 3D Whole-body Human Motions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Expressive Forecasting of 3D Whole-body Human Motions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators