DiffusionPhase: Motion Diffusion in Frequency Domain

Wan, Weilin; Huang, Yiming; Wu, Shutong; Komura, Taku; Wang, Wenping; Jayaraman, Dinesh; Liu, Lingjie

Computer Science > Computer Vision and Pattern Recognition

arXiv:2312.04036 (cs)

[Submitted on 7 Dec 2023]

Title:DiffusionPhase: Motion Diffusion in Frequency Domain

Authors:Weilin Wan, Yiming Huang, Shutong Wu, Taku Komura, Wenping Wang, Dinesh Jayaraman, Lingjie Liu

View PDF

Abstract:In this study, we introduce a learning-based method for generating high-quality human motion sequences from text descriptions (e.g., ``A person walks forward"). Existing techniques struggle with motion diversity and smooth transitions in generating arbitrary-length motion sequences, due to limited text-to-motion datasets and the pose representations used that often lack expressiveness or compactness. To address these issues, we propose the first method for text-conditioned human motion generation in the frequency domain of motions. We develop a network encoder that converts the motion space into a compact yet expressive parameterized phase space with high-frequency details encoded, capturing the local periodicity of motions in time and space with high accuracy. We also introduce a conditional diffusion model for predicting periodic motion parameters based on text descriptions and a start pose, efficiently achieving smooth transitions between motion sequences associated with different text descriptions. Experiments demonstrate that our approach outperforms current methods in generating a broader variety of high-quality motions, and synthesizing long sequences with natural transitions.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2312.04036 [cs.CV]
	(or arXiv:2312.04036v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2312.04036

Submission history

From: Weilin Wan [view email]
[v1] Thu, 7 Dec 2023 04:39:22 UTC (19,221 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:DiffusionPhase: Motion Diffusion in Frequency Domain

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:DiffusionPhase: Motion Diffusion in Frequency Domain

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators