ModSkill: Physical Character Skill Modularization

Huang, Yiming; Dou, Zhiyang; Liu, Lingjie

Computer Science > Computer Vision and Pattern Recognition

arXiv:2502.14140 (cs)

[Submitted on 19 Feb 2025]

Title:ModSkill: Physical Character Skill Modularization

Authors:Yiming Huang, Zhiyang Dou, Lingjie Liu

View PDF HTML (experimental)

Abstract:Human motion is highly diverse and dynamic, posing challenges for imitation learning algorithms that aim to generalize motor skills for controlling simulated characters. Previous methods typically rely on a universal full-body controller for tracking reference motion (tracking-based model) or a unified full-body skill embedding space (skill embedding). However, these approaches often struggle to generalize and scale to larger motion datasets. In this work, we introduce a novel skill learning framework, ModSkill, that decouples complex full-body skills into compositional, modular skills for independent body parts. Our framework features a skill modularization attention layer that processes policy observations into modular skill embeddings that guide low-level controllers for each body part. We also propose an Active Skill Learning approach with Generative Adaptive Sampling, using large motion generation models to adaptively enhance policy learning in challenging tracking scenarios. Our results show that this modularized skill learning framework, enhanced by generative sampling, outperforms existing methods in precise full-body motion tracking and enables reusable skill embeddings for diverse goal-driven tasks.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Robotics (cs.RO)
Cite as:	arXiv:2502.14140 [cs.CV]
	(or arXiv:2502.14140v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2502.14140

Submission history

From: Yiming Huang [view email]
[v1] Wed, 19 Feb 2025 22:55:49 UTC (5,015 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:ModSkill: Physical Character Skill Modularization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:ModSkill: Physical Character Skill Modularization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators