Incorporating Test-Time Optimization into Training with Dual Networks for Human Mesh Recovery

Nie, Yongwei; Fan, Mingxian; Long, Chengjiang; Zhang, Qing; Zhu, Jian; Xu, Xuemiao

arXiv:2401.14121 (cs)

[Submitted on 25 Jan 2024 (v1), last revised 30 Oct 2024 (this version, v2)]

Abstract: Human Mesh Recovery (HMR) is the task of estimating a parameterized 3D human mesh from an image. One family of methods first trains a regression model for this problem, then further optimizes the pretrained model for each specific sample individually at test time. However, the pretrained model may not provide an ideal starting point for this test-time optimization. Inspired by meta-learning, we incorporate test-time optimization into training: for each sample in the training batch, we perform one step of test-time optimization before carrying out the actual training optimization over all training samples. In this way, we obtain a meta-model whose meta-parameter is well suited to test-time optimization. At test time, a few test-time optimization steps starting from the meta-parameter yield much higher HMR accuracy than test-time optimization starting from the simply pretrained regression model. Furthermore, we find that test-time HMR objectives differ from training-time objectives, which reduces the effectiveness of learning the meta-model. To solve this problem, we propose a dual-network architecture that unifies the training-time and test-time objectives. Our method, armed with meta-learning and the dual networks, outperforms state-of-the-art regression-based and optimization-based HMR approaches, as validated by extensive experiments. The code is available at this https URL.
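
The training scheme described in the abstract (one test-time optimization step per sample, followed by the training update) can be illustrated with a toy sketch. This is not the authors' implementation: the scalar model `y = w * x`, the squared-error losses, the function names, and all hyperparameters are assumptions chosen so that the gradient through the inner step can be written by hand. The dual-network component is not modeled here.

```python
# Toy illustration of "test-time optimization inside training" (MAML-style).
# Assumptions (not from the paper): a scalar model y = w * x, a squared-error
# test-time loss against an observation y_obs, and a squared-error training
# loss against ground truth y_gt.

def inner_grad(w, x, y_obs):
    """Gradient of the test-time loss (w*x - y_obs)^2 with respect to w."""
    return 2.0 * (w * x - y_obs) * x


def adapt(w, x, y_obs, inner_lr, steps=1):
    """Run `steps` gradient steps of per-sample test-time optimization."""
    for _ in range(steps):
        w = w - inner_lr * inner_grad(w, x, y_obs)
    return w


def meta_train(samples, w0=0.0, inner_lr=0.1, outer_lr=0.05, epochs=200):
    """Learn a meta-parameter w that performs well after one adaptation step."""
    w = w0
    for _ in range(epochs):
        meta_grad = 0.0
        for x, y_obs, y_gt in samples:
            # Inner step: adapt the shared parameter to this single sample.
            w_adapted = adapt(w, x, y_obs, inner_lr, steps=1)
            # Derivative of the adapted parameter with respect to w
            # (exact for this scalar model with one inner step).
            d_adapted_d_w = 1.0 - 2.0 * inner_lr * x * x
            # Outer gradient: training loss evaluated at the adapted
            # parameter, differentiated through the inner step.
            meta_grad += 2.0 * (w_adapted * x - y_gt) * x * d_adapted_d_w
        w -= outer_lr * meta_grad / len(samples)
    return w


# Hypothetical data: (x, test-time observation, ground truth) with y = 2 * x.
samples = [(0.5, 1.0, 1.0), (1.0, 2.0, 2.0), (1.5, 3.0, 3.0)]
w_meta = meta_train(samples)

# At test time, start from the meta-parameter and take a few adaptation steps
# on a new sample whose observation differs from the training trend.
w_test = adapt(w_meta, x=1.0, y_obs=2.5, inner_lr=0.1, steps=5)
```

In this sketch the outer update differentiates through the inner step, which is what makes the learned parameter a good starting point for adaptation rather than merely a good predictor. For a real network the same effect would typically require automatic differentiation with second-order gradients, or a first-order approximation.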

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2401.14121 [cs.CV]
	(or arXiv:2401.14121v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2401.14121

Submission history

From: Mingxian Fan
[v1] Thu, 25 Jan 2024 12:04:53 UTC (23,577 KB)
[v2] Wed, 30 Oct 2024 07:24:42 UTC (23,833 KB)
