HumanBench: Towards General Human-centric Perception with Projector Assisted Pretraining

Tang, Shixiang; Chen, Cheng; Xie, Qingsong; Chen, Meilin; Wang, Yizhou; Ci, Yuanzheng; Bai, Lei; Zhu, Feng; Yang, Haiyang; Yi, Li; Zhao, Rui; Ouyang, Wanli

Computer Science > Computer Vision and Pattern Recognition

arXiv:2303.05675 (cs)

[Submitted on 10 Mar 2023]

Title:HumanBench: Towards General Human-centric Perception with Projector Assisted Pretraining

Authors:Shixiang Tang, Cheng Chen, Qingsong Xie, Meilin Chen, Yizhou Wang, Yuanzheng Ci, Lei Bai, Feng Zhu, Haiyang Yang, Li Yi, Rui Zhao, Wanli Ouyang

View PDF

Abstract:Human-centric perceptions include a variety of vision tasks, which have widespread industrial applications, including surveillance, autonomous driving, and the metaverse. It is desirable to have a general pretrain model for versatile human-centric downstream tasks. This paper forges ahead along this path from the aspects of both benchmark and pretraining methods. Specifically, we propose a \textbf{HumanBench} based on existing datasets to comprehensively evaluate on the common ground the generalization abilities of different pretraining methods on 19 datasets from 6 diverse downstream tasks, including person ReID, pose estimation, human parsing, pedestrian attribute recognition, pedestrian detection, and crowd counting. To learn both coarse-grained and fine-grained knowledge in human bodies, we further propose a \textbf{P}rojector \textbf{A}ssis\textbf{T}ed \textbf{H}ierarchical pretraining method (\textbf{PATH}) to learn diverse knowledge at different granularity levels. Comprehensive evaluations on HumanBench show that our PATH achieves new state-of-the-art results on 17 downstream datasets and on-par results on the other 2 datasets. The code will be publicly at \href{this https URL}{this https URL}.

Comments:	Accepted to CVPR2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2303.05675 [cs.CV]
	(or arXiv:2303.05675v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2303.05675

Submission history

From: Shixiang Tang [view email]
[v1] Fri, 10 Mar 2023 02:57:07 UTC (2,297 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:HumanBench: Towards General Human-centric Perception with Projector Assisted Pretraining

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:HumanBench: Towards General Human-centric Perception with Projector Assisted Pretraining

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators