Graph-Boosted Attentive Network for Semantic Body Parsing

Wang, Tinghuai; Wang, Huiling

arXiv:2407.05924 (cs)

[Submitted on 8 Jul 2024]

Abstract: Human body parsing remains a challenging problem in natural scenes due to multi-instance and inter-part semantic confusions as well as occlusions. This paper proposes a novel approach to decomposing multiple human bodies into semantic part regions in unconstrained environments. Specifically, we propose a convolutional neural network (CNN) architecture that comprises novel semantic and contour attention mechanisms across the feature hierarchy to resolve the semantic ambiguities and boundary localization issues related to semantic body parsing. We further propose to encode estimated pose as higher-level contextual information, which is combined with local semantic cues in a novel graphical model in a principled manner. In this proposed model, the lower-level semantic cues can be recursively updated by propagating higher-level contextual information from the estimated pose, and vice versa, across the graph, so as to alleviate erroneous pose information and pixel-level predictions. We further propose an optimization technique to efficiently derive the solutions. Our proposed method achieves state-of-the-art results on the challenging Pascal Person-Part dataset.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.05924 [cs.CV]
	(or arXiv:2407.05924v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.05924

Submission history

From: Tinghuai Wang
[v1] Mon, 8 Jul 2024 13:32:01 UTC (6,440 KB)
