Improving Visual Recognition with Hyperbolical Visual Hierarchy Mapping

Kwon, Hyeongjun; Jang, Jinhyun; Kim, Jin; Kim, Kwonyoung; Sohn, Kwanghoon

arXiv:2404.00974 (cs)

[Submitted on 1 Apr 2024]

Abstract: Visual scenes are naturally organized in a hierarchy, where a coarse semantic is recursively composed of several fine details. Exploring such a visual hierarchy is crucial for recognizing the complex relations among visual elements, leading to comprehensive scene understanding. In this paper, we propose a Visual Hierarchy Mapper (Hi-Mapper), a novel approach for enhancing the structured understanding of pre-trained Deep Neural Networks (DNNs). Hi-Mapper investigates the hierarchical organization of the visual scene by 1) pre-defining a hierarchy tree through the encapsulation of probability densities; and 2) learning the hierarchical relations in hyperbolic space with a novel hierarchical contrastive loss. The pre-defined hierarchy tree recursively interacts with the visual features of the pre-trained DNNs through hierarchy decomposition and encoding procedures, thereby effectively identifying the visual hierarchy and enhancing the recognition of an entire scene. Extensive experiments demonstrate that Hi-Mapper significantly enhances the representation capability of DNNs, leading to improved performance on various tasks, including image classification and dense prediction.
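
As background for the phrase "hierarchical relations in hyperbolic space", the sketch below computes the geodesic distance in the Poincaré ball, one common model of hyperbolic space. The abstract does not say which model or distance Hi-Mapper uses, so the choice of the Poincaré ball and the function name `poincare_distance` are assumptions made purely for illustration; this is not the authors' implementation.

```python
import math


def poincare_distance(u, v):
    """Geodesic distance between two points strictly inside the unit Poincare ball.

    Illustrative only: the abstract does not specify the hyperbolic model used.
    """
    sq_norm_u = sum(x * x for x in u)
    sq_norm_v = sum(x * x for x in v)
    if sq_norm_u >= 1.0 or sq_norm_v >= 1.0:
        raise ValueError("points must lie strictly inside the unit ball")
    sq_diff = sum((a - b) ** 2 for a, b in zip(u, v))
    # d(u, v) = arcosh(1 + 2 * ||u - v||^2 / ((1 - ||u||^2) * (1 - ||v||^2)))
    return math.acosh(1.0 + 2.0 * sq_diff / ((1.0 - sq_norm_u) * (1.0 - sq_norm_v)))


# The same Euclidean gap (0.1) spans a larger hyperbolic distance near the boundary.
near_origin = poincare_distance([0.0, 0.0], [0.1, 0.0])
near_boundary = poincare_distance([0.8, 0.0], [0.9, 0.0])
```

Distances grow rapidly toward the boundary of the ball, which is the property that generally makes hyperbolic space a natural fit for tree-like structures: many fine-grained nodes can sit near the boundary while remaining well separated from one another.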

Comments:	This paper is accepted to CVPR 2024. The supplementary material is included. The code is available at \url{this https URL}
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2404.00974 [cs.CV]
	(or arXiv:2404.00974v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2404.00974

Submission history

From: HyeongJun Kwon
[v1] Mon, 1 Apr 2024 07:45:42 UTC (13,056 KB)
