Image Re-Identification: Where Self-supervision Meets Vision-Language Learning

Wang, Bin; Liang, Yuying; Cai, Lei; Huang, Huakun; Zeng, Huanqiang

arXiv:2407.20647 (cs)

[Submitted on 30 Jul 2024]

Abstract: Recently, large-scale vision-language pre-trained models such as CLIP have shown impressive performance in image re-identification (ReID). In this work, we explore whether self-supervision can aid the use of CLIP for image ReID tasks. Specifically, we propose SVLL-ReID, the first attempt to integrate self-supervision and pre-trained CLIP via two training stages to facilitate image ReID. We observe that: 1) incorporating language self-supervision in the first training stage makes the learnable text prompts more distinguishable, and 2) incorporating vision self-supervision in the second training stage makes the image features learned by the image encoder more discriminative. These observations imply that: 1) text prompt learning in the first stage benefits from language self-supervision, and 2) image feature learning in the second stage benefits from vision self-supervision. Together, these benefits account for the performance gain of the proposed SVLL-ReID. In experiments on six image ReID benchmark datasets without any concrete text labels, the proposed SVLL-ReID achieves the best overall performance compared with state-of-the-art methods. Code will be publicly available at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.20647 [cs.CV]
	(or arXiv:2407.20647v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.20647

Submission history

From: Bin Wang
[v1] Tue, 30 Jul 2024 08:43:53 UTC (749 KB)
