Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval

Ma, Zhe; Dong, Jianfeng; Ji, Shouling; Liu, Zhenguang; Zhang, Xuhong; Wang, Zonghui; He, Sifeng; Qian, Feng; Zhang, Xiaobo; Yang, Lei

Computer Science > Computer Vision and Pattern Recognition

arXiv:2312.09716 (cs)

[Submitted on 15 Dec 2023]

Title:Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval

Authors:Zhe Ma, Jianfeng Dong, Shouling Ji, Zhenguang Liu, Xuhong Zhang, Zonghui Wang, Sifeng He, Feng Qian, Xiaobo Zhang, Lei Yang

View PDF HTML (experimental)

Abstract:Visual retrieval aims to search for the most relevant visual items, e.g., images and videos, from a candidate gallery with a given query item. Accuracy and efficiency are two competing objectives in retrieval tasks. Instead of crafting a new method pursuing further improvement on accuracy, in this paper we propose a multi-teacher distillation framework Whiten-MTD, which is able to transfer knowledge from off-the-shelf pre-trained retrieval models to a lightweight student model for efficient visual retrieval. Furthermore, we discover that the similarities obtained by different retrieval models are diversified and incommensurable, which makes it challenging to jointly distill knowledge from multiple models. Therefore, we propose to whiten the output of teacher models before fusion, which enables effective multi-teacher distillation for retrieval models. Whiten-MTD is conceptually simple and practically effective. Extensive experiments on two landmark image retrieval datasets and one video retrieval dataset demonstrate the effectiveness of our proposed method, and its good balance of retrieval performance and efficiency. Our source code is released at this https URL.

Comments:	Accepted by AAAI 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2312.09716 [cs.CV]
	(or arXiv:2312.09716v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2312.09716

Submission history

From: Zhe Ma [view email]
[v1] Fri, 15 Dec 2023 11:43:56 UTC (504 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators