Weakly-Supervised Conditional Embedding for Referred Visual Search

Lepage, Simon; Mary, Jérémie; Picard, David

arXiv:2306.02928v1 (cs)

[Submitted on 5 Jun 2023 (this version), latest version 15 May 2024 (v3)]

Abstract: This paper presents a new approach to image similarity search in the context of fashion, a domain with inherent ambiguity due to the multiple ways in which images can be considered similar. We introduce the concept of Referred Visual Search (RVS), where users provide additional information to define the desired similarity. We present a new dataset, LAION-RVS-Fashion, consisting of 272K fashion products with 842K images extracted from LAION, designed explicitly for this task. We then propose an innovative method for learning conditional embeddings using weakly-supervised training, achieving a 6% increase in Recall at one (R@1) against a gallery with 2M distractors, compared to classical approaches based on explicit attention and filtering. The proposed method demonstrates robustness, maintaining similar R@1 when dealing with 2.5 times as many distractors as the baseline methods. We believe this is a step forward in the emerging field of Referred Visual Search both in terms of accessible data and approach. Code, data and models are available at this https URL.
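
The abstract reports results as Recall at one (R@1) against a gallery containing distractors. As a rough illustration of how such a metric is commonly computed, here is a minimal sketch. It is not the authors' evaluation code: the function names, the toy embeddings, and the choice of cosine similarity are all assumptions made for this example.

```python
import math


def cosine(u, v):
    """Cosine similarity between two vectors (assumed similarity measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def recall_at_1(queries, gallery, targets):
    """Fraction of queries whose top-ranked gallery item is the correct target.

    queries: list of query embedding vectors
    gallery: list of gallery embedding vectors (true targets plus distractors)
    targets: targets[i] is the gallery index of the correct item for query i
    """
    hits = 0
    for q, t in zip(queries, targets):
        # Rank the whole gallery and keep only the single best match.
        best = max(range(len(gallery)), key=lambda j: cosine(q, gallery[j]))
        hits += best == t
    return hits / len(queries)


# Hypothetical toy data: gallery indices 0 and 1 are true targets,
# indices 2 and 3 are distractors.
gallery = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7], [-1.0, 0.2]]
queries = [[0.9, 0.1], [0.6, 0.8]]
targets = [0, 1]
score = recall_at_1(queries, gallery, targets)
```

In this toy setup the first query retrieves its target, while the second is closer to the distractor at index 2 than to its own target, so `score` is 0.5. This is the effect the abstract refers to: adding distractors to the gallery can only keep R@1 the same or lower it.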

Comments:	20 pages, 13 figures, 4 tables
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
MSC classes:	68T07 (Primary) 68T45 (Secondary)
ACM classes:	I.2.10
Cite as:	arXiv:2306.02928 [cs.CV]
	(or arXiv:2306.02928v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.02928

Submission history

From: Simon Lepage
[v1] Mon, 5 Jun 2023 14:45:38 UTC (7,116 KB)
[v2] Wed, 27 Mar 2024 08:21:17 UTC (7,402 KB)
[v3] Wed, 15 May 2024 12:17:48 UTC (15,353 KB)
