Segment to Recognize Robustly -- Enhancing Recognition by Image Decomposition

Janouskova, Klara; Gavrus, Cristian; Matas, Jiri

Computer Science > Computer Vision and Pattern Recognition

arXiv:2411.15933 (cs)

[Submitted on 24 Nov 2024]

Title:Segment to Recognize Robustly -- Enhancing Recognition by Image Decomposition

Authors:Klara Janouskova, Cristian Gavrus, Jiri Matas

View PDF HTML (experimental)

Abstract:In image recognition, both foreground (FG) and background (BG) play an important role; however, standard deep image recognition often leads to unintended over-reliance on the BG, limiting model robustness in real-world deployment settings. Current solutions mainly suppress the BG, sacrificing BG information for improved generalization. We propose "Segment to Recognize Robustly" (S2R^2), a novel recognition approach which decouples the FG and BG modelling and combines them in a simple, robust, and interpretable manner. S2R^2 leverages recent advances in zero-shot segmentation to isolate the FG and the BG before or during recognition. By combining FG and BG, potentially also with a standard full-image classifier, S2R^2 achieves state-of-the-art results on in-domain data while maintaining robustness to BG shifts. The results confirm that segmentation before recognition is now possible.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2411.15933 [cs.CV]
	(or arXiv:2411.15933v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2411.15933

Submission history

From: Klara Janouskova [view email]
[v1] Sun, 24 Nov 2024 17:39:39 UTC (27,830 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2024-11

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Segment to Recognize Robustly -- Enhancing Recognition by Image Decomposition

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Segment to Recognize Robustly -- Enhancing Recognition by Image Decomposition

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators