Attention Based Simple Primitives for Open World Compositional Zero-Shot Learning

Munir, Ans; Qureshi, Faisal Z.; Khan, Muhammad Haris; Ali, Mohsen

Computer Science > Computer Vision and Pattern Recognition

arXiv:2407.13715 (cs)

[Submitted on 18 Jul 2024]

Title:Attention Based Simple Primitives for Open World Compositional Zero-Shot Learning

Authors:Ans Munir, Faisal Z. Qureshi, Muhammad Haris Khan, Mohsen Ali

View PDF HTML (experimental)

Abstract:Compositional Zero-Shot Learning (CZSL) aims to predict unknown compositions made up of attribute and object pairs. Predicting compositions unseen during training is a challenging task. We are exploring Open World Compositional Zero-Shot Learning (OW-CZSL) in this study, where our test space encompasses all potential combinations of attributes and objects. Our approach involves utilizing the self-attention mechanism between attributes and objects to achieve better generalization from seen to unseen compositions. Utilizing a self-attention mechanism facilitates the model's ability to identify relationships between attribute and objects. The similarity between the self-attended textual and visual features is subsequently calculated to generate predictions during the inference phase. The potential test space may encompass implausible object-attribute combinations arising from unrestricted attribute-object pairings. To mitigate this issue, we leverage external knowledge from ConceptNet to restrict the test space to realistic compositions. Our proposed model, Attention-based Simple Primitives (ASP), demonstrates competitive performance, achieving results comparable to the state-of-the-art.

Comments:	10 pages, 6 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2407.13715 [cs.CV]
	(or arXiv:2407.13715v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.13715

Submission history

From: Ans Munir [view email]
[v1] Thu, 18 Jul 2024 17:11:29 UTC (5,257 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Attention Based Simple Primitives for Open World Compositional Zero-Shot Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Attention Based Simple Primitives for Open World Compositional Zero-Shot Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators