Exploring Task-Level Optimal Prompts for Visual In-Context Learning

Zhu, Yan; Ma, Huan; Zhang, Changqing


arXiv:2501.08841 (cs)

[Submitted on 15 Jan 2025]



Abstract: With the development of Vision Foundation Models (VFMs) in recent years, Visual In-Context Learning (VICL) has become a better choice than modifying the model in most scenarios. Unlike retraining or fine-tuning, VICL requires no changes to the model's weights or architecture; it needs only a prompt with demonstrations to teach the VFM how to solve a task. Currently, the significant computational cost of finding an optimal prompt for every test sample hinders the deployment of VICL, because determining which demonstrations to use when constructing a prompt is very costly. In this paper, however, we report a counterintuitive phenomenon: most test samples actually achieve optimal performance under the same prompt, so searching for sample-level prompts costs more time yet yields identical prompts. We therefore propose task-level prompting to reduce the cost of prompt search at inference time, and introduce two time-saving yet effective task-level prompt search strategies. Extensive experiments show that the proposed method identifies near-optimal prompts and reaches the best VICL performance at a minimal cost that prior work has not achieved.
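
The contrast between sample-level and task-level prompt selection can be illustrated with a minimal sketch. It is not the paper's method: the abstract does not describe the two proposed search strategies, so the task-level routine below is a naive exhaustive search over a small validation set, used only to show where the cost difference comes from. The names `sample_level_search`, `task_level_search`, `agreement`, and the `score` callback (a stand-in for running the VFM with a prompt and measuring quality on one sample) are all hypothetical.

```python
from typing import Callable, Hashable, List, Sequence

# Hypothetical scoring callback: quality of `prompt` on `sample` (higher is better).
# In a real VICL setup this would require one forward pass of the VFM.
Score = Callable[[Hashable, int], float]


def sample_level_search(prompts: Sequence[Hashable],
                        samples: Sequence[int],
                        score: Score) -> List[Hashable]:
    """Pick the best prompt separately for each sample.

    Cost: len(prompts) * len(samples) score calls, paid on the test samples.
    """
    return [max(prompts, key=lambda p: score(p, s)) for s in samples]


def task_level_search(prompts: Sequence[Hashable],
                      val_samples: Sequence[int],
                      score: Score) -> Hashable:
    """Pick one prompt for the whole task by mean score on a validation set.

    Cost: len(prompts) * len(val_samples) score calls, paid once per task.
    This is a naive baseline, not either strategy proposed in the paper.
    """
    def mean_score(p: Hashable) -> float:
        return sum(score(p, s) for s in val_samples) / len(val_samples)

    return max(prompts, key=mean_score)


def agreement(per_sample_best: Sequence[Hashable], task_prompt: Hashable) -> float:
    """Fraction of samples whose individually best prompt equals the task prompt."""
    return sum(p == task_prompt for p in per_sample_best) / len(per_sample_best)


# Toy data (assumed for illustration): prompt "B" is best for most samples,
# while prompt "C" wins only on every fifth sample.
BASE = {"A": 0.5, "B": 0.8, "C": 0.6}


def toy_score(prompt: Hashable, sample: int) -> float:
    bonus = 0.4 if prompt == "C" and sample % 5 == 0 else 0.0
    return BASE[prompt] + bonus


if __name__ == "__main__":
    prompts = ["A", "B", "C"]
    test_samples = list(range(10))
    per_sample = sample_level_search(prompts, test_samples, toy_score)
    task_prompt = task_level_search(prompts, test_samples[:5], toy_score)
    print(task_prompt, agreement(per_sample, task_prompt))
```

In this toy setting, a single prompt chosen on five validation samples matches the per-sample optimum for most test samples, which is the kind of agreement the abstract reports; the actual agreement rates and search strategies are given in the paper itself.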

Subjects:	Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2501.08841 [cs.AI]
	(or arXiv:2501.08841v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2501.08841

Submission history

From: Yan Zhu
[v1] Wed, 15 Jan 2025 14:52:20 UTC (2,457 KB)
