InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models

Chen, Lichang; Chen, Jiuhai; Goldstein, Tom; Huang, Heng; Zhou, Tianyi

arXiv:2306.03082 (cs)

[Submitted on 5 Jun 2023 (v1), last revised 8 Aug 2023 (this version, v2)]

Abstract: Large language models (LLMs) are instruction followers, but it can be challenging to find the best instruction for different situations, especially for black-box LLMs on which backpropagation is forbidden. Instead of directly optimizing the discrete instruction, we optimize a low-dimensional soft prompt applied to an open-source LLM to generate the instruction for the black-box LLM. On each iteration of the proposed method, which we call InstructZero, a soft prompt is converted into an instruction using the open-source LLM, which is then submitted to the black-box LLM for zero-shot evaluation, and the performance is sent to Bayesian optimization to produce new soft prompts improving the zero-shot performance. We evaluate InstructZero on different combinations of open-source LLMs and APIs including Vicuna and ChatGPT. Our results show that InstructZero outperforms SOTA auto-instruction methods across a variety of downstream tasks. Our code and data are publicly available at this https URL.
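The loop described in the abstract (soft prompt, then instruction, then zero-shot score, then Bayesian optimization) can be illustrated with a toy sketch. This is not the authors' implementation. Both LLMs are replaced by hypothetical stand-in functions (`generate_instruction`, `black_box_score`), and the Bayesian optimization step assumes a plain Gaussian process with an RBF kernel and an upper-confidence-bound acquisition, since the abstract does not specify the kernel or acquisition function.

```python
import numpy as np

# Hypothetical stand-ins. A real run would query an open-source LLM
# (e.g. Vicuna) and a black-box API (e.g. ChatGPT) instead.
INSTRUCTIONS = [
    "Repeat the input.",
    "Translate the input.",
    "Write the antonym of the input.",
    "Summarize the input.",
]

def generate_instruction(soft_prompt):
    """Toy 'open-source LLM': maps a continuous soft prompt to an instruction."""
    return INSTRUCTIONS[int(np.argmax(soft_prompt[:len(INSTRUCTIONS)]))]

def black_box_score(instruction):
    """Toy 'black-box LLM' zero-shot score in [0, 1]; no gradients available."""
    target = set("write the antonym of the input.".split())
    return len(set(instruction.lower().split()) & target) / len(target)

def rbf(a, b, length=1.0):
    """RBF kernel matrix between the rows of a and the rows of b."""
    d2 = ((a[:, None, :] - b[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / length**2)

def gp_posterior(X, y, Xq, noise=1e-4):
    """Zero-mean GP posterior mean and standard deviation at query points Xq."""
    K = rbf(X, X) + noise * np.eye(len(X))
    Ks = rbf(X, Xq)
    sol = np.linalg.solve(K, Ks)            # K^{-1} Ks
    mean = sol.T @ y
    var = 1.0 - (Ks * sol).sum(0)           # prior variance of the RBF kernel is 1
    return mean, np.sqrt(np.clip(var, 0.0, None))

def instruct_zero_sketch(dim=4, n_init=3, n_iter=10, n_candidates=256, seed=0):
    rng = np.random.default_rng(seed)
    X = rng.uniform(-1, 1, size=(n_init, dim))          # initial soft prompts
    insts = [generate_instruction(x) for x in X]
    y = np.array([black_box_score(i) for i in insts])   # zero-shot scores
    for _ in range(n_iter):
        cand = rng.uniform(-1, 1, size=(n_candidates, dim))
        mean, std = gp_posterior(X, y - y.mean(), cand)
        x_next = cand[np.argmax(mean + 2.0 * std)]      # UCB acquisition (assumed)
        inst = generate_instruction(x_next)             # soft prompt -> instruction
        X = np.vstack([X, x_next])
        insts.append(inst)
        y = np.append(y, black_box_score(inst))         # black-box evaluation
    best = int(np.argmax(y))
    return insts[best], float(y[best])

if __name__ == "__main__":
    print(instruct_zero_sketch())
```

The point of the design is that the search happens in the low-dimensional continuous space of soft prompts, while the black-box model only ever sees the discrete instruction text and returns a scalar score.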

Comments:	15 pages; 9 figures; Our code is available at this https URL
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2306.03082 [cs.AI]
	(or arXiv:2306.03082v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2306.03082

Submission history

From: Lichang Chen
[v1] Mon, 5 Jun 2023 17:55:22 UTC (5,633 KB)
[v2] Tue, 8 Aug 2023 17:33:54 UTC (5,635 KB)
