Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models

Chen, Jiaao; Pan, Xiaoman; Yu, Dian; Song, Kaiqiang; Wang, Xiaoyang; Yu, Dong; Chen, Jianshu

arXiv:2308.00304 (cs)

[Submitted on 1 Aug 2023 (v1), last revised 16 Jul 2024 (this version, v3)]

Abstract: We investigate how to elicit compositional generalization capabilities in large language models (LLMs). Compositional generalization empowers LLMs to solve complex problems by combining foundational skills, a critical reasoning ability akin to human intelligence. However, even the most advanced LLMs currently struggle with this form of reasoning. We examine this problem within the framework of in-context learning and find that demonstrating both foundational skills and compositional examples grounded in these skills within the same prompt context is crucial. We refer to this prompt structure as skills-in-context (SKiC). With as few as two exemplars, this in-context learning structure enables LLMs to tackle more challenging problems requiring innovative skill combinations, achieving near-perfect systematic generalization across a broad range of tasks. Intriguingly, SKiC also unlocks the latent potential of LLMs, allowing them to more actively utilize pre-existing internal skills acquired during earlier pretraining stages to solve complex reasoning problems. The SKiC structure is robust across different skill constructions and exemplar choices and demonstrates strong transferability to new tasks. Finally, inspired by our in-context learning study, we show that fine-tuning LLMs with SKiC-style data can elicit zero-shot weak-to-strong generalization, enabling the models to solve much harder problems directly with standard prompting.
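As a rough illustration of the prompt structure the abstract describes (foundational skills and compositional exemplars grounded in those skills, placed in the same context), the sketch below assembles such a prompt as plain text. It is not the authors' implementation: the names `Skill`, `Exemplar`, and `build_skic_prompt`, the section labels, and the toy last-letter task are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class Skill:
    # Hypothetical container for one foundational skill.
    name: str
    description: str


@dataclass
class Exemplar:
    # Hypothetical container for one compositional example.
    question: str
    solution: str  # worked answer that explicitly names the skills it uses


def build_skic_prompt(skills, exemplars, question):
    """Assemble one prompt: skills first, then exemplars that use them, then the new question."""
    parts = ["Skills:"]
    for i, skill in enumerate(skills, start=1):
        parts.append(f"Skill {i} ({skill.name}): {skill.description}")
    parts.append("")
    parts.append("Examples:")
    for ex in exemplars:
        parts.append(f"Q: {ex.question}")
        parts.append(f"A: {ex.solution}")
        parts.append("")
    # The unanswered question goes last so the model continues from "A:".
    parts.append(f"Q: {question}")
    parts.append("A:")
    return "\n".join(parts)


# Toy data, assumed for illustration only.
skills = [
    Skill("last_letter", "Return the final letter of a word."),
    Skill("concatenate", "Join letters in order into one string."),
]
exemplars = [
    Exemplar(
        "Concatenate the last letters of 'apple, kiwi'.",
        "Using last_letter: 'apple' -> 'e', 'kiwi' -> 'i'. "
        "Using concatenate: 'e' + 'i' = 'ei'. The answer is 'ei'.",
    ),
]
prompt = build_skic_prompt(
    skills, exemplars, "Concatenate the last letters of 'lemon, grape, fig'."
)
print(prompt)
```

The exemplar's solution refers to the skills by name, which is one simple way to ground a compositional example in the skills listed earlier in the same prompt; the final question is longer than the exemplar to mimic a harder composition.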

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2308.00304 [cs.CL]
	(or arXiv:2308.00304v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2308.00304

Submission history

From: Jiaao Chen
[v1] Tue, 1 Aug 2023 05:54:12 UTC (2,375 KB)
[v2] Mon, 14 Aug 2023 08:11:15 UTC (3,519 KB)
[v3] Tue, 16 Jul 2024 20:09:47 UTC (3,281 KB)
