From Few to Many: Self-Improving Many-Shot Reasoners Through Iterative Optimization and Generation

Wan, Xingchen; Zhou, Han; Sun, Ruoxi; Nakhost, Hootan; Jiang, Ke; Arık, Sercan Ö.

Computer Science > Machine Learning

arXiv:2502.00330 (cs)

[Submitted on 1 Feb 2025]

Title:From Few to Many: Self-Improving Many-Shot Reasoners Through Iterative Optimization and Generation

Authors:Xingchen Wan, Han Zhou, Ruoxi Sun, Hootan Nakhost, Ke Jiang, Sercan Ö. Arık

View PDF HTML (experimental)

Abstract:Recent advances in long-context large language models (LLMs) have led to the emerging paradigm of many-shot in-context learning (ICL), where it is observed that scaling many more demonstrating examples beyond the conventional few-shot setup in the context can lead to performance benefits. However, despite its promise, it is unclear what aspects dominate the benefits and whether simply scaling to more examples is the most effective way of improving many-shot ICL. In this work, we first provide an analysis of the factors driving many-shot ICL, and we find that 1) many-shot performance can still be attributed to often a few disproportionately influential examples and 2) identifying such influential examples ("optimize") and using them as demonstrations to regenerate new examples ("generate") can lead to further improvements. Inspired by the findings, we propose BRIDGE, an algorithm that alternates between the optimize step with Bayesian optimization to discover the influential sets of examples and the generate step to reuse this set to expand the reasoning paths of the examples back to the many-shot regime automatically. On Gemini, Claude, and Mistral LLMs of different sizes, we show that BRIDGE to significant improvements across a diverse set of tasks, including symbolic reasoning, numerical reasoning, and code generation.

Comments:	Expanded version of the ICLR 2025 paper
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2502.00330 [cs.LG]
	(or arXiv:2502.00330v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2502.00330

Submission history

From: Xingchen Wan [view email]
[v1] Sat, 1 Feb 2025 06:23:24 UTC (896 KB)

Computer Science > Machine Learning

Title:From Few to Many: Self-Improving Many-Shot Reasoners Through Iterative Optimization and Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:From Few to Many: Self-Improving Many-Shot Reasoners Through Iterative Optimization and Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators