Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors

Nie, Fan; Feng, Lan; Ye, Haotian; Liang, Weixin; Lu, Pan; Yao, Huaxiu; Alahi, Alexandre; Zou, James

Abstract:Efficiently leveraging of the capabilities of contemporary large language models (LLMs) is increasingly challenging, particularly when direct fine-tuning is expensive and often impractical. Existing training-free methods, including manually or automated designed workflows, typically demand substantial human effort or yield suboptimal results. This paper proposes Weak-for-Strong Harnessing (W4S), a novel framework that customizes smaller, cost-efficient language models to design and optimize workflows for harnessing stronger models. W4S formulates workflow design as a multi-turn markov decision process and introduces reinforcement learning for agentic workflow optimization (RLAO) to train a weak meta-agent. Through iterative interaction with the environment, the meta-agent learns to design increasingly effective workflows without manual intervention. Empirical results demonstrate the superiority of W4S that our 7B meta-agent, trained with just one GPU hour, outperforms the strongest baseline by 2.9% ~ 24.6% across eleven benchmarks, successfully elevating the performance of state-of-the-art models such as GPT-3.5-Turbo and GPT-4o. Notably, W4S exhibits strong generalization capabilities across both seen and unseen tasks, offering an efficient, high-performing alternative to directly fine-tuning strong models.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2504.04785 [cs.AI]
	(or arXiv:2504.04785v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2504.04785

Computer Science > Artificial Intelligence

Title:Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators