AgentRM: Enhancing Agent Generalization with Reward Modeling

Xia, Yu; Fan, Jingru; Chen, Weize; Yan, Siyu; Cong, Xin; Zhang, Zhong; Lu, Yaxi; Lin, Yankai; Liu, Zhiyuan; Sun, Maosong

Computer Science > Computation and Language

arXiv:2502.18407 (cs)

[Submitted on 25 Feb 2025]

Title:AgentRM: Enhancing Agent Generalization with Reward Modeling

Authors:Yu Xia, Jingru Fan, Weize Chen, Siyu Yan, Xin Cong, Zhong Zhang, Yaxi Lu, Yankai Lin, Zhiyuan Liu, Maosong Sun

View PDF HTML (experimental)

Abstract:Existing LLM-based agents have achieved strong performance on held-in tasks, but their generalizability to unseen tasks remains poor. Hence, some recent work focus on fine-tuning the policy model with more diverse tasks to improve the generalizability. In this work, we find that finetuning a reward model to guide the policy model is more robust than directly finetuning the policy model. Based on this finding, we propose AgentRM, a generalizable reward model, to guide the policy model for effective test-time search. We comprehensively investigate three approaches to construct the reward model, including explicit reward modeling, implicit reward modeling and LLM-as-a-judge. We then use AgentRM to guide the answer generation with Best-of-N sampling and step-level beam search. On four types of nine agent tasks, AgentRM enhances the base policy model by $8.8$ points on average, surpassing the top general agent by $4.0$. Moreover, it demonstrates weak-to-strong generalization, yielding greater improvement of $12.6$ on LLaMA-3-70B policy model. As for the specializability, AgentRM can also boost a finetuned policy model and outperform the top specialized agent by $11.4$ on three held-in tasks. Further analysis verifies its effectiveness in test-time scaling. Codes will be released to facilitate the research in this area.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2502.18407 [cs.CL]
	(or arXiv:2502.18407v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.18407

Submission history

From: Yu Xia [view email]
[v1] Tue, 25 Feb 2025 17:58:02 UTC (288 KB)

Computer Science > Computation and Language

Title:AgentRM: Enhancing Agent Generalization with Reward Modeling

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:AgentRM: Enhancing Agent Generalization with Reward Modeling

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators