TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use

Ye, Junjie; Wu, Yilong; Li, Sixian; Yang, Yuming; Gui, Tao; Zhang, Qi; Huang, Xuanjing; Wang, Peng; Shi, Zhongchao; Fan, Jianping; Du, Zhengyin


arXiv:2412.15495 (cs)

[Submitted on 20 Dec 2024]


Abstract: Large language models (LLMs) achieve remarkable advancements by leveraging tools to interact with external environments, a critical step toward generalized AI. However, the standard supervised fine-tuning (SFT) approach, which relies on large-scale datasets, often overlooks task-specific characteristics in tool use, leading to performance bottlenecks. To address this issue, we analyze three existing LLMs and uncover key insights: training data can inadvertently impede tool-use behavior, token importance is distributed unevenly, and errors in tool calls fall into a small set of distinct categories. Building on these findings, we propose TL-Training, a task-feature-based framework that mitigates the effects of suboptimal training data, dynamically adjusts token weights to prioritize key tokens during SFT, and incorporates a robust reward mechanism tailored to error categories, optimized through proximal policy optimization. We validate TL-Training by training CodeLLaMA-2-7B and evaluating it on four diverse open-source test sets. Our results demonstrate that the LLM trained by our method matches or surpasses both open- and closed-source LLMs in tool-use performance using only 1,217 training data points. Additionally, our method enhances robustness in noisy environments and improves general task performance, offering a scalable and efficient paradigm for tool-use training in LLMs. The code and data are available at this https URL.
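To make two of the ideas in the abstract concrete, the following is a minimal sketch of (a) a token-weighted SFT loss and (b) a reward keyed to error categories. It is not the authors' implementation: the abstract does not specify how token weights are computed or which error categories are used, so the function names, the category names in `ERROR_PENALTIES`, and the penalty values are all hypothetical and chosen only for illustration.

```python
from typing import Dict, Sequence

def weighted_token_nll(log_probs: Sequence[float], weights: Sequence[float]) -> float:
    """Weighted mean negative log-likelihood over the target tokens.

    log_probs[i] is the model's log-probability of the i-th target token;
    weights[i] is that token's importance (assumed to be given here).
    """
    if len(log_probs) != len(weights):
        raise ValueError("log_probs and weights must have the same length")
    total_weight = sum(weights)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    weighted_sum = sum(w * lp for w, lp in zip(weights, log_probs))
    return -weighted_sum / total_weight

# Hypothetical error categories and penalties (assumptions, not from the paper).
ERROR_PENALTIES: Dict[str, float] = {
    "wrong_tool": -1.0,
    "bad_arguments": -0.5,
    "format_error": -0.75,
}

def tool_call_reward(errors: Sequence[str]) -> float:
    """Return 1.0 for an error-free tool call; otherwise the summed
    penalties of the detected error categories, floored at -1.0."""
    if not errors:
        return 1.0
    penalty = sum(ERROR_PENALTIES[name] for name in errors)
    return max(-1.0, penalty)
```

With uniform weights the loss reduces to the ordinary mean token-level negative log-likelihood; raising the weight of a token (for example, one that forms a tool name) makes its prediction error count for more. In a PPO setup, a scalar such as the one returned by `tool_call_reward` would serve as the reward for a generated tool call.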

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2412.15495 [cs.CL]
	(or arXiv:2412.15495v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.15495

Submission history

From: Junjie Ye
[v1] Fri, 20 Dec 2024 02:21:36 UTC (316 KB)
