From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning

Li, Yafu; Wang, Zhilin; Fu, Tingchen; Cui, Ganqu; Yang, Sen; Cheng, Yu

Computer Science > Computation and Language

arXiv:2501.11877 (cs)

[Submitted on 21 Jan 2025]

Title:From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning

Authors:Yafu Li, Zhilin Wang, Tingchen Fu, Ganqu Cui, Sen Yang, Yu Cheng

View PDF HTML (experimental)

Abstract:Scaling data and model size has been proven effective for boosting the performance of large language models. In addition to training-time scaling, recent studies have revealed that increasing test-time computational resources can further improve performance. In this work, we introduce Aggregation Fine-Tuning (AFT), a supervised finetuning paradigm where the model learns to synthesize multiple draft responses, referred to as proposals, into a single, refined answer, termed aggregation. At inference time, a propose-and-aggregate strategy further boosts performance by iteratively generating proposals and aggregating them. Empirical evaluations on benchmark datasets show that AFT-trained models substantially outperform standard SFT. Notably, an AFT model, fine-tuned from Llama3.1-8B-Base with only 64k data, achieves a 41.3% LC win rate on AlpacaEval 2, surpassing significantly larger LLMs such as Llama3.1-405B-Instruct and GPT4. By combining sequential refinement and parallel sampling, the propose-and-aggregate framework scales inference-time computation in a flexible manner. Overall, These findings position AFT as a promising approach to unlocking additional capabilities of LLMs without resorting to increasing data volume or model size.

Comments:	20 pages; work in progress
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2501.11877 [cs.CL]
	(or arXiv:2501.11877v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2501.11877

Submission history

From: Yafu Li [view email]
[v1] Tue, 21 Jan 2025 04:11:59 UTC (1,401 KB)

Computer Science > Computation and Language

Title:From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators