Learning Dynamics of LLM Finetuning

Ren, Yi; Sutherland, Danica J.

Computer Science > Machine Learning

arXiv:2407.10490 (cs)

[Submitted on 15 Jul 2024]

Title:Learning Dynamics of LLM Finetuning

Authors:Yi Ren, Danica J. Sutherland

View PDF

Abstract:Learning dynamics, which describes how the learning of specific training examples influences the model's prediction of other examples, give us a powerful tool for understanding the behavior of deep learning systems. We study the learning dynamics of large language models during finetuning, by analyzing the step-wise decomposition and accumulated influence among different responses. Our framework allows a uniform interpretation of many interesting observations about the training of popular algorithms for both instruction tuning and preference tuning. The analysis not only explains where the benefits of these methods come from but also inspires a simple, effective method to further improve the alignment performance. Code for experiments is available at this https URL.

Comments:	32 pages
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2407.10490 [cs.LG]
	(or arXiv:2407.10490v1 [cs.LG] for this version)

Submission history

From: Yi Ren [view email]
[v1] Mon, 15 Jul 2024 07:30:28 UTC (4,295 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2024-07

Change to browse by:

cs
cs.AI
cs.CL

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:Learning Dynamics of LLM Finetuning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning Dynamics of LLM Finetuning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators