Knowledge Injection via Prompt Distillation

Kujanpää, Kalle; Valpola, Harri; Ilin, Alexander

Computer Science > Computation and Language

arXiv:2412.14964 (cs)

[Submitted on 19 Dec 2024]

Title:Knowledge Injection via Prompt Distillation

Authors:Kalle Kujanpää, Harri Valpola, Alexander Ilin

View PDF HTML (experimental)

Abstract:In many practical applications, large language models (LLMs) need to incorporate new knowledge not present in their pre-training data. The primary methods for this are fine-tuning and retrieval-augmented generation (RAG). Although RAG has emerged as the industry standard for knowledge injection, fine-tuning has not yet achieved comparable success. In this paper, we propose a new fine-tuning technique for learning new knowledge and show that it can reach the performance of RAG. The proposed method is based on the self-distillation approach, which we call prompt distillation. First, we generate question-answer pairs about the new knowledge. Then, we fine-tune a student model on the question-answer pairs to imitate the output distributions of a teacher model, which additionally receives the new knowledge in its prompt. The student model is identical to the teacher, except it is equipped with a LoRA adapter. This training procedure facilitates distilling the new knowledge from the teacher's prompt into the student's weights.

Comments:	Preprint
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2412.14964 [cs.CL]
	(or arXiv:2412.14964v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.14964

Submission history

From: Kalle Kujanpää [view email]
[v1] Thu, 19 Dec 2024 15:44:01 UTC (78 KB)

Computer Science > Computation and Language

Title:Knowledge Injection via Prompt Distillation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Knowledge Injection via Prompt Distillation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators