Think or Remember? Detecting and Directing LLMs Towards Memorization or Generalization

Fu, Yi-Fu; Tu, Yu-Chieh; Cheng, Tzu-Ling; Lin, Cheng-Yu; Yang, Yi-Ting; Liu, Heng-Yi; Liao, Keng-Te; Juan, Da-Cheng; Lin, Shou-De

Computer Science > Computation and Language

arXiv:2412.18497 (cs)

[Submitted on 24 Dec 2024]

Title:Think or Remember? Detecting and Directing LLMs Towards Memorization or Generalization

Authors:Yi-Fu Fu, Yu-Chieh Tu, Tzu-Ling Cheng, Cheng-Yu Lin, Yi-Ting Yang, Heng-Yi Liu, Keng-Te Liao, Da-Cheng Juan, Shou-De Lin

View PDF HTML (experimental)

Abstract:In this paper, we explore the foundational mechanisms of memorization and generalization in Large Language Models (LLMs), inspired by the functional specialization observed in the human brain. Our investigation serves as a case study leveraging specially designed datasets and experimental-scale LLMs to lay the groundwork for understanding these behaviors. Specifically, we aim to first enable LLMs to exhibit both memorization and generalization by training with the designed dataset, then (a) examine whether LLMs exhibit neuron-level spatial differentiation for memorization and generalization, (b) predict these behaviors using model internal representations, and (c) steer the behaviors through inference-time interventions. Our findings reveal that neuron-wise differentiation of memorization and generalization is observable in LLMs, and targeted interventions can successfully direct their behavior.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2412.18497 [cs.CL]
	(or arXiv:2412.18497v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.18497

Submission history

From: YiFu Fu [view email]
[v1] Tue, 24 Dec 2024 15:28:56 UTC (4,461 KB)

Computer Science > Computation and Language

Title:Think or Remember? Detecting and Directing LLMs Towards Memorization or Generalization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Think or Remember? Detecting and Directing LLMs Towards Memorization or Generalization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators