Unveiling LLMs: The Evolution of Latent Representations in a Dynamic Knowledge Graph

Bronzini, Marco; Nicolini, Carlo; Lepri, Bruno; Staiano, Jacopo; Passerini, Andrea

arXiv:2404.03623 (cs)

[Submitted on 4 Apr 2024 (v1), last revised 6 Aug 2024 (this version, v2)]

Abstract: Large Language Models (LLMs) demonstrate an impressive capacity to recall a vast range of factual knowledge. However, understanding their underlying reasoning and internal mechanisms in exploiting this knowledge remains a key research area. This work unveils the factual information an LLM represents internally for sentence-level claim verification. We propose an end-to-end framework to decode factual knowledge embedded in token representations from a vector space to a set of ground predicates, showing its layer-wise evolution using a dynamic knowledge graph. Our framework employs activation patching, a vector-level technique that alters a token representation during inference, to extract encoded knowledge. Accordingly, we rely on neither training nor external models. Using factual and common-sense claims from two claim verification datasets, we showcase interpretability analyses at local and global levels. The local analysis highlights entity centrality in LLM reasoning, from claim-related information and multi-hop reasoning to representation errors causing erroneous evaluation. The global analysis, on the other hand, reveals trends in the underlying evolution, such as word-based knowledge evolving into claim-related facts. By interpreting semantics from LLM latent representations and enabling graph-related analyses, this work enhances the understanding of the factual knowledge resolution process.
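
The abstract describes activation patching as altering a token representation during inference. As a rough illustration of that general idea only, the toy sketch below records one token's hidden state from a source run and overwrites a token's hidden state at the same layer in a target run. It is not the authors' framework: the tiny random "model", the mean-mixing layer, and the names `make_layers`, `apply_layer`, and `forward` are all hypothetical and chosen for illustration.

```python
import math
import random

def make_layers(n_layers, dim, seed=0):
    # Hypothetical toy model: one random dim x dim weight matrix per layer.
    rng = random.Random(seed)
    return [[[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(dim)]
            for _ in range(n_layers)]

def apply_layer(weights, states):
    # Mix each token with the sequence mean (a crude stand-in for attention),
    # then apply a linear map followed by tanh.
    dim, n = len(weights), len(states)
    mean = [sum(s[k] for s in states) / n for k in range(dim)]
    out = []
    for s in states:
        mixed = [0.5 * s[k] + 0.5 * mean[k] for k in range(dim)]
        out.append([math.tanh(sum(w * x for w, x in zip(row, mixed)))
                    for row in weights])
    return out

def forward(layers, tokens, patch=None):
    # patch = (layer_index, token_index, vector): after the given layer,
    # overwrite that token's hidden state with the supplied vector.
    states = [list(t) for t in tokens]
    trace = []  # hidden states after every layer
    for i, weights in enumerate(layers):
        states = apply_layer(weights, states)
        if patch is not None and patch[0] == i:
            states[patch[1]] = list(patch[2])
        trace.append([list(s) for s in states])
    return trace

layers = make_layers(n_layers=3, dim=4)
source = [[1, 0, 0, 0], [0, 1, 0, 0]]                 # run we read from
target = [[0, 0, 1, 0], [0, 0, 0, 1], [1, 1, 0, 0]]   # run we write into

src_trace = forward(layers, source)
vec = src_trace[1][0]  # source token 0, hidden state after layer 1

clean = forward(layers, target)
patched = forward(layers, target, patch=(1, 2, vec))
```

Layers before the patch are unaffected (`patched[0]` equals `clean[0]`), while everything computed after layer 1 now depends on the injected vector. In a real LLM, the same read-then-overwrite step would typically be done with hooks on the model's hidden states rather than a hand-written loop.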

Comments:	Accepted at COLM 2024
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
Cite as:	arXiv:2404.03623 [cs.CL]
	(or arXiv:2404.03623v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2404.03623

Submission history

From: Marco Bronzini
[v1] Thu, 4 Apr 2024 17:45:59 UTC (5,378 KB)
[v2] Tue, 6 Aug 2024 15:02:33 UTC (6,657 KB)
