Large Language Models as Reliable Knowledge Bases?

Zheng, Danna; Lapata, Mirella; Pan, Jeff Z.

Computer Science > Computation and Language

arXiv:2407.13578 (cs)

[Submitted on 18 Jul 2024]

Title:Large Language Models as Reliable Knowledge Bases?

Authors:Danna Zheng, Mirella Lapata, Jeff Z. Pan

View PDF HTML (experimental)

Abstract:The NLP community has recently shown a growing interest in leveraging Large Language Models (LLMs) for knowledge-intensive tasks, viewing LLMs as potential knowledge bases (KBs). However, the reliability and extent to which LLMs can function as KBs remain underexplored. While previous studies suggest LLMs can encode knowledge within their parameters, the amount of parametric knowledge alone is not sufficient to evaluate their effectiveness as KBs. This study defines criteria that a reliable LLM-as-KB should meet, focusing on factuality and consistency, and covering both seen and unseen knowledge. We develop several metrics based on these criteria and use them to evaluate 26 popular LLMs, while providing a comprehensive analysis of the effects of model size, instruction tuning, and in-context learning (ICL). Our results paint a worrying picture. Even a high-performant model like GPT-3.5-turbo is not factual or consistent, and strategies like ICL and fine-tuning are unsuccessful at making LLMs better KBs.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2407.13578 [cs.CL]
	(or arXiv:2407.13578v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.13578

Submission history

From: Danna Zheng [view email]
[v1] Thu, 18 Jul 2024 15:20:18 UTC (41,695 KB)

Computer Science > Computation and Language

Title:Large Language Models as Reliable Knowledge Bases?

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Large Language Models as Reliable Knowledge Bases?

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators