Intrinsic Barriers to Explaining Deep Foundation Models

Tan, Zhen; Liu, Huan

Computer Science > Computers and Society

arXiv:2504.16948 (cs)

[Submitted on 21 Apr 2025]

Title:Intrinsic Barriers to Explaining Deep Foundation Models

Authors:Zhen Tan, Huan Liu

View PDF HTML (experimental)

Abstract:Deep Foundation Models (DFMs) offer unprecedented capabilities but their increasing complexity presents profound challenges to understanding their internal workings-a critical need for ensuring trust, safety, and accountability. As we grapple with explaining these systems, a fundamental question emerges: Are the difficulties we face merely temporary hurdles, awaiting more sophisticated analytical techniques, or do they stem from \emph{intrinsic barriers} deeply rooted in the nature of these large-scale models themselves? This paper delves into this critical question by examining the fundamental characteristics of DFMs and scrutinizing the limitations encountered by current explainability methods when confronted with this inherent challenge. We probe the feasibility of achieving satisfactory explanations and consider the implications for how we must approach the verification and governance of these powerful technologies.

Subjects:	Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET)
Cite as:	arXiv:2504.16948 [cs.CY]
	(or arXiv:2504.16948v1 [cs.CY] for this version)
	https://doi.org/10.48550/arXiv.2504.16948

Submission history

From: Zhen Tan [view email]
[v1] Mon, 21 Apr 2025 21:19:23 UTC (641 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CY

< prev | next >

new | recent | 2025-04

Change to browse by:

cs
cs.AI
cs.ET

References & Citations

export BibTeX citation

Computer Science > Computers and Society

Title:Intrinsic Barriers to Explaining Deep Foundation Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computers and Society

Title:Intrinsic Barriers to Explaining Deep Foundation Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators