Designing explainable artificial intelligence with active inference: A framework for transparent introspection and decision-making

Albarracin, Mahault; Hipólito, Inês; Tremblay, Safae Essafi; Fox, Jason G.; René, Gabriel; Friston, Karl; Ramstead, Maxwell J. D.

Computer Science > Artificial Intelligence

arXiv:2306.04025 (cs)

[Submitted on 6 Jun 2023]

Title:Designing explainable artificial intelligence with active inference: A framework for transparent introspection and decision-making

Authors:Mahault Albarracin, Inês Hipólito, Safae Essafi Tremblay, Jason G. Fox, Gabriel René, Karl Friston, Maxwell J. D. Ramstead

View PDF

Abstract:This paper investigates the prospect of developing human-interpretable, explainable artificial intelligence (AI) systems based on active inference and the free energy principle. We first provide a brief overview of active inference, and in particular, of how it applies to the modeling of decision-making, introspection, as well as the generation of overt and covert actions. We then discuss how active inference can be leveraged to design explainable AI systems, namely, by allowing us to model core features of ``introspective'' processes and by generating useful, human-interpretable models of the processes involved in decision-making. We propose an architecture for explainable AI systems using active inference. This architecture foregrounds the role of an explicit hierarchical generative model, the operation of which enables the AI system to track and explain the factors that contribute to its own decisions, and whose structure is designed to be interpretable and auditable by human users. We outline how this architecture can integrate diverse sources of information to make informed decisions in an auditable manner, mimicking or reproducing aspects of human-like consciousness and introspection. Finally, we discuss the implications of our findings for future research in AI, and the potential ethical considerations of developing AI systems with (the appearance of) introspective capabilities.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2306.04025 [cs.AI]
	(or arXiv:2306.04025v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2306.04025

Submission history

From: Mahault Albarracin Mx [view email]
[v1] Tue, 6 Jun 2023 21:38:09 UTC (510 KB)

Computer Science > Artificial Intelligence

Title:Designing explainable artificial intelligence with active inference: A framework for transparent introspection and decision-making

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Designing explainable artificial intelligence with active inference: A framework for transparent introspection and decision-making

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators