Free Energy Risk Metrics for Systemically Safe AI: Gatekeeping Multi-Agent Study

Walters, Michael; Kaufmann, Rafael; Sefas, Justice; Kopinski, Thomas

Computer Science > Artificial Intelligence

arXiv:2502.04249 (cs)

[Submitted on 6 Feb 2025]

Title:Free Energy Risk Metrics for Systemically Safe AI: Gatekeeping Multi-Agent Study

Authors:Michael Walters, Rafael Kaufmann, Justice Sefas, Thomas Kopinski

View PDF HTML (experimental)

Abstract:We investigate the Free Energy Principle as a foundation for measuring risk in agentic and multi-agent systems. From these principles we introduce a Cumulative Risk Exposure metric that is flexible to differing contexts and needs. We contrast this to other popular theories for safe AI that hinge on massive amounts of data or describing arbitrarily complex world models. In our framework, stakeholders need only specify their preferences over system outcomes, providing straightforward and transparent decision rules for risk governance and mitigation. This framework naturally accounts for uncertainty in both world model and preference model, allowing for decision-making that is epistemically and axiologically humble, parsimonious, and future-proof. We demonstrate this novel approach in a simplified autonomous vehicle environment with multi-agent vehicles whose driving policies are mediated by gatekeepers that evaluate, in an online fashion, the risk to the collective safety in their neighborhood, and intervene through each vehicle's policy when appropriate. We show that the introduction of gatekeepers in an AV fleet, even at low penetration, can generate significant positive externalities in terms of increased system safety.

Comments:	9 pages, 1 figure
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA); Data Analysis, Statistics and Probability (physics.data-an); Machine Learning (stat.ML)
Cite as:	arXiv:2502.04249 [cs.AI]
	(or arXiv:2502.04249v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2502.04249

Submission history

From: Michael Walters [view email]
[v1] Thu, 6 Feb 2025 17:38:45 UTC (112 KB)

Computer Science > Artificial Intelligence

Title:Free Energy Risk Metrics for Systemically Safe AI: Gatekeeping Multi-Agent Study

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Free Energy Risk Metrics for Systemically Safe AI: Gatekeeping Multi-Agent Study

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators