A Three-Branch Checks-and-Balances Framework for Context-Aware Ethical Alignment of Large Language Models

Chang, Edward Y.

arXiv:2502.00136 (cs)

[Submitted on 31 Jan 2025]

Abstract: This paper introduces a three-branch checks-and-balances framework for the ethical alignment of Large Language Models (LLMs), inspired by governmental systems. It implements three independent yet interacting components: LLMs as the executive branch for knowledge generation, DIKE as the legislative branch that establishes ethical guardrails, and ERIS as the judicial branch for contextual interpretation. The adversarial DIKE-ERIS duality enables adaptation to diverse cultural contexts while upholding consistent ethical principles. This architecture addresses limitations of reinforcement learning from human feedback (RLHF) by providing interpretable, adaptable, and culturally aware ethical reasoning. Through self-supervised learning and adversarial testing, our framework demonstrates how emotional modeling can guide linguistic behaviors toward ethical outcomes while preserving independence across knowledge generation, ethical oversight, and contextual interpretation.
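
The separation of roles described in the abstract can be pictured with a small Python sketch. It is only an illustration of the three-branch data flow, not the paper's implementation: the function names `executive`, `dike`, `eris`, and `respond`, the `Verdict` type, and the keyword-matching logic are all hypothetical stand-ins. In particular, the abstract describes emotional modeling and self-supervised learning, which are replaced here by simple term lists so that the example stays self-contained.

```python
from dataclasses import dataclass, field


@dataclass
class Verdict:
    """Outcome of an ethical review (hypothetical type, not from the paper)."""
    allowed: bool
    flagged: list[str] = field(default_factory=list)


def executive(prompt: str) -> str:
    """Executive stand-in: generates content and applies no ethical filtering."""
    return f"Draft answer about {prompt}"


def dike(text: str, guardrails: set[str]) -> Verdict:
    """Legislative stand-in: flags any guardrail term found in the text."""
    flagged = sorted(term for term in guardrails if term in text.lower())
    return Verdict(allowed=not flagged, flagged=flagged)


def eris(verdict: Verdict, context: str,
         exemptions: dict[str, set[str]]) -> Verdict:
    """Judicial stand-in: challenges the DIKE verdict using the context."""
    permitted = exemptions.get(context, set())
    remaining = [term for term in verdict.flagged if term not in permitted]
    return Verdict(allowed=not remaining, flagged=remaining)


def respond(prompt: str, context: str, guardrails: set[str],
            exemptions: dict[str, set[str]]) -> str:
    """Runs the three branches in sequence; each sees only its own inputs."""
    draft = executive(prompt)
    final = eris(dike(draft, guardrails), context, exemptions)
    return draft if final.allowed else f"[withheld: {', '.join(final.flagged)}]"


# Assumed example data: one guardrail term and one context that permits it.
guardrails = {"poison"}
exemptions = {"toxicology course": {"poison"}}
print(respond("poison dosage", "toxicology course", guardrails, exemptions))
print(respond("poison dosage", "casual chat", guardrails, exemptions))
```

In this sketch the same draft is released in one context and withheld in another, while the guardrail list itself never changes. That mirrors, in a very reduced form, the abstract's claim that the DIKE-ERIS duality adapts to context while keeping the underlying principles consistent.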

Comments:	17 pages, 6 tables, 6 figures. arXiv admin note: substantial text overlap with arXiv:2405.07076
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
ACM classes:	F.2.2
Cite as:	arXiv:2502.00136 [cs.CL]
	(or arXiv:2502.00136v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.00136

Submission history

From: Edward Chang
[v1] Fri, 31 Jan 2025 19:41:28 UTC (5,373 KB)
