Policy-as-Prompt: Rethinking Content Moderation in the Age of Large Language Models

Palla, Konstantina; García, José Luis Redondo; Hauff, Claudia; Fabbri, Francesco; Lindström, Henrik; Taber, Daniel R.; Damianou, Andreas; Lalmas, Mounia

arXiv:2502.18695 (cs)

[Submitted on 25 Feb 2025]

Abstract: Content moderation plays a critical role in shaping safe and inclusive online environments, balancing platform standards, user expectations, and regulatory frameworks. Traditionally, this process involves operationalising policies into guidelines, which are then used by downstream human moderators for enforcement, or to further annotate datasets for training machine learning moderation models. However, recent advancements in large language models (LLMs) are transforming this landscape. These models can now interpret policies directly as textual inputs, eliminating the need for extensive data curation. This approach offers unprecedented flexibility, as moderation can be dynamically adjusted through natural language interactions. This paradigm shift raises important questions about how policies are operationalised and the implications for content moderation practices. In this paper, we formalise the emerging policy-as-prompt framework and identify five key challenges across four domains: Technical Implementation (1. translating policy to prompts, 2. sensitivity to prompt structure and formatting), Sociotechnical (3. the risk of technological determinism in policy formation), Organisational (4. evolving roles between policy and machine learning teams), and Governance (5. model governance and accountability). Through analysing these challenges across technical, sociotechnical, organisational, and governance dimensions, we discuss potential mitigation approaches. This research provides actionable insights for practitioners and lays the groundwork for future exploration of scalable and adaptive content moderation systems in digital ecosystems.

Comments:	14 pages, 5 figures
Subjects:	Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
Cite as:	arXiv:2502.18695 [cs.CY]
	(or arXiv:2502.18695v1 [cs.CY] for this version)
	https://doi.org/10.48550/arXiv.2502.18695

Submission history

From: Konstantina Palla
[v1] Tue, 25 Feb 2025 23:15:16 UTC (2,362 KB)
