SKDU at De-Factify 4.0: Natural Language Features for AI-Generated Text-Detection

Malviya, Shrikant; Arnau-González, Pablo; Arevalillo-Herráez, Miguel; Katsigiannis, Stamos

Computer Science > Computation and Language

arXiv:2503.22338 (cs)

[Submitted on 28 Mar 2025]

Title:SKDU at De-Factify 4.0: Natural Language Features for AI-Generated Text-Detection

Authors:Shrikant Malviya, Pablo Arnau-González, Miguel Arevalillo-Herráez, Stamos Katsigiannis

View PDF HTML (experimental)

Abstract:The rapid advancement of large language models (LLMs) has introduced new challenges in distinguishing human-written text from AI-generated content. In this work, we explored a pipelined approach for AI-generated text detection that includes a feature extraction step (i.e. prompt-based rewriting features inspired by RAIDAR and content-based features derived from the NELA toolkit) followed by a classification module. Comprehensive experiments were conducted on the Defactify4.0 dataset, evaluating two tasks: binary classification to differentiate human-written and AI-generated text, and multi-class classification to identify the specific generative model used to generate the input text. Our findings reveal that NELA features significantly outperform RAIDAR features in both tasks, demonstrating their ability to capture nuanced linguistic, stylistic, and content-based differences. Combining RAIDAR and NELA features provided minimal improvement, highlighting the redundancy introduced by less discriminative features. Among the classifiers tested, XGBoost emerged as the most effective, leveraging the rich feature sets to achieve high accuracy and generalisation.

Comments:	De-Factify 4.0 Workshop at the 39th AAAI Conference on Artificial Intelligence (AAAI 2025)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2503.22338 [cs.CL]
	(or arXiv:2503.22338v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2503.22338

Submission history

From: Stamos Katsigiannis [view email]
[v1] Fri, 28 Mar 2025 11:25:05 UTC (462 KB)

Computer Science > Computation and Language

Title:SKDU at De-Factify 4.0: Natural Language Features for AI-Generated Text-Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:SKDU at De-Factify 4.0: Natural Language Features for AI-Generated Text-Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators