Is My Text in Your AI Model? Gradient-based Membership Inference Test applied to LLMs

Mancera, Gonzalo; DeAlcala, Daniel; Fierrez, Julian; Tolosana, Ruben; Morales, Aythami

Computer Science > Computation and Language

arXiv:2503.07384 (cs)

[Submitted on 10 Mar 2025 (v1), last revised 13 Mar 2025 (this version, v2)]

Title:Is My Text in Your AI Model? Gradient-based Membership Inference Test applied to LLMs

Authors:Gonzalo Mancera, Daniel DeAlcala, Julian Fierrez, Ruben Tolosana, Aythami Morales

View PDF HTML (experimental)

Abstract:This work adapts and studies the gradient-based Membership Inference Test (gMINT) to the classification of text based on LLMs. MINT is a general approach intended to determine if given data was used for training machine learning models, and this work focuses on its application to the domain of Natural Language Processing. Using gradient-based analysis, the MINT model identifies whether particular data samples were included during the language model training phase, addressing growing concerns about data privacy in machine learning. The method was evaluated in seven Transformer-based models and six datasets comprising over 2.5 million sentences, focusing on text classification tasks. Experimental results demonstrate MINTs robustness, achieving AUC scores between 85% and 99%, depending on data size and model architecture. These findings highlight MINTs potential as a scalable and reliable tool for auditing machine learning models, ensuring transparency, safeguarding sensitive data, and fostering ethical compliance in the deployment of AI/NLP technologies.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2503.07384 [cs.CL]
	(or arXiv:2503.07384v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2503.07384

Submission history

From: Gonzalo Mancera [view email]
[v1] Mon, 10 Mar 2025 14:32:56 UTC (695 KB)
[v2] Thu, 13 Mar 2025 12:37:37 UTC (695 KB)

Computer Science > Computation and Language

Title:Is My Text in Your AI Model? Gradient-based Membership Inference Test applied to LLMs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Is My Text in Your AI Model? Gradient-based Membership Inference Test applied to LLMs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators