Efficient and Private: Memorisation under differentially private parameter-efficient fine-tuning in language models

Ma, Olivia; Passerat-Palmbach, Jonathan; Usynin, Dmitrii

Computer Science > Machine Learning

arXiv:2411.15831 (cs)

[Submitted on 24 Nov 2024]

Title:Efficient and Private: Memorisation under differentially private parameter-efficient fine-tuning in language models

Authors:Olivia Ma, Jonathan Passerat-Palmbach, Dmitrii Usynin

View PDF HTML (experimental)

Abstract:Fine-tuning large language models (LLMs) for specific tasks introduces privacy risks, as models may inadvertently memorise and leak sensitive training data. While Differential Privacy (DP) offers a solution to mitigate these risks, it introduces significant computational and performance trade-offs, particularly with standard fine-tuning approaches. Previous work has primarily focused on full-parameter updates, which are computationally intensive and may not fully leverage DPs potential in large models. In this work, we address these shortcomings by investigating Parameter-Efficient Fine-Tuning (PEFT) methods under DP constraints. We show that PEFT methods achieve comparable performance to standard fine-tuning while requiring fewer parameters and significantly reducing privacy leakage. Furthermore, we incorporate a data poisoning experiment involving intentional mislabelling to assess model memorisation and directly measure privacy risks. Our findings indicate that PEFT methods not only provide a promising alternative but also serve as a complementary approach for privacy-preserving, resource-efficient fine-tuning of LLMs.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2411.15831 [cs.LG]
	(or arXiv:2411.15831v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2411.15831

Submission history

From: Dmitrii Usynin [view email]
[v1] Sun, 24 Nov 2024 13:17:36 UTC (1,096 KB)

Computer Science > Machine Learning

Title:Efficient and Private: Memorisation under differentially private parameter-efficient fine-tuning in language models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Efficient and Private: Memorisation under differentially private parameter-efficient fine-tuning in language models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators