Mitigating Unintended Memorization in Language Models via Alternating Teaching

Liu, Zhe; Zhang, Xuedong; Peng, Fuchun

arXiv:2210.06772 (cs)

[Submitted on 13 Oct 2022]

Abstract: Recent research has shown that language models tend to memorize rare or unique sequences in their training corpora, which can leak sensitive attributes of user data. We employ a teacher-student framework and propose a novel approach, called alternating teaching, to mitigate unintended memorization in sequential modeling. In our method, multiple teachers are trained on disjoint training sets whose privacy one wishes to protect, and the teachers' predictions supervise the training of a student model in an alternating manner at each time step. Experiments on LibriSpeech datasets show that the proposed method achieves better privacy-preserving results than its counterparts. Compared with training that takes no measures against unintended memorization, the overall utility loss is small when training records are sufficient.
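
The alternating supervision described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how teachers are assigned to time steps or which loss is used, so the round-robin rule (`step % num_teachers`), the cross-entropy objective, and all names (`pick_teacher`, `alternating_targets`, `distillation_loss`) are assumptions made here for illustration only. Teacher outputs are given as toy probability lists rather than produced by trained models.

```python
import math
from typing import List, Sequence

Distribution = Sequence[float]  # probabilities over a toy vocabulary


def pick_teacher(step: int, num_teachers: int) -> int:
    """Choose the supervising teacher for a time step.

    Round-robin is an assumption of this sketch; the abstract only says
    that teachers alternate at each time step.
    """
    return step % num_teachers


def alternating_targets(teacher_probs: List[List[Distribution]]) -> List[Distribution]:
    """Build one soft target per time step, each taken from a single teacher.

    teacher_probs[k][t] is teacher k's next-token distribution at step t.
    """
    num_teachers = len(teacher_probs)
    num_steps = len(teacher_probs[0])
    return [teacher_probs[pick_teacher(t, num_teachers)][t] for t in range(num_steps)]


def distillation_loss(targets: List[Distribution], student_probs: List[Distribution]) -> float:
    """Mean cross-entropy between per-step soft targets and student predictions.

    Cross-entropy is an assumed objective, not one stated in the abstract.
    """
    eps = 1e-12  # avoids log(0)
    total = 0.0
    for target, student in zip(targets, student_probs):
        total -= sum(p * math.log(q + eps) for p, q in zip(target, student))
    return total / len(targets)


# Toy example: 2 teachers (trained on disjoint data in the real setting),
# 3 time steps, vocabulary of size 2.
teachers = [
    [[0.9, 0.1], [0.8, 0.2], [0.7, 0.3]],  # teacher 0
    [[0.2, 0.8], [0.3, 0.7], [0.4, 0.6]],  # teacher 1
]
targets = alternating_targets(teachers)
student = [[0.5, 0.5]] * 3  # an untrained, uniform student
loss = distillation_loss(targets, student)
```

In this sketch the student sees teacher 0 at steps 0 and 2 and teacher 1 at step 1, so no single teacher supervises a whole sequence. That is one plausible reading of why alternation could limit how much of any one teacher's memorized content reaches the student.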

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2210.06772 [cs.CL]
	(or arXiv:2210.06772v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.06772

Submission history

From: Zhe Liu
[v1] Thu, 13 Oct 2022 06:26:41 UTC (20 KB)
