Data Redaction from Pre-trained GANs

Kong, Zhifeng; Chaudhuri, Kamalika

arXiv:2206.14389 (cs)

[Submitted on 29 Jun 2022 (v1), last revised 18 Jan 2023 (this version, v3)]

Abstract: Large pre-trained generative models are known to occasionally output undesirable samples, which undermines their trustworthiness. The common way to mitigate this is to re-train them from scratch using different data or different regularization, which consumes substantial computational resources and does not always fully address the problem.
In this work, we take a different, more compute-friendly approach and investigate how to post-edit a model after training so that it "redacts", that is, refrains from outputting, certain kinds of samples. We show that redaction is a fundamentally different task from data deletion, and that data deletion may not always lead to redaction. We then consider Generative Adversarial Networks (GANs) and provide three different algorithms for data redaction that differ in how the samples to be redacted are described. Extensive evaluations on real-world image datasets show that our algorithms outperform data deletion baselines and are capable of redacting data while retaining high generation quality at a fraction of the cost of full re-training.
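
The claim that data deletion may not lead to redaction can be illustrated with a toy sketch. This is not the paper's argument or any of its three algorithms: the "generator" here is just a 1-D Gaussian fitted to the data, and the interval [1.0, 1.5], the sample size, and all function names are assumptions chosen for illustration. Deleting every training point in the interval and refitting still leaves the model with noticeable probability mass there, so it can still output samples of the kind that was deleted.

```python
import math
import random
import statistics


def normal_cdf(x, mu, sigma):
    """CDF of a Gaussian N(mu, sigma^2) evaluated at x."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def interval_mass(lo, hi, mu, sigma):
    """Probability that N(mu, sigma^2) produces a sample in [lo, hi]."""
    return normal_cdf(hi, mu, sigma) - normal_cdf(lo, mu, sigma)


def fit_gaussian(samples):
    """Toy 'generative model': maximum-likelihood Gaussian fit."""
    return statistics.fmean(samples), statistics.pstdev(samples)


def deletion_demo(n=10000, lo=1.0, hi=1.5, seed=0):
    """Delete all training points in [lo, hi], refit, and measure the
    mass the refitted model still assigns to that region."""
    rng = random.Random(seed)
    data = [rng.gauss(0.0, 1.0) for _ in range(n)]
    # "Data deletion": drop every training point in the unwanted region.
    retained = [x for x in data if not (lo <= x <= hi)]
    mu, sigma = fit_gaussian(retained)
    return retained, interval_mass(lo, hi, mu, sigma)


if __name__ == "__main__":
    retained, mass = deletion_demo()
    left = sum(1.0 <= x <= 1.5 for x in retained)
    print(f"training points left in region: {left}")
    print(f"model mass still in region: {mass:.3f}")
```

The retained training set contains no points in the interval, yet the refitted model generalizes across the gap. Redaction, as the abstract defines it, asks for the stronger property that the model itself refrains from producing such samples.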

Comments:	SaTML 2023
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2206.14389 [cs.LG]
	(or arXiv:2206.14389v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2206.14389

Submission history

From: Zhifeng Kong
[v1] Wed, 29 Jun 2022 03:46:16 UTC (7,682 KB)
[v2] Sun, 25 Dec 2022 22:32:09 UTC (17,033 KB)
[v3] Wed, 18 Jan 2023 01:25:50 UTC (17,033 KB)
