Unleashing the Power of Randomization in Auditing Differentially Private ML

Pillutla, Krishna; Andrew, Galen; Kairouz, Peter; McMahan, H. Brendan; Oprea, Alina; Oh, Sewoong

arXiv:2305.18447 (cs)

[Submitted on 29 May 2023]

Abstract: We present a rigorous methodology for auditing differentially private machine learning algorithms by adding multiple carefully designed examples called canaries. We take a first-principles approach based on three key components. First, we introduce Lifted Differential Privacy (LiDP), which expands the definition of differential privacy to handle randomized datasets. This gives us the freedom to design randomized canaries. Second, we audit LiDP by trying to distinguish between a model trained with $K$ canaries and a model trained with $K - 1$ canaries in the dataset, leaving one canary out. By drawing the canaries i.i.d., LiDP can leverage the symmetry in the design and reuse each privately trained model to run multiple statistical tests, one for each canary. Third, we introduce novel confidence intervals that take advantage of the multiple test statistics by adapting to the empirical higher-order correlations. Together, this new recipe demonstrates significant improvements in sample complexity, both theoretically and empirically, using synthetic and real data. Further, recent advances in designing stronger canaries can be readily incorporated into the new framework.
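
To make the auditing idea concrete, here is a minimal sketch of the standard hypothesis-testing audit that this line of work builds on. It is not the paper's method. It assumes independent trials and plain Hoeffding bounds, and it does not implement LiDP, the reuse of one model across $K$ canaries, or the correlation-adaptive confidence intervals described in the abstract. The function names and the defaults for `delta` and `beta` are hypothetical. The sketch relies only on the fact that $(\varepsilon, \delta)$-differential privacy implies $\mathrm{TPR} \le e^{\varepsilon}\,\mathrm{FPR} + \delta$ for any test that tries to detect whether a canary was in the training set.

```python
import math


def hoeffding_radius(n, beta):
    """One-sided Hoeffding deviation that holds with probability >= 1 - beta."""
    return math.sqrt(math.log(1.0 / beta) / (2.0 * n))


def epsilon_lower_bound(true_positives, n_with, false_positives, n_without,
                        delta=1e-5, beta=0.05):
    """Empirical lower bound on epsilon from a canary-detection test.

    true_positives / n_with: how often the test fires when the canary WAS
    included in training; false_positives / n_without: how often it fires
    when the canary was left out. Trials are assumed independent (an
    assumption of this sketch, not of the paper).
    """
    # Split the failure probability beta across the two bounds (union bound).
    tpr_lo = true_positives / n_with - hoeffding_radius(n_with, beta / 2)
    fpr_hi = false_positives / n_without + hoeffding_radius(n_without, beta / 2)
    fpr_hi = min(fpr_hi, 1.0)

    # (eps, delta)-DP implies TPR <= exp(eps) * FPR + delta, hence
    # eps >= log((TPR - delta) / FPR). Plug in the conservative estimates.
    if tpr_lo - delta <= 0.0:
        return 0.0
    return max(0.0, math.log((tpr_lo - delta) / fpr_hi))


if __name__ == "__main__":
    # Hypothetical counts: 1000 models trained with the canary, 1000 without.
    print(epsilon_lower_bound(900, 1000, 100, 1000))
```

In this baseline every trial needs a separately trained model, which is the sample-complexity cost that, according to the abstract, the randomized-canary design is meant to reduce.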

Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Information Theory (cs.IT); Statistics Theory (math.ST)
Cite as:	arXiv:2305.18447 [cs.LG]
	(or arXiv:2305.18447v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2305.18447

Submission history

From: Krishna Pillutla
[v1] Mon, 29 May 2023 03:53:40 UTC (4,960 KB)
