Black Box Differential Privacy Auditing Using Total Variation Distance

Koskela, Antti; Mohammadi, Jafar

Computer Science > Machine Learning

arXiv:2406.04827v1 (cs)

[Submitted on 7 Jun 2024 (this version), latest version 5 Jul 2024 (v2)]

Abstract: We present a practical method to audit the differential privacy (DP) guarantees of a machine learning model using a small hold-out dataset that is not exposed to the model during training. Given a score function, such as the loss function used during training, our method estimates the total variation (TV) distance between the scores obtained on a subset of the training data and those obtained on the hold-out dataset. With some meta information about the underlying DP training algorithm, these TV distance values can be converted to $(\varepsilon,\delta)$-guarantees for any $\delta$. We show that these score distributions asymptotically give lower bounds for the DP guarantees of the underlying training algorithm; for practicality, however, we perform a one-shot estimation. We specify conditions that lead to lower bounds for the DP guarantees with high probability. To estimate the TV distance between the score distributions, we use a simple density estimation method based on histograms. We show that the TV distance gives a nearly optimally robust estimator with an error rate of $\mathcal{O}(k^{-1/3})$, where $k$ is the total number of samples. Numerical experiments on benchmark datasets illustrate the effectiveness of our approach and show improvements over baseline methods for black-box auditing.
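
The histogram-based TV distance estimate mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `tv_distance_histogram`, the shared equal-width binning, and the default of roughly $k^{1/3}$ bins are assumptions made here for illustration, and the synthetic Gaussian scores merely stand in for per-example losses on training and hold-out data.

```python
import math

import numpy as np


def tv_distance_histogram(scores_a, scores_b, num_bins=None):
    """Estimate the TV distance between two score samples with histograms.

    Both samples are binned on a shared equal-width grid, and the TV
    distance is approximated as half the L1 distance between the
    normalized bin frequencies. The binning scheme is an assumption of
    this sketch, not taken from the paper.
    """
    scores_a = np.asarray(scores_a, dtype=float)
    scores_b = np.asarray(scores_b, dtype=float)

    lo = min(scores_a.min(), scores_b.min())
    hi = max(scores_a.max(), scores_b.max())
    if lo == hi:
        # All scores are identical, so the empirical distributions coincide.
        return 0.0

    if num_bins is None:
        # Assumed default: about k^(1/3) bins, where k is the total sample count.
        k = scores_a.size + scores_b.size
        num_bins = max(2, math.ceil(k ** (1.0 / 3.0)))

    edges = np.linspace(lo, hi, num_bins + 1)
    counts_a, _ = np.histogram(scores_a, bins=edges)
    counts_b, _ = np.histogram(scores_b, bins=edges)

    # Normalize counts to empirical probabilities per bin.
    p = counts_a / counts_a.sum()
    q = counts_b / counts_b.sum()
    return 0.5 * float(np.abs(p - q).sum())


# Synthetic stand-ins for scores on training samples and hold-out samples.
rng = np.random.default_rng(0)
train_scores = rng.normal(loc=0.0, scale=1.0, size=5000)
holdout_scores = rng.normal(loc=0.5, scale=1.0, size=5000)
print(tv_distance_histogram(train_scores, holdout_scores))
```

Converting such an estimate into $(\varepsilon,\delta)$-guarantees requires the meta information about the training algorithm that the abstract refers to, so that step is left out of this sketch.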

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
Cite as: arXiv:2406.04827 [cs.LG] (or arXiv:2406.04827v1 [cs.LG] for this version)
DOI: https://doi.org/10.48550/arXiv.2406.04827

Submission history

From: Antti Koskela
[v1] Fri, 7 Jun 2024 10:52:15 UTC (149 KB)
[v2] Fri, 5 Jul 2024 21:38:38 UTC (115 KB)
