Analyzing Privacy Leakage in Machine Learning via Multiple Hypothesis Testing: A Lesson From Fano

Guo, Chuan; Sablayrolles, Alexandre; Sanjabi, Maziar

Computer Science > Machine Learning

arXiv:2210.13662v1 (cs)

[Submitted on 24 Oct 2022 (this version), latest version 10 Aug 2023 (v2)]

Title:Analyzing Privacy Leakage in Machine Learning via Multiple Hypothesis Testing: A Lesson From Fano

Authors:Chuan Guo, Alexandre Sablayrolles, Maziar Sanjabi

View PDF

Abstract:Differential privacy (DP) is by far the most widely accepted framework for mitigating privacy risks in machine learning. However, exactly how small the privacy parameter $\epsilon$ needs to be to protect against certain privacy risks in practice is still not well-understood. In this work, we study data reconstruction attacks for discrete data and analyze it under the framework of multiple hypothesis testing. We utilize different variants of the celebrated Fano's inequality to derive upper bounds on the inferential power of a data reconstruction adversary when the model is trained differentially privately. Importantly, we show that if the underlying private data takes values from a set of size $M$, then the target privacy parameter $\epsilon$ can be $O(\log M)$ before the adversary gains significant inferential power. Our analysis offers theoretical evidence for the empirical effectiveness of DP against data reconstruction attacks even at relatively large values of $\epsilon$.

Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Information Theory (cs.IT)
Cite as:	arXiv:2210.13662 [cs.LG]
	(or arXiv:2210.13662v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2210.13662

Submission history

From: Chuan Guo [view email]
[v1] Mon, 24 Oct 2022 23:50:12 UTC (823 KB)
[v2] Thu, 10 Aug 2023 03:02:21 UTC (858 KB)

Computer Science > Machine Learning

Title:Analyzing Privacy Leakage in Machine Learning via Multiple Hypothesis Testing: A Lesson From Fano

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Analyzing Privacy Leakage in Machine Learning via Multiple Hypothesis Testing: A Lesson From Fano

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators