An end-to-end Differentially Private Latent Dirichlet Allocation Using a Spectral Algorithm

DeCarolis, Christopher; Ram, Mukul; Esmaeili, Seyed A.; Wang, Yu-Xiang; Huang, Furong

Statistics > Machine Learning

arXiv:1805.10341 (stat)

[Submitted on 25 May 2018 (v1), last revised 17 Jan 2020 (this version, v3)]

Title:An end-to-end Differentially Private Latent Dirichlet Allocation Using a Spectral Algorithm

Authors:Christopher DeCarolis, Mukul Ram, Seyed A. Esmaeili, Yu-Xiang Wang, Furong Huang

View PDF

Abstract:We provide an end-to-end differentially private spectral algorithm for learning LDA, based on matrix/tensor decompositions, and establish theoretical guarantees on utility/consistency of the estimated model parameters. The spectral algorithm consists of multiple algorithmic steps, named as "{edges}", to which noise could be injected to obtain differential privacy. We identify \emph{subsets of edges}, named as "{configurations}", such that adding noise to all edges in such a subset guarantees differential privacy of the end-to-end spectral algorithm. We characterize the sensitivity of the edges with respect to the input and thus estimate the amount of noise to be added to each edge for any required privacy level. We then characterize the utility loss for each configuration as a function of injected noise. Overall, by combining the sensitivity and utility characterization, we obtain an end-to-end differentially private spectral algorithm for LDA and identify the corresponding configuration that outperforms others in any specific regime. We are the first to achieve utility guarantees under the required level of differential privacy for learning in LDA. Overall our method systematically outperforms differentially private variational inference.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1805.10341 [stat.ML]
	(or arXiv:1805.10341v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1805.10341

Submission history

From: Furong Huang [view email]
[v1] Fri, 25 May 2018 19:30:47 UTC (57 KB)
[v2] Fri, 7 Sep 2018 15:25:47 UTC (1 KB) (withdrawn)
[v3] Fri, 17 Jan 2020 06:01:52 UTC (82 KB)

Statistics > Machine Learning

Title:An end-to-end Differentially Private Latent Dirichlet Allocation Using a Spectral Algorithm

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:An end-to-end Differentially Private Latent Dirichlet Allocation Using a Spectral Algorithm

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators