Hot PATE: Private Aggregation of Distributions for Diverse Task

Cohen, Edith; Cohen-Wang, Benjamin; Lyu, Xin; Nelson, Jelani; Sarlos, Tamas; Stemmer, Uri

Computer Science > Machine Learning

arXiv:2312.02132 (cs)

[Submitted on 4 Dec 2023 (v1), last revised 17 May 2024 (this version, v2)]

Title:Hot PATE: Private Aggregation of Distributions for Diverse Task

Authors:Edith Cohen, Benjamin Cohen-Wang, Xin Lyu, Jelani Nelson, Tamas Sarlos, Uri Stemmer

View PDF HTML (experimental)

Abstract:The Private Aggregation of Teacher Ensembles (PATE) framework is a versatile approach to privacy-preserving machine learning. In PATE, teacher models that are not privacy-preserving are trained on distinct portions of sensitive data. Privacy-preserving knowledge transfer to a student model is then facilitated by privately aggregating teachers' predictions on new examples. Employing PATE with generative auto-regressive models presents both challenges and opportunities. These models excel in open ended \emph{diverse} (aka hot) tasks with multiple valid responses. Moreover, the knowledge of models is often encapsulated in the response distribution itself and preserving this diversity is critical for fluid and effective knowledge transfer from teachers to student. In all prior designs, higher diversity resulted in lower teacher agreement and thus -- a tradeoff between diversity and privacy. Prior works with PATE thus focused on non-diverse settings or limiting diversity to improve utility.
We propose \emph{hot PATE}, a design tailored for the diverse setting. In hot PATE, each teacher model produces a response distribution that can be highly diverse. We mathematically model the notion of \emph{preserving diversity} and propose an aggregation method, \emph{coordinated ensembles}, that preserves privacy and transfers diversity with \emph{no penalty} to privacy or efficiency. We demonstrate empirically the benefits of hot PATE for in-context learning via prompts and potential to unleash more of the capabilities of generative models.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Data Structures and Algorithms (cs.DS)
Cite as:	arXiv:2312.02132 [cs.LG]
	(or arXiv:2312.02132v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2312.02132

Submission history

From: Edith Cohen [view email]
[v1] Mon, 4 Dec 2023 18:54:34 UTC (664 KB)
[v2] Fri, 17 May 2024 18:40:36 UTC (2,778 KB)

Computer Science > Machine Learning

Title:Hot PATE: Private Aggregation of Distributions for Diverse Task

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Hot PATE: Private Aggregation of Distributions for Diverse Task

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators