Differentially Private Optimization on Large Model at Small Cost

Bu, Zhiqi; Wang, Yu-Xiang; Zha, Sheng; Karypis, George

Computer Science > Machine Learning

arXiv:2210.00038 (cs)

[Submitted on 30 Sep 2022 (v1), last revised 19 Sep 2023 (this version, v2)]

Title:Differentially Private Optimization on Large Model at Small Cost

Authors:Zhiqi Bu, Yu-Xiang Wang, Sheng Zha, George Karypis

View PDF

Abstract:Differentially private (DP) optimization is the standard paradigm to learn large neural networks that are accurate and privacy-preserving. The computational cost for DP deep learning, however, is notoriously heavy due to the per-sample gradient clipping. Existing DP implementations are 2-1000X more costly in time and space complexity than the standard (non-private) training. In this work, we develop a novel Book-Keeping (BK) technique that implements existing DP optimizers (thus achieving the same accuracy), with a substantial improvement on the computational cost. Specifically, BK enables DP training on large models and high dimensional data to be roughly as fast and memory-saving as the standard training, whereas previous DP algorithms can be inefficient or incapable of training due to memory error. The computational advantage of BK is supported by the complexity analysis as well as extensive experiments on vision and language tasks. Our implementation achieves state-of-the-art (SOTA) accuracy with very small extra cost: on GPT2 and at almost the same memory cost (<1% overhead), BK has 1.03X the time complexity of the standard training (0.83X training speed in practice), and 0.61X the time complexity of the most efficient DP implementation (1.36X training speed in practice). We open-source the codebase for the BK algorithm at the FastDP library (this https URL).

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2210.00038 [cs.LG]
	(or arXiv:2210.00038v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2210.00038

Submission history

From: Zhiqi Bu [view email]
[v1] Fri, 30 Sep 2022 18:38:53 UTC (719 KB)
[v2] Tue, 19 Sep 2023 02:14:06 UTC (846 KB)

Computer Science > Machine Learning

Title:Differentially Private Optimization on Large Model at Small Cost

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Differentially Private Optimization on Large Model at Small Cost

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators