Cardinality Estimators do not Preserve Privacy

Desfontaines, Damien; Lochbihler, Andreas; Basin, David

arXiv:1808.05879 (cs)

[Submitted on 17 Aug 2018 (v1), last revised 18 Dec 2018 (this version, v3)]

Abstract: Cardinality estimators like HyperLogLog are sketching algorithms that estimate the number of distinct elements in a large multiset. Their use in privacy-sensitive contexts raises the question of whether they leak private information. In particular, can they provide any privacy guarantees while preserving their strong aggregation properties? We formulate an abstract notion of cardinality estimators that captures this aggregation requirement: one can merge sketches without losing precision. We propose an attacker model and a corresponding privacy definition, strictly weaker than differential privacy: we assume that the attacker has no prior knowledge of the data. We then show that if a cardinality estimator satisfies this definition, then it cannot have a reasonable level of accuracy. We prove similar results for weaker versions of our definition, and analyze the privacy of existing algorithms, showing that their average privacy loss is significant, even for multisets with large cardinalities. We conclude that strong aggregation requirements are incompatible with any reasonable definition of privacy, and that cardinality estimators should be considered as sensitive as raw data. We also propose risk mitigation strategies for their real-world applications.
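The aggregation requirement named in the abstract (merging sketches without losing precision) can be made concrete with a small example. What follows is a minimal, simplified HyperLogLog-style sketch, not the paper's formal construction: the register count, the SHA-256 hash, and all function names (`new_sketch`, `add`, `merge`, `estimate`, `could_contain`) are assumptions chosen for illustration. The last function hints at why such sketches may leak information: adding an element that is already present never changes the sketch, so an unchanged sketch is weak evidence of membership.

```python
import hashlib
import math

P = 10        # number of index bits (an assumed parameter for this sketch)
M = 1 << P    # number of registers


def _hash64(item):
    """Map a string to a 64-bit integer using SHA-256 (illustrative choice)."""
    digest = hashlib.sha256(item.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def new_sketch():
    return [0] * M


def add(sketch, item):
    """Add one element in place and return the sketch."""
    h = _hash64(item)
    idx = h >> (64 - P)                       # top P bits pick a register
    rest = h & ((1 << (64 - P)) - 1)          # remaining 64 - P bits
    rank = (64 - P) - rest.bit_length() + 1   # position of the leftmost 1-bit
    sketch[idx] = max(sketch[idx], rank)
    return sketch


def sketch_of(items):
    s = new_sketch()
    for item in items:
        add(s, item)
    return s


def merge(a, b):
    """Register-wise maximum: equals the sketch of the union of both inputs."""
    return [max(x, y) for x, y in zip(a, b)]


def estimate(sketch):
    """Raw HyperLogLog estimate with a linear-counting fallback for small sets."""
    alpha = 0.7213 / (1 + 1.079 / M)
    raw = alpha * M * M / sum(2.0 ** -r for r in sketch)
    zeros = sketch.count(0)
    if raw <= 2.5 * M and zeros:
        return M * math.log(M / zeros)
    return raw


def could_contain(sketch, item):
    """False means the item is certainly absent; True means it may be present."""
    return add(list(sketch), item) == sketch


a = sketch_of(f"user-{i}" for i in range(5000))
b = sketch_of(f"user-{i}" for i in range(2500, 10000))
union = sketch_of(f"user-{i}" for i in range(10000))

print(merge(a, b) == union)          # merging loses nothing relative to the union
print(round(estimate(union)))        # approximate count of distinct users
print(could_contain(a, "user-42"))   # an element already in the sketch
```

Because `merge` is a register-wise maximum, the merged sketch is identical to one built directly from the union, which is the lossless aggregation property the abstract refers to. The `could_contain` check is only an informal illustration of the kind of signal an attacker might use; the paper's attacker model and privacy definition are more precise than this.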

Subjects: Cryptography and Security (cs.CR); Data Structures and Algorithms (cs.DS)
Cite as: arXiv:1808.05879 [cs.CR] (or arXiv:1808.05879v3 [cs.CR] for this version)
DOI: https://doi.org/10.48550/arXiv.1808.05879

Submission history

From: Damien Desfontaines
[v1] Fri, 17 Aug 2018 14:26:00 UTC (338 KB)
[v2] Tue, 4 Dec 2018 14:53:18 UTC (339 KB)
[v3] Tue, 18 Dec 2018 22:27:26 UTC (337 KB)
