The Crossover Process: Learnability and Data Protection from Inference Attacks

Nock, Richard; Patrini, Giorgio; Lattimore, Finnian; Caetano, Tiberio

Computer Science > Machine Learning

arXiv:1606.04160 (cs)

[Submitted on 13 Jun 2016 (v1), last revised 7 Mar 2017 (this version, v2)]

Title:The Crossover Process: Learnability and Data Protection from Inference Attacks

Authors:Richard Nock, Giorgio Patrini, Finnian Lattimore, Tiberio Caetano

View PDF

Abstract:It is usual to consider data protection and learnability as conflicting objectives. This is not always the case: we show how to jointly control inference --- seen as the attack --- and learnability by a noise-free process that mixes training examples, the Crossover Process (cp). One key point is that the cp~is typically able to alter joint distributions without touching on marginals, nor altering the sufficient statistic for the class. In other words, it saves (and sometimes improves) generalization for supervised learning, but can alter the relationship between covariates --- and therefore fool measures of nonlinear independence and causal inference into misleading ad-hoc conclusions. For example, a cp~can increase / decrease odds ratios, bring fairness or break fairness, tamper with disparate impact, strengthen, weaken or reverse causal directions, change observed statistical measures of dependence. For each of these, we quantify changes brought by a cp, as well as its statistical impact on generalization abilities via a new complexity measure that we call the Rademacher cp~complexity. Experiments on a dozen readily available domains validate the theory.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
ACM classes:	I.2.6; K.4.1
Cite as:	arXiv:1606.04160 [cs.LG]
	(or arXiv:1606.04160v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1606.04160

Submission history

From: Richard Nock [view email]
[v1] Mon, 13 Jun 2016 22:27:36 UTC (5,329 KB)
[v2] Tue, 7 Mar 2017 21:41:50 UTC (5,405 KB)

Computer Science > Machine Learning

Title:The Crossover Process: Learnability and Data Protection from Inference Attacks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:The Crossover Process: Learnability and Data Protection from Inference Attacks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators