A Marketplace for Data: An Algorithmic Solution

Agarwal, Anish; Dahleh, Munther; Sarkar, Tuhin

arXiv:1805.08125 (cs)

[Submitted on 21 May 2018 (v1), last revised 12 May 2019 (this version, v4)]

Abstract: In this work, we aim to design a data marketplace: a robust real-time matching mechanism to efficiently buy and sell training data for Machine Learning tasks. While the monetization of data and pre-trained models is an essential focus of industry today, there does not exist a market mechanism to price training data and match buyers to sellers while still addressing the associated (computational and other) complexity. The challenge in creating such a market stems from the very nature of data as an asset: (i) it is freely replicable; (ii) its value is inherently combinatorial due to correlation with signal in other data; (iii) prediction tasks and the value of accuracy vary widely; (iv) the usefulness of training data is difficult to verify a priori without first applying it to a prediction task. Our main contributions are to: (i) propose a mathematical model for a two-sided data market and formally define the key associated challenges; (ii) construct algorithms for such a market to function and analyze how they meet the challenges defined. We highlight two technical contributions: (i) a new notion of 'fairness' required for cooperative games with freely replicable goods; (ii) a truthful, zero-regret mechanism to auction a class of combinatorial goods, based on Myerson's payment function and the Multiplicative Weights algorithm. These might be of independent interest.

Subjects: Computer Science and Game Theory (cs.GT)
Cite as: arXiv:1805.08125 [cs.GT] (or arXiv:1805.08125v4 [cs.GT] for this version)
DOI: https://doi.org/10.48550/arXiv.1805.08125

Submission history

From: Anish Agarwal
[v1] Mon, 21 May 2018 15:32:42 UTC (670 KB)
[v2] Wed, 31 Oct 2018 18:09:00 UTC (635 KB)
[v3] Thu, 21 Feb 2019 23:20:01 UTC (692 KB)
[v4] Sun, 12 May 2019 21:41:53 UTC (796 KB)
