Learning and Generalization for Matching Problems

Cohen, Alon; Hassidim, Avinatan; Kaplan, Haim; Mansour, Yishay; Moran, Shay

Computer Science > Machine Learning

arXiv:1902.04741v2 (cs)

[Submitted on 13 Feb 2019 (v1), revised 24 Feb 2019 (this version, v2), latest version 31 May 2019 (v3)]

Title:Learning and Generalization for Matching Problems

Authors:Alon Cohen, Avinatan Hassidim, Haim Kaplan, Yishay Mansour, Shay Moran

View PDF

Abstract:We study a classic algorithmic problem through the lens of statistical learning. That is, we consider a matching problem where the input graph is sampled from some distribution. This distribution is unknown to the algorithm; however, an additional graph which is sampled from the same distribution is given during a training phase (preprocessing). More specifically, the algorithmic problem is to match $k$ out of $n$ items that arrive online to $d$ categories ($d\ll k \ll n$). Our goal is to design a two-stage online algorithm that retains a small subset of items in the first stage which contains an offline matching of maximum weight. We then compute this optimal matching in a second stage. The added statistical component is that before the online matching process begins, our algorithms learn from a training set consisting of another matching instance drawn from the same unknown distribution. Using this training set, we learn a policy that we apply during the online matching process. We consider a class of online policies that we term \emph{thresholds policies}. For this class, we derive uniform convergence results both for the number of retained items and the value of the optimal matching. We show that the number of retained items and the value of the offline optimal matching deviate from their expectation by $O(\sqrt{k})$. This requires usage of less-standard concentration inequalities (standard ones give deviations of $O(\sqrt{n})$). Furthermore, we design an algorithm that outputs the optimal offline solution with high probability while retaining only $O(k\log \log n)$ items in expectation.

Comments:	15 pages, added more details to the proof of Lemma 16, as well as a figure to illustrate the construction
Subjects:	Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
Cite as:	arXiv:1902.04741 [cs.LG]
	(or arXiv:1902.04741v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1902.04741

Submission history

From: Shay Moran [view email]
[v1] Wed, 13 Feb 2019 05:02:12 UTC (31 KB)
[v2] Sun, 24 Feb 2019 20:51:00 UTC (45 KB)
[v3] Fri, 31 May 2019 14:44:59 UTC (56 KB)

Computer Science > Machine Learning

Title:Learning and Generalization for Matching Problems

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning and Generalization for Matching Problems

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators