Information Theoretic Optimal Learning of Gaussian Graphical Models

Misra, Sidhant; Vuffray, Marc; Lokhov, Andrey Y.

Computer Science > Machine Learning

arXiv:1703.04886 (cs)

[Submitted on 15 Mar 2017 (v1), last revised 18 Nov 2018 (this version, v3)]

Title:Information Theoretic Optimal Learning of Gaussian Graphical Models

Authors:Sidhant Misra, Marc Vuffray, Andrey Y. Lokhov

View PDF

Abstract:What is the optimal number of independent observations from which a sparse Gaussian Graphical Model can be correctly recovered? Information-theoretic arguments provide a lower bound on the minimum number of samples necessary to perfectly identify the support of any multivariate normal distribution as a function of model parameters. For a model defined on a sparse graph with $p$ nodes, a maximum degree $d$ and minimum normalized edge strength $\kappa$, this necessary number of samples scales at least as $d \log p/\kappa^2$. The sample complexity requirements of existing methods for perfect graph reconstruction exhibit dependency on additional parameters that do not enter in the lower bound. The question of whether the lower bound is tight and achievable by a polynomial time algorithm remains open. In this paper, we constructively answer this question and propose an algorithm, termed DICE, whose sample complexity matches the information-theoretic lower bound up to a universal constant factor. We also propose a related algorithm SLICE that has a slightly higher sample complexity, but can be implemented as a mixed integer quadratic program which makes it attractive in practice. Importantly, SLICE retains a critical advantage of DICE in that its sample complexity only depends on quantities present in the information theoretic lower bound. We anticipate that this result will stimulate future search of computationally efficient sample-optimal algorithms.

Subjects:	Machine Learning (cs.LG); Information Theory (cs.IT); Statistics Theory (math.ST)
Cite as:	arXiv:1703.04886 [cs.LG]
	(or arXiv:1703.04886v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1703.04886

Submission history

From: Marc Vuffray [view email]
[v1] Wed, 15 Mar 2017 02:25:31 UTC (106 KB)
[v2] Mon, 26 Feb 2018 05:46:43 UTC (267 KB)
[v3] Sun, 18 Nov 2018 04:32:34 UTC (169 KB)

Computer Science > Machine Learning

Title:Information Theoretic Optimal Learning of Gaussian Graphical Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Information Theoretic Optimal Learning of Gaussian Graphical Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators