Measures of Entropy from Data Using Infinitely Divisible Kernels

Luis G. Sanchez Giraldo, Murali Rao, Jose C. Principe

arXiv:1211.2459 (cs)

[Submitted on 11 Nov 2012 (v1), last revised 1 Sep 2014 (this version, v3)]

Abstract: Information theory provides principled ways to analyze inference and learning problems such as hypothesis testing, clustering, dimensionality reduction, and classification. However, using information-theoretic quantities as test statistics, that is, as quantities obtained from empirical data, poses a challenging estimation problem that often leads to strong simplifications, such as Gaussian models, or to plug-in density estimators that are restricted to certain representations of the data. This paper presents a framework for obtaining measures of entropy non-parametrically and directly from data, using operators in reproducing kernel Hilbert spaces defined by infinitely divisible kernels. The entropy functionals, which resemble quantum entropies, are defined on positive definite matrices and satisfy axioms similar to those of Rényi's definition of entropy. Convergence of the proposed estimators follows from concentration results on the difference between the ordered spectra of the Gram matrices and of the integral operators associated with the population quantities. By capitalizing on both the axiomatic definition of entropy and the representation power of positive definite kernels, the proposed measure of entropy avoids estimating the probability distribution underlying the data. Estimators of kernel-based conditional entropy and mutual information are also defined. Numerical experiments on independence tests compare favourably with the state of the art.
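The abstract does not state the functional explicitly, so the following is only a minimal sketch of how an entropy "defined on positive definite matrices" might be computed from data. It assumes a Rényi-type form S_alpha(A) = log2(tr(A^alpha)) / (1 - alpha) applied to a Gram matrix A normalized to unit trace, and a joint term built from the normalized Hadamard product of two Gram matrices. The function names, the choice of a Gaussian kernel (which is infinitely divisible), and the bandwidth `sigma` are illustrative assumptions, not details taken from this page.

```python
import numpy as np


def gaussian_gram(x, sigma=1.0):
    """Gaussian-kernel Gram matrix of the samples in x (rows are samples).

    The kernel choice and the bandwidth `sigma` are assumptions made for
    this sketch.
    """
    x = np.asarray(x, dtype=float)
    if x.ndim == 1:
        x = x[:, None]
    sq = np.sum(x ** 2, axis=1)
    # Pairwise squared distances, clipped at zero to absorb round-off.
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * (x @ x.T), 0.0)
    return np.exp(-d2 / (2.0 * sigma ** 2))


def normalize_gram(K):
    """Scale K to unit diagonal, then divide by the trace so tr(A) = 1."""
    d = np.sqrt(np.diag(K))
    A = K / np.outer(d, d)
    return A / np.trace(A)


def matrix_renyi_entropy(A, alpha=2.0):
    """Assumed form: S_alpha(A) = log2(tr(A^alpha)) / (1 - alpha).

    A must be symmetric positive semidefinite with unit trace, so its
    eigenvalues behave like a probability vector.
    """
    if alpha <= 0 or alpha == 1:
        raise ValueError("alpha must be positive and different from 1")
    eig = np.linalg.eigvalsh(A)
    eig = np.clip(eig, 0.0, None)  # remove tiny negative round-off values
    eig = eig / eig.sum()
    return np.log2(np.sum(eig ** alpha)) / (1.0 - alpha)


def matrix_mutual_information(A, B, alpha=2.0):
    """Assumed form: S(A) + S(B) - S(A o B / tr(A o B)), o = Hadamard product."""
    AB = A * B
    AB = AB / np.trace(AB)
    return (matrix_renyi_entropy(A, alpha)
            + matrix_renyi_entropy(B, alpha)
            - matrix_renyi_entropy(AB, alpha))
```

Under these assumptions the estimate depends only on the eigenvalues of the normalized Gram matrix, so no density estimate is formed at any point. For n samples the value lies between 0 (all samples indistinguishable under the kernel) and log2(n) (all samples mutually orthogonal in feature space).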

Subjects:	Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
Cite as:	arXiv:1211.2459 [cs.LG]
	(or arXiv:1211.2459v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1211.2459

Submission history

From: Luis Sanchez Giraldo
[v1] Sun, 11 Nov 2012 20:49:28 UTC (67 KB)
[v2] Mon, 9 Dec 2013 19:38:31 UTC (76 KB)
[v3] Mon, 1 Sep 2014 21:52:55 UTC (86 KB)
