An Operator Theoretic Approach to Nonparametric Mixture Models

Vandermeulen, Robert A.; Scott, Clayton D.

Statistics > Machine Learning

arXiv:1607.00071 (stat)

[Submitted on 30 Jun 2016 (v1), last revised 13 Oct 2016 (this version, v2)]

Title:An Operator Theoretic Approach to Nonparametric Mixture Models

Authors:Robert A. Vandermeulen, Clayton D. Scott

View PDF

Abstract:When estimating finite mixture models, it is common to make assumptions on the mixture components, such as parametric assumptions. In this work, we make no distributional assumptions on the mixture components and instead assume that observations from the mixture model are grouped, such that observations in the same group are known to be drawn from the same mixture component. We precisely characterize the number of observations $n$ per group needed for the mixture model to be identifiable, as a function of the number $m$ of mixture components. In addition to our assumption-free analysis, we also study the settings where the mixture components are either linearly independent or jointly irreducible. Furthermore, our analysis considers two kinds of identifiability -- where the mixture model is the simplest one explaining the data, and where it is the only one. As an application of these results, we precisely characterize identifiability of multinomial mixture models. Our analysis relies on an operator-theoretic framework that associates mixture models in the grouped-sample setting with certain infinite-dimensional tensors. Based on this framework, we introduce general spectral algorithms for recovering the mixture components and illustrate their use on a synthetic data set.

Comments:	Contains and greatly extends the results from our previous work, arXiv:1502.06644, and thus contains some overlap with that work. This version contains some small grammatical and technical corrections as well as some changes for improved clarity
Subjects:	Machine Learning (stat.ML); Statistics Theory (math.ST)
Cite as:	arXiv:1607.00071 [stat.ML]
	(or arXiv:1607.00071v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1607.00071

Submission history

From: Robert Vandermeulen [view email]
[v1] Thu, 30 Jun 2016 23:01:37 UTC (58 KB)
[v2] Thu, 13 Oct 2016 02:59:07 UTC (58 KB)

Statistics > Machine Learning

Title:An Operator Theoretic Approach to Nonparametric Mixture Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:An Operator Theoretic Approach to Nonparametric Mixture Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators