Deep clustering: On the link between discriminative models and K-means

Jabi, Mohammed; Pedersoli, Marco; Mitiche, Amar; Ayed, Ismail Ben

Computer Science > Machine Learning

arXiv:1810.04246 (cs)

[Submitted on 9 Oct 2018 (v1), last revised 15 Dec 2019 (this version, v2)]

Title:Deep clustering: On the link between discriminative models and K-means

Authors:Mohammed Jabi, Marco Pedersoli, Amar Mitiche, Ismail Ben Ayed

View PDF

Abstract:In the context of recent deep clustering studies, discriminative models dominate the literature and report the most competitive performances. These models learn a deep discriminative neural network classifier in which the labels are latent. Typically, they use multinomial logistic regression posteriors and parameter regularization, as is very common in supervised learning. It is generally acknowledged that discriminative objective functions (e.g., those based on the mutual information or the KL divergence) are more flexible than generative approaches (e.g., K-means) in the sense that they make fewer assumptions about the data distributions and, typically, yield much better unsupervised deep learning results. On the surface, several recent discriminative models may seem unrelated to K-means. This study shows that these models are, in fact, equivalent to K-means under mild conditions and common posterior models and parameter regularization. We prove that, for the commonly used logistic regression posteriors, maximizing the $L_2$ regularized mutual information via an approximate alternating direction method (ADM) is equivalent to a soft and regularized K-means loss. Our theoretical analysis not only connects directly several recent state-of-the-art discriminative models to K-means, but also leads to a new soft and regularized deep K-means algorithm, which yields competitive performance on several image clustering benchmarks.

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:1810.04246 [cs.LG]
	(or arXiv:1810.04246v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1810.04246

Submission history

From: Mohammed Jabi [view email]
[v1] Tue, 9 Oct 2018 21:17:09 UTC (250 KB)
[v2] Sun, 15 Dec 2019 23:28:05 UTC (713 KB)

Computer Science > Machine Learning

Title:Deep clustering: On the link between discriminative models and K-means

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Deep clustering: On the link between discriminative models and K-means

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators