Improving k-Means Clustering Performance with Disentangled Internal Representations

Agarap, Abien Fred; Azcarraga, Arnulfo P.

Computer Science > Machine Learning

arXiv:2006.04535 (cs)

[Submitted on 5 Jun 2020]

Title:Improving k-Means Clustering Performance with Disentangled Internal Representations

Authors:Abien Fred Agarap, Arnulfo P. Azcarraga

View PDF

Abstract:Deep clustering algorithms combine representation learning and clustering by jointly optimizing a clustering loss and a non-clustering loss. In such methods, a deep neural network is used for representation learning together with a clustering network. Instead of following this framework to improve clustering performance, we propose a simpler approach of optimizing the entanglement of the learned latent code representation of an autoencoder. We define entanglement as how close pairs of points from the same class or structure are, relative to pairs of points from different classes or structures. To measure the entanglement of data points, we use the soft nearest neighbor loss, and expand it by introducing an annealing temperature factor. Using our proposed approach, the test clustering accuracy was 96.2% on the MNIST dataset, 85.6% on the Fashion-MNIST dataset, and 79.2% on the EMNIST Balanced dataset, outperforming our baseline models.

Comments:	To be presented at IJCNN 2020
Subjects:	Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
Cite as:	arXiv:2006.04535 [cs.LG]
	(or arXiv:2006.04535v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2006.04535

Submission history

From: Abien Fred Agarap [view email]
[v1] Fri, 5 Jun 2020 11:32:34 UTC (1,962 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2020-06

Change to browse by:

cs
cs.NE
stat
stat.ML

References & Citations

1 blog link

(what is this?)

DBLP - CS Bibliography

listing | bibtex

Abien Fred Agarap

export BibTeX citation

Computer Science > Machine Learning

Title:Improving k-Means Clustering Performance with Disentangled Internal Representations

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Improving k-Means Clustering Performance with Disentangled Internal Representations

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators