GPGPU Linear Complexity t-SNE Optimization

Pezzotti, Nicola; Thijssen, Julian; Mordvintsev, Alexander; Hollt, Thomas; van Lew, Baldur; Lelieveldt, Boudewijn P. F.; Eisemann, Elmar; Vilanova, Anna

Computer Science > Machine Learning

arXiv:1805.10817 (cs)

[Submitted on 28 May 2018 (v1), last revised 8 Aug 2019 (this version, v2)]

Title:GPGPU Linear Complexity t-SNE Optimization

Authors:Nicola Pezzotti, Julian Thijssen, Alexander Mordvintsev, Thomas Hollt, Baldur van Lew, Boudewijn P.F. Lelieveldt, Elmar Eisemann, Anna Vilanova

View PDF

Abstract:The t-distributed Stochastic Neighbor Embedding (tSNE) algorithm has become in recent years one of the most used and insightful techniques for the exploratory data analysis of high-dimensional data. tSNE reveals clusters of high-dimensional data points at different scales while it requires only minimal tuning of its parameters. Despite these advantages, the computational complexity of the algorithm limits its application to relatively small datasets. To address this problem, several evolutions of tSNE have been developed in recent years, mainly focusing on the scalability of the similarity computations between data points. However, these contributions are insufficient to achieve interactive rates when visualizing the evolution of the tSNE embedding for large datasets. In this work, we present a novel approach to the minimization of the tSNE objective function that heavily relies on modern graphics hardware and has linear computational complexity. Our technique does not only beat the state of the art, but can even be executed on the client side in a browser. We propose to approximate the repulsion forces between data points using adaptive-resolution textures that are drawn at every iteration with WebGL. This approximation allows us to reformulate the tSNE minimization problem as a series of tensor operation that are computed with this http URL, a JavaScript library for scalable tensor computations.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1805.10817 [cs.LG]
	(or arXiv:1805.10817v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1805.10817

Submission history

From: Nicola Pezzotti [view email]
[v1] Mon, 28 May 2018 08:49:46 UTC (3,662 KB)
[v2] Thu, 8 Aug 2019 20:45:40 UTC (5,501 KB)

Computer Science > Machine Learning

Title:GPGPU Linear Complexity t-SNE Optimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:GPGPU Linear Complexity t-SNE Optimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators