Small Transformers Compute Universal Metric Embeddings

Kratsios, Anastasis; Debarnot, Valentin; Dokmanić, Ivan

Computer Science > Machine Learning

arXiv:2209.06788v1 (cs)

[Submitted on 14 Sep 2022 (this version), latest version 18 Oct 2022 (v2)]

Title:Small Transformers Compute Universal Metric Embeddings

Authors:Anastasis Kratsios, Valentin Debarnot, Ivan Dokmanić

View PDF

Abstract:We study representations of data from an arbitrary metric space $\mathcal{X}$ in the space of univariate Gaussian mixtures with a transport metric (Delon and Desolneux 2020). We derive embedding guarantees for feature maps implemented by small neural networks called \emph{probabilistic transformers}. Our guarantees are of memorization type: we prove that a probabilistic transformer of depth about $n\log(n)$ and width about $n^2$ can bi-Hölder embed any $n$-point dataset from $\mathcal{X}$ with low metric distortion, thus avoiding the curse of dimensionality. We further derive probabilistic bi-Lipschitz guarantees which trade off the amount of distortion and the probability that a randomly chosen pair of points embeds with that distortion. If $\mathcal{X}$'s geometry is sufficiently regular, we obtain stronger, bi-Lipschitz guarantees for all points in the dataset. As applications we derive neural embedding guarantees for datasets from Riemannian manifolds, metric trees, and certain types of combinatorial graphs.

Comments:	42 pages, 10 Figures, 3 Tables
Subjects:	Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Combinatorics (math.CO); Metric Geometry (math.MG); Machine Learning (stat.ML)
MSC classes:	68T07, 30L05, 68R12, 68T30, 05C12
Cite as:	arXiv:2209.06788 [cs.LG]
	(or arXiv:2209.06788v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2209.06788

Submission history

From: Anastasis Kratsios [view email]
[v1] Wed, 14 Sep 2022 17:12:41 UTC (22,613 KB)
[v2] Tue, 18 Oct 2022 15:39:57 UTC (22,624 KB)

Computer Science > Machine Learning

Title:Small Transformers Compute Universal Metric Embeddings

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Small Transformers Compute Universal Metric Embeddings

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators