Matrix Product Sketching via Coordinated Sampling

Daliri, Majid; Freire, Juliana; Li, Danrong; Musco, Christopher

Computer Science > Data Structures and Algorithms

arXiv:2501.17836 (cs)

[Submitted on 29 Jan 2025]

Title:Matrix Product Sketching via Coordinated Sampling

Authors:Majid Daliri, Juliana Freire, Danrong Li, Christopher Musco

View PDF HTML (experimental)

Abstract:We revisit the well-studied problem of approximating a matrix product, $\mathbf{A}^T\mathbf{B}$, based on small space sketches $\mathcal{S}(\mathbf{A})$ and $\mathcal{S}(\mathbf{B})$ of $\mathbf{A} \in \R^{n \times d}$ and $\mathbf{B}\in \R^{n \times m}$. We are interested in the setting where the sketches must be computed independently of each other, except for the use of a shared random seed. We prove that, when $\mathbf{A}$ and $\mathbf{B}$ are sparse, methods based on \emph{coordinated random sampling} can outperform classical linear sketching approaches, like Johnson-Lindenstrauss Projection or CountSketch. For example, to obtain Frobenius norm error $\epsilon\|\mathbf{A}\|_F\|\mathbf{B}\|_F$, coordinated sampling requires sketches of size $O(s/\epsilon^2)$ when $\mathbf{A}$ and $\mathbf{B}$ have at most $s \leq d,m$ non-zeros per row. In contrast, linear sketching leads to sketches of size $O(d/\epsilon^2)$ and $O(m/\epsilon^2)$ for $\mathbf{A}$ and $\mathbf{B}$. We empirically evaluate our approach on two applications: 1) distributed linear regression in databases, a problem motivated by tasks like dataset discovery and augmentation, and 2) approximating attention matrices in transformer-based language models. In both cases, our sampling algorithms yield an order of magnitude improvement over linear sketching.

Comments:	18 pages
Subjects:	Data Structures and Algorithms (cs.DS); Databases (cs.DB); Machine Learning (cs.LG)
Cite as:	arXiv:2501.17836 [cs.DS]
	(or arXiv:2501.17836v1 [cs.DS] for this version)
	https://doi.org/10.48550/arXiv.2501.17836

Submission history

From: Majid Daliri [view email]
[v1] Wed, 29 Jan 2025 18:35:38 UTC (3,962 KB)

Computer Science > Data Structures and Algorithms

Title:Matrix Product Sketching via Coordinated Sampling

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Data Structures and Algorithms

Title:Matrix Product Sketching via Coordinated Sampling

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators