Learning the Positions in CountSketch

Li, Yi; Lin, Honghao; Liu, Simin; Vakilian, Ali; Woodruff, David P.

Computer Science > Machine Learning

arXiv:2306.06611 (cs)

[Submitted on 11 Jun 2023 (v1), last revised 11 Apr 2024 (this version, v2)]

Title:Learning the Positions in CountSketch

Authors:Yi Li, Honghao Lin, Simin Liu, Ali Vakilian, David P. Woodruff

View PDF HTML (experimental)

Abstract:We consider sketching algorithms which first compress data by multiplication with a random sketch matrix, and then apply the sketch to quickly solve an optimization problem, e.g., low-rank approximation and regression. In the learning-based sketching paradigm proposed by~\cite{indyk2019learning}, the sketch matrix is found by choosing a random sparse matrix, e.g., CountSketch, and then the values of its non-zero entries are updated by running gradient descent on a training data set. Despite the growing body of work on this paradigm, a noticeable omission is that the locations of the non-zero entries of previous algorithms were fixed, and only their values were learned. In this work, we propose the first learning-based algorithms that also optimize the locations of the non-zero entries. Our first proposed algorithm is based on a greedy algorithm. However, one drawback of the greedy algorithm is its slower training time. We fix this issue and propose approaches for learning a sketching matrix for both low-rank approximation and Hessian approximation for second order optimization. The latter is helpful for a range of constrained optimization problems, such as LASSO and matrix estimation with a nuclear norm constraint. Both approaches achieve good accuracy with a fast running time. Moreover, our experiments suggest that our algorithm can still reduce the error significantly even if we only have a very limited number of training matrices.

Comments:	Corrected the proof of Theorem 5.1. arXiv admin note: text overlap with arXiv:2007.09890
Subjects:	Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS)
Cite as:	arXiv:2306.06611 [cs.LG]
	(or arXiv:2306.06611v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.06611

Submission history

From: Honghao Lin [view email]
[v1] Sun, 11 Jun 2023 07:28:35 UTC (181 KB)
[v2] Thu, 11 Apr 2024 00:31:28 UTC (181 KB)

Computer Science > Machine Learning

Title:Learning the Positions in CountSketch

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning the Positions in CountSketch

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators