StreaMRAK a Streaming Multi-Resolution Adaptive Kernel Algorithm

Oslandsbotn, Andreas; Kereta, Zeljko; Naumova, Valeriya; Freund, Yoav; Cloninger, Alexander

Computer Science > Machine Learning

arXiv:2108.10411 (cs)

[Submitted on 23 Aug 2021 (v1), last revised 7 Sep 2021 (this version, v2)]

Title:StreaMRAK a Streaming Multi-Resolution Adaptive Kernel Algorithm

Authors:Andreas Oslandsbotn, Zeljko Kereta, Valeriya Naumova, Yoav Freund, Alexander Cloninger

View PDF

Abstract:Kernel ridge regression (KRR) is a popular scheme for non-linear non-parametric learning. However, existing implementations of KRR require that all the data is stored in the main memory, which severely limits the use of KRR in contexts where data size far exceeds the memory size. Such applications are increasingly common in data mining, bioinformatics, and control. A powerful paradigm for computing on data sets that are too large for memory is the streaming model of computation, where we process one data sample at a time, discarding each sample before moving on to the next one. In this paper, we propose StreaMRAK - a streaming version of KRR. StreaMRAK improves on existing KRR schemes by dividing the problem into several levels of resolution, which allows continual refinement to the predictions. The algorithm reduces the memory requirement by continuously and efficiently integrating new samples into the training model. With a novel sub-sampling scheme, StreaMRAK reduces memory and computational complexities by creating a sketch of the original data, where the sub-sampling density is adapted to the bandwidth of the kernel and the local dimensionality of the data. We present a showcase study on two synthetic problems and the prediction of the trajectory of a double pendulum. The results show that the proposed algorithm is fast and accurate.

Subjects:	Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
MSC classes:	68Q32, 65D15, 46E22, 68W27
Cite as:	arXiv:2108.10411 [cs.LG]
	(or arXiv:2108.10411v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2108.10411

Submission history

From: Andreas Oslandsbotn [view email]
[v1] Mon, 23 Aug 2021 21:03:09 UTC (22,328 KB)
[v2] Tue, 7 Sep 2021 21:25:02 UTC (22,329 KB)

Computer Science > Machine Learning

Title:StreaMRAK a Streaming Multi-Resolution Adaptive Kernel Algorithm

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:StreaMRAK a Streaming Multi-Resolution Adaptive Kernel Algorithm

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators