Gradient-based Sampling: An Adaptive Importance Sampling for Least-squares

Zhu, Rong

Statistics > Machine Learning

arXiv:1803.00841 (stat)

[Submitted on 2 Mar 2018]

Title:Gradient-based Sampling: An Adaptive Importance Sampling for Least-squares

Authors:Rong Zhu

View PDF

Abstract:In modern data analysis, random sampling is an efficient and widely-used strategy to overcome the computational difficulties brought by large sample size. In previous studies, researchers conducted random sampling which is according to the input data but independent on the response variable, however the response variable may also be informative for sampling. In this paper we propose an adaptive sampling called the gradient-based sampling which is dependent on both the input data and the output for fast solving of least-square (LS) problems. We draw the data points by random sampling from the full data according to their gradient values. This sampling is computationally saving, since the running time of computing the sampling probabilities is reduced to O(nd) where n is the full sample size and d is the dimension of the input. Theoretically, we establish an error bound analysis of the general importance sampling with respect to LS solution from full data. The result establishes an improved performance of the use of our gradient- based sampling. Synthetic and real data sets are used to empirically argue that the gradient-based sampling has an obvious advantage over existing sampling methods from two aspects of statistical efficiency and computational saving.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1803.00841 [stat.ML]
	(or arXiv:1803.00841v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1803.00841
Journal reference:	30th Conference on Neural Information Processing Systems (NIPS 2016), Barcelona, Spain

Submission history

From: Rong Zhu [view email]
[v1] Fri, 2 Mar 2018 13:34:09 UTC (81 KB)

Statistics > Machine Learning

Title:Gradient-based Sampling: An Adaptive Importance Sampling for Least-squares

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Gradient-based Sampling: An Adaptive Importance Sampling for Least-squares

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators