Feature Selection for Ridge Regression with Provable Guarantees

Paul, Saurabh; Drineas, Petros

Statistics > Machine Learning

arXiv:1506.05173 (stat)

[Submitted on 17 Jun 2015 (v1), last revised 5 Dec 2015 (this version, v2)]

Title:Feature Selection for Ridge Regression with Provable Guarantees

Authors:Saurabh Paul, Petros Drineas

View PDF

Abstract:We introduce single-set spectral sparsification as a deterministic sampling based feature selection technique for regularized least squares classification, which is the classification analogue to ridge regression. The method is unsupervised and gives worst-case guarantees of the generalization power of the classification function after feature selection with respect to the classification function obtained using all features. We also introduce leverage-score sampling as an unsupervised randomized feature selection method for ridge regression. We provide risk bounds for both single-set spectral sparsification and leverage-score sampling on ridge regression in the fixed design setting and show that the risk in the sampled space is comparable to the risk in the full-feature space. We perform experiments on synthetic and real-world datasets, namely a subset of TechTC-300 datasets, to support our theory. Experimental results indicate that the proposed methods perform better than the existing feature selection methods.

Comments:	To appear in Neural Computation. A shorter version of this paper appeared at ECML-PKDD 2014 under the title "Deterministic Feature Selection for Regularized Least Squares Classification."
Subjects:	Machine Learning (stat.ML); Information Theory (cs.IT); Machine Learning (cs.LG)
Cite as:	arXiv:1506.05173 [stat.ML]
	(or arXiv:1506.05173v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1506.05173

Submission history

From: Saurabh Paul [view email]
[v1] Wed, 17 Jun 2015 00:05:04 UTC (607 KB)
[v2] Sat, 5 Dec 2015 18:27:38 UTC (611 KB)

Statistics > Machine Learning

Title:Feature Selection for Ridge Regression with Provable Guarantees

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Feature Selection for Ridge Regression with Provable Guarantees

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators