High Dimensional Robust Sparse Regression

Liu, Liu; Shen, Yanyao; Li, Tianyang; Caramanis, Constantine

Computer Science > Machine Learning

arXiv:1805.11643 (cs)

[Submitted on 29 May 2018 (v1), last revised 29 May 2019 (this version, v3)]

Title:High Dimensional Robust Sparse Regression

Authors:Liu Liu, Yanyao Shen, Tianyang Li, Constantine Caramanis

View PDF

Abstract:We provide a novel -- and to the best of our knowledge, the first -- algorithm for high dimensional sparse regression with constant fraction of corruptions in explanatory and/or response variables. Our algorithm recovers the true sparse parameters with sub-linear sample complexity, in the presence of a constant fraction of arbitrary corruptions. Our main contribution is a robust variant of Iterative Hard Thresholding. Using this, we provide accurate estimators: when the covariance matrix in sparse regression is identity, our error guarantee is near information-theoretically optimal. We then deal with robust sparse regression with unknown structured covariance matrix. We propose a filtering algorithm which consists of a novel randomized outlier removal technique for robust sparse mean estimation that may be of interest in its own right: the filtering algorithm is flexible enough to deal with unknown covariance. Also, it is orderwise more efficient computationally than the ellipsoid algorithm. Using sub-linear sample complexity, our algorithm achieves the best known (and first) error guarantee. We demonstrate the effectiveness on large-scale sparse regression problems with arbitrary corruptions.

Subjects:	Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
Cite as:	arXiv:1805.11643 [cs.LG]
	(or arXiv:1805.11643v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1805.11643

Submission history

From: Liu Liu [view email]
[v1] Tue, 29 May 2018 18:33:23 UTC (357 KB)
[v2] Tue, 5 Feb 2019 06:04:34 UTC (480 KB)
[v3] Wed, 29 May 2019 19:34:15 UTC (474 KB)

Computer Science > Machine Learning

Title:High Dimensional Robust Sparse Regression

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:High Dimensional Robust Sparse Regression

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators