CoCoLasso for High-dimensional Error-in-variables Regression

Datta, Abhirup; Zou, Hui

Mathematics > Statistics Theory

arXiv:1510.07123 (math)

[Submitted on 24 Oct 2015 (v1), last revised 1 Jan 2016 (this version, v2)]

Title:CoCoLasso for High-dimensional Error-in-variables Regression

Authors:Abhirup Datta, Hui Zou

View PDF

Abstract:Much theoretical and applied work has been devoted to high-dimensional regression with clean data. However, we often face corrupted data in many applications where missing data and measurement errors cannot be ignored. Loh and Wainwright (2012) proposed a non-convex modification of the Lasso for doing high-dimensional regression with noisy and missing data. It is generally agreed that the virtues of convexity contribute fundamentally the success and popularity of the Lasso. In light of this, we propose a new method named CoCoLasso that is convex and can handle a general class of corrupted datasets including the cases of additive measurement error and random missing data. We establish the estimation error bounds of CoCoLasso and its asymptotic sign-consistent selection property. We further elucidate how the standard cross validation techniques can be misleading in presence of measurement error and develop a novel corrected cross-validation technique by using the basic idea in CoCoLasso. The corrected cross-validation has its own importance. We demonstrate the superior performance of our method over the non-convex approach by simulation studies.

Subjects:	Statistics Theory (math.ST)
Cite as:	arXiv:1510.07123 [math.ST]
	(or arXiv:1510.07123v2 [math.ST] for this version)
	https://doi.org/10.48550/arXiv.1510.07123

Submission history

From: Abhirup Datta [view email]
[v1] Sat, 24 Oct 2015 09:50:11 UTC (24 KB)
[v2] Fri, 1 Jan 2016 06:40:45 UTC (74 KB)

Mathematics > Statistics Theory

Title:CoCoLasso for High-dimensional Error-in-variables Regression

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Statistics Theory

Title:CoCoLasso for High-dimensional Error-in-variables Regression

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators