Post-L1-Penalized Estimators in High-Dimensional Linear Regression Models

Belloni, Alexandre; Chernozhukov, Victor


arXiv:1001.0188v1 (math)

[Submitted on 31 Dec 2009 (this version), latest version 20 Mar 2013 (v5)]


Abstract: In this paper we study the post-penalized estimator, which applies ordinary, unpenalized linear regression to the model selected by a first-step penalized estimator, typically the LASSO. We show that post-LASSO can perform as well or nearly as well as the LASSO in terms of the rate of convergence. We show that this performance occurs even if the LASSO-based model selection "fails", in the sense of missing some components of the "true" regression model. Furthermore, post-LASSO can perform strictly better than LASSO, in the sense of a strictly faster rate of convergence, if the LASSO-based model selection correctly includes all components of the "true" model as a subset and enough sparsity is obtained. Of course, in the extreme case, when LASSO perfectly selects the true model, the post-LASSO estimator becomes the oracle estimator. We show that the results hold in both parametric and non-parametric models; by the "true" model we mean the best $s$-dimensional approximation to the true regression model, where the dimension $s$ can be chosen to maximize the rate of convergence of the LASSO or post-LASSO estimators. Moreover, our analysis is not limited to the LASSO estimator in the first step; it also applies to other estimators, for example the trimmed LASSO or the Dantzig selector. Our analysis also highlights the importance of the sparsity induced by the first-step estimators. This motivated us to also study the impact of trimming small components of the initial estimator to achieve a sparser support for the post-LASSO. Our analysis covers both traditional trimming and a new, practical, completely data-driven trimming scheme that induces maximal sparsity subject to maintaining a certain goodness-of-fit.
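The two-step procedure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `lasso_cd` and `post_lasso`, the objective scaling $\frac{1}{2n}\|y - X\beta\|_2^2 + \lambda\|\beta\|_1$, the penalty level `lam = 0.2`, the coordinate-descent solver, and the simulated data are all assumptions chosen for illustration. The optional `trim` argument mimics traditional trimming by dropping first-step coefficients whose magnitude does not exceed a fixed threshold; the data-driven trimming scheme mentioned in the abstract is not reproduced here.

```python
import numpy as np


def lasso_cd(X, y, lam, n_iter=200):
    """Coordinate descent for (1/(2n))*||y - X b||^2 + lam*||b||_1 (illustrative solver)."""
    n, p = X.shape
    beta = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n      # (1/n) * ||X_j||^2 for each column
    resid = y.astype(float)                # residual y - X @ beta, with beta = 0 initially
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Correlation of column j with the partial residual (column j's effect added back).
            rho = X[:, j] @ resid / n + col_sq[j] * beta[j]
            # Soft-thresholding update for coordinate j.
            new = np.sign(rho) * max(abs(rho) - lam, 0.0) / col_sq[j]
            resid += X[:, j] * (beta[j] - new)
            beta[j] = new
    return beta


def post_lasso(X, y, lam, trim=0.0):
    """First step: LASSO. Second step: unpenalized least squares on the selected columns."""
    beta_lasso = lasso_cd(X, y, lam)
    # Selected model: coefficients with magnitude strictly above the trimming threshold.
    support = np.flatnonzero(np.abs(beta_lasso) > trim)
    beta_post = np.zeros(X.shape[1])
    if support.size:
        coef, *_ = np.linalg.lstsq(X[:, support], y, rcond=None)
        beta_post[support] = coef
    return beta_lasso, beta_post, support


# Simulated sparse design (assumed for illustration only).
rng = np.random.default_rng(0)
n, p = 100, 50
X = rng.standard_normal((n, p))
beta_true = np.zeros(p)
beta_true[:3] = [3.0, -2.0, 1.5]
y = X @ beta_true + 0.5 * rng.standard_normal(n)

beta_lasso, beta_post, support = post_lasso(X, y, lam=0.2)
rss_lasso = float(np.sum((y - X @ beta_lasso) ** 2))
rss_post = float(np.sum((y - X @ beta_post) ** 2))
print("selected columns:", support)
print("in-sample RSS, LASSO vs post-LASSO:", rss_lasso, rss_post)
```

With `trim=0.0` the refit cannot have a larger in-sample residual sum of squares than the LASSO fit, because least squares minimizes that quantity over all coefficient vectors supported on the selected columns. That is only a statement about in-sample fit; the convergence-rate results in the abstract concern estimation error and are not demonstrated by this sketch.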

Subjects:	Statistics Theory (math.ST); Probability (math.PR); Methodology (stat.ME)
Cite as:	arXiv:1001.0188 [math.ST]
	(or arXiv:1001.0188v1 [math.ST] for this version)
	https://doi.org/10.48550/arXiv.1001.0188

Submission history

From: Alexandre Belloni
[v1] Thu, 31 Dec 2009 22:10:59 UTC (18 KB)
[v2] Fri, 26 Mar 2010 14:43:22 UTC (439 KB)
[v3] Sat, 11 Jun 2011 22:56:57 UTC (419 KB)
[v4] Thu, 25 Aug 2011 02:30:15 UTC (419 KB)
[v5] Wed, 20 Mar 2013 12:16:15 UTC (97 KB)
