Adaptive piecewise polynomial estimation via trend filtering

Tibshirani, Ryan J.

doi:10.1214/13-AOS1189

arXiv:1304.2986 (math)

[Submitted on 10 Apr 2013 (v1), last revised 21 Mar 2014 (this version, v2)]

Title: Adaptive piecewise polynomial estimation via trend filtering

Authors: Ryan J. Tibshirani

Abstract: We study trend filtering, a recently proposed tool of Kim et al. [SIAM Rev. 51 (2009) 339-360] for nonparametric regression. The trend filtering estimate is defined as the minimizer of a penalized least squares criterion, in which the penalty term sums the absolute $k$th order discrete derivatives over the input points. Perhaps not surprisingly, trend filtering estimates appear to have the structure of $k$th degree spline functions, with adaptively chosen knot points (we say "appear" here as trend filtering estimates are not really functions over continuous domains, and are only defined over the discrete set of inputs). This brings to mind comparisons to other nonparametric regression tools that also produce adaptive splines; in particular, we compare trend filtering to smoothing splines, which penalize the sum of squared derivatives across input points, and to locally adaptive regression splines [Ann. Statist. 25 (1997) 387-413], which penalize the total variation of the $k$th derivative. Empirically, we discover that trend filtering estimates adapt to the local level of smoothness much better than smoothing splines, and further, they exhibit a remarkable similarity to locally adaptive regression splines. We also provide theoretical support for these empirical findings; most notably, we prove that (with the right choice of tuning parameter) the trend filtering estimate converges to the true underlying function at the minimax rate for functions whose $k$th derivative is of bounded variation. This is done via an asymptotic pairing of trend filtering and locally adaptive regression splines, which have already been shown to converge at the minimax rate [Ann. Statist. 25 (1997) 387-413]. At the core of this argument is a new result tying together the fitted values of two lasso problems that share the same outcome vector, but have different predictor matrices.
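
To make the penalized least squares criterion in the abstract concrete, here is a minimal sketch in plain Python. It is an illustration, not the paper's implementation. It assumes evenly spaced inputs and unscaled forward differences. The names `discrete_differences`, `trend_filtering_criterion`, `lam` and `order` are hypothetical, and the paper's own indexing of the difference order and scaling of the penalty may differ. The sketch only evaluates the criterion at a candidate fit and does not minimize it.

```python
def discrete_differences(values, order):
    """Apply the forward difference operator `order` times (even spacing assumed)."""
    diffs = list(values)
    for _ in range(order):
        diffs = [b - a for a, b in zip(diffs, diffs[1:])]
    return diffs


def trend_filtering_criterion(y, beta, lam, order):
    """0.5 * sum_i (y_i - beta_i)^2 + lam * sum_j |order-th difference of beta|_j."""
    fit = 0.5 * sum((yi - bi) ** 2 for yi, bi in zip(y, beta))
    penalty = sum(abs(d) for d in discrete_differences(beta, order))
    return fit + lam * penalty


# A piecewise linear sequence with a single kink at index 4.
beta = [0, 1, 2, 3, 4, 3, 2, 1, 0]

# Second differences vanish on each linear piece and are nonzero only at the kink.
second_diffs = discrete_differences(beta, 2)

# With y equal to beta the squared-error term is zero, so only the penalty remains.
value = trend_filtering_criterion(beta, beta, lam=0.5, order=2)
```

Because the penalty is a sum of absolute values, it favors fits whose higher-order differences are mostly zero. Nonzero differences then mark the adaptively chosen knot points that the abstract describes.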

Comments:	Published in the Annals of Statistics by the Institute of Mathematical Statistics
Subjects:	Statistics Theory (math.ST); Methodology (stat.ME); Machine Learning (stat.ML)
Report number:	IMS-AOS-AOS1189
Cite as:	arXiv:1304.2986 [math.ST]
	(or arXiv:1304.2986v2 [math.ST] for this version)
	https://doi.org/10.48550/arXiv.1304.2986
Journal reference:	Annals of Statistics 2014, Vol. 42, No. 1, 285-323
Related DOI:	https://doi.org/10.1214/13-AOS1189

Submission history

From: Ryan J. Tibshirani [via VTEX proxy]
[v1] Wed, 10 Apr 2013 15:02:53 UTC (243 KB)
[v2] Fri, 21 Mar 2014 13:42:10 UTC (2,133 KB)
