Cargando…

The lasso for high dimensional regression with a possible change point

We consider a high dimensional regression model with a possible change point due to a covariate threshold and develop the lasso estimator of regression coefficients as well as the threshold parameter. Our lasso estimator not only selects covariates but also selects a model between linear and thresho...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Sokbae, Seo, Myung Hwan, Shin, Youngki
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5014306/
https://www.ncbi.nlm.nih.gov/pubmed/27656104
http://dx.doi.org/10.1111/rssb.12108
_version_ 1782452270634369024
author Lee, Sokbae
Seo, Myung Hwan
Shin, Youngki
author_facet Lee, Sokbae
Seo, Myung Hwan
Shin, Youngki
author_sort Lee, Sokbae
collection PubMed
description We consider a high dimensional regression model with a possible change point due to a covariate threshold and develop the lasso estimator of regression coefficients as well as the threshold parameter. Our lasso estimator not only selects covariates but also selects a model between linear and threshold regression models. Under a sparsity assumption, we derive non‐asymptotic oracle inequalities for both the prediction risk and the [Formula: see text] ‐estimation loss for regression coefficients. Since the lasso estimator selects variables simultaneously, we show that oracle inequalities can be established without pretesting the existence of the threshold effect. Furthermore, we establish conditions under which the estimation error of the unknown threshold parameter can be bounded by a factor that is nearly [Formula: see text] even when the number of regressors can be much larger than the sample size n. We illustrate the usefulness of our proposed estimation method via Monte Carlo simulations and an application to real data.
format Online
Article
Text
id pubmed-5014306
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-50143062016-09-19 The lasso for high dimensional regression with a possible change point Lee, Sokbae Seo, Myung Hwan Shin, Youngki J R Stat Soc Series B Stat Methodol Original Articles We consider a high dimensional regression model with a possible change point due to a covariate threshold and develop the lasso estimator of regression coefficients as well as the threshold parameter. Our lasso estimator not only selects covariates but also selects a model between linear and threshold regression models. Under a sparsity assumption, we derive non‐asymptotic oracle inequalities for both the prediction risk and the [Formula: see text] ‐estimation loss for regression coefficients. Since the lasso estimator selects variables simultaneously, we show that oracle inequalities can be established without pretesting the existence of the threshold effect. Furthermore, we establish conditions under which the estimation error of the unknown threshold parameter can be bounded by a factor that is nearly [Formula: see text] even when the number of regressors can be much larger than the sample size n. We illustrate the usefulness of our proposed estimation method via Monte Carlo simulations and an application to real data. John Wiley and Sons Inc. 2015-02-15 2016-01 /pmc/articles/PMC5014306/ /pubmed/27656104 http://dx.doi.org/10.1111/rssb.12108 Text en © 2015 The Authors Journal of the Royal Statistical Society: Series B (Statistics in Society) Published by John Wiley & Sons Ltd on behalf of the Royal Statistical Society. This is an open access article under the terms of the Creative Commons Attribution‐NonCommercial (http://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Original Articles
Lee, Sokbae
Seo, Myung Hwan
Shin, Youngki
The lasso for high dimensional regression with a possible change point
title The lasso for high dimensional regression with a possible change point
title_full The lasso for high dimensional regression with a possible change point
title_fullStr The lasso for high dimensional regression with a possible change point
title_full_unstemmed The lasso for high dimensional regression with a possible change point
title_short The lasso for high dimensional regression with a possible change point
title_sort lasso for high dimensional regression with a possible change point
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5014306/
https://www.ncbi.nlm.nih.gov/pubmed/27656104
http://dx.doi.org/10.1111/rssb.12108
work_keys_str_mv AT leesokbae thelassoforhighdimensionalregressionwithapossiblechangepoint
AT seomyunghwan thelassoforhighdimensionalregressionwithapossiblechangepoint
AT shinyoungki thelassoforhighdimensionalregressionwithapossiblechangepoint
AT leesokbae lassoforhighdimensionalregressionwithapossiblechangepoint
AT seomyunghwan lassoforhighdimensionalregressionwithapossiblechangepoint
AT shinyoungki lassoforhighdimensionalregressionwithapossiblechangepoint