Cargando…
The lasso for high dimensional regression with a possible change point
We consider a high dimensional regression model with a possible change point due to a covariate threshold and develop the lasso estimator of regression coefficients as well as the threshold parameter. Our lasso estimator not only selects covariates but also selects a model between linear and thresho...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5014306/ https://www.ncbi.nlm.nih.gov/pubmed/27656104 http://dx.doi.org/10.1111/rssb.12108 |
_version_ | 1782452270634369024 |
---|---|
author | Lee, Sokbae Seo, Myung Hwan Shin, Youngki |
author_facet | Lee, Sokbae Seo, Myung Hwan Shin, Youngki |
author_sort | Lee, Sokbae |
collection | PubMed |
description | We consider a high dimensional regression model with a possible change point due to a covariate threshold and develop the lasso estimator of regression coefficients as well as the threshold parameter. Our lasso estimator not only selects covariates but also selects a model between linear and threshold regression models. Under a sparsity assumption, we derive non‐asymptotic oracle inequalities for both the prediction risk and the [Formula: see text] ‐estimation loss for regression coefficients. Since the lasso estimator selects variables simultaneously, we show that oracle inequalities can be established without pretesting the existence of the threshold effect. Furthermore, we establish conditions under which the estimation error of the unknown threshold parameter can be bounded by a factor that is nearly [Formula: see text] even when the number of regressors can be much larger than the sample size n. We illustrate the usefulness of our proposed estimation method via Monte Carlo simulations and an application to real data. |
format | Online Article Text |
id | pubmed-5014306 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-50143062016-09-19 The lasso for high dimensional regression with a possible change point Lee, Sokbae Seo, Myung Hwan Shin, Youngki J R Stat Soc Series B Stat Methodol Original Articles We consider a high dimensional regression model with a possible change point due to a covariate threshold and develop the lasso estimator of regression coefficients as well as the threshold parameter. Our lasso estimator not only selects covariates but also selects a model between linear and threshold regression models. Under a sparsity assumption, we derive non‐asymptotic oracle inequalities for both the prediction risk and the [Formula: see text] ‐estimation loss for regression coefficients. Since the lasso estimator selects variables simultaneously, we show that oracle inequalities can be established without pretesting the existence of the threshold effect. Furthermore, we establish conditions under which the estimation error of the unknown threshold parameter can be bounded by a factor that is nearly [Formula: see text] even when the number of regressors can be much larger than the sample size n. We illustrate the usefulness of our proposed estimation method via Monte Carlo simulations and an application to real data. John Wiley and Sons Inc. 2015-02-15 2016-01 /pmc/articles/PMC5014306/ /pubmed/27656104 http://dx.doi.org/10.1111/rssb.12108 Text en © 2015 The Authors Journal of the Royal Statistical Society: Series B (Statistics in Society) Published by John Wiley & Sons Ltd on behalf of the Royal Statistical Society. This is an open access article under the terms of the Creative Commons Attribution‐NonCommercial (http://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes. |
spellingShingle | Original Articles Lee, Sokbae Seo, Myung Hwan Shin, Youngki The lasso for high dimensional regression with a possible change point |
title | The lasso for high dimensional regression with a possible change point |
title_full | The lasso for high dimensional regression with a possible change point |
title_fullStr | The lasso for high dimensional regression with a possible change point |
title_full_unstemmed | The lasso for high dimensional regression with a possible change point |
title_short | The lasso for high dimensional regression with a possible change point |
title_sort | lasso for high dimensional regression with a possible change point |
topic | Original Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5014306/ https://www.ncbi.nlm.nih.gov/pubmed/27656104 http://dx.doi.org/10.1111/rssb.12108 |
work_keys_str_mv | AT leesokbae thelassoforhighdimensionalregressionwithapossiblechangepoint AT seomyunghwan thelassoforhighdimensionalregressionwithapossiblechangepoint AT shinyoungki thelassoforhighdimensionalregressionwithapossiblechangepoint AT leesokbae lassoforhighdimensionalregressionwithapossiblechangepoint AT seomyunghwan lassoforhighdimensionalregressionwithapossiblechangepoint AT shinyoungki lassoforhighdimensionalregressionwithapossiblechangepoint |