Cargando…

An adaptive shortest-solution guided decimation approach to sparse high-dimensional linear regression

High-dimensional linear regression model is the most popular statistical model for high-dimensional data, but it is quite a challenging task to achieve a sparse set of regression coefficients. In this paper, we propose a simple heuristic algorithm to construct sparse high-dimensional linear regressi...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Xue, Sun, Yifan, Zhou, Hai-Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8674299/
https://www.ncbi.nlm.nih.gov/pubmed/34911986
http://dx.doi.org/10.1038/s41598-021-03323-7
_version_ 1784615621378441216
author Yu, Xue
Sun, Yifan
Zhou, Hai-Jun
author_facet Yu, Xue
Sun, Yifan
Zhou, Hai-Jun
author_sort Yu, Xue
collection PubMed
description High-dimensional linear regression model is the most popular statistical model for high-dimensional data, but it is quite a challenging task to achieve a sparse set of regression coefficients. In this paper, we propose a simple heuristic algorithm to construct sparse high-dimensional linear regression models, which is adapted from the shortest-solution guided decimation algorithm and is referred to as ASSD. This algorithm constructs the support of regression coefficients under the guidance of the shortest least-squares solution of the recursively decimated linear models, and it applies an early-stopping criterion and a second-stage thresholding procedure to refine this support. Our extensive numerical results demonstrate that ASSD outperforms LASSO, adaptive LASSO, vector approximate message passing, and two other representative greedy algorithms in solution accuracy and robustness. ASSD is especially suitable for linear regression problems with highly correlated measurement matrices encountered in real-world applications.
format Online
Article
Text
id pubmed-8674299
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-86742992021-12-16 An adaptive shortest-solution guided decimation approach to sparse high-dimensional linear regression Yu, Xue Sun, Yifan Zhou, Hai-Jun Sci Rep Article High-dimensional linear regression model is the most popular statistical model for high-dimensional data, but it is quite a challenging task to achieve a sparse set of regression coefficients. In this paper, we propose a simple heuristic algorithm to construct sparse high-dimensional linear regression models, which is adapted from the shortest-solution guided decimation algorithm and is referred to as ASSD. This algorithm constructs the support of regression coefficients under the guidance of the shortest least-squares solution of the recursively decimated linear models, and it applies an early-stopping criterion and a second-stage thresholding procedure to refine this support. Our extensive numerical results demonstrate that ASSD outperforms LASSO, adaptive LASSO, vector approximate message passing, and two other representative greedy algorithms in solution accuracy and robustness. ASSD is especially suitable for linear regression problems with highly correlated measurement matrices encountered in real-world applications. Nature Publishing Group UK 2021-12-15 /pmc/articles/PMC8674299/ /pubmed/34911986 http://dx.doi.org/10.1038/s41598-021-03323-7 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Yu, Xue
Sun, Yifan
Zhou, Hai-Jun
An adaptive shortest-solution guided decimation approach to sparse high-dimensional linear regression
title An adaptive shortest-solution guided decimation approach to sparse high-dimensional linear regression
title_full An adaptive shortest-solution guided decimation approach to sparse high-dimensional linear regression
title_fullStr An adaptive shortest-solution guided decimation approach to sparse high-dimensional linear regression
title_full_unstemmed An adaptive shortest-solution guided decimation approach to sparse high-dimensional linear regression
title_short An adaptive shortest-solution guided decimation approach to sparse high-dimensional linear regression
title_sort adaptive shortest-solution guided decimation approach to sparse high-dimensional linear regression
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8674299/
https://www.ncbi.nlm.nih.gov/pubmed/34911986
http://dx.doi.org/10.1038/s41598-021-03323-7
work_keys_str_mv AT yuxue anadaptiveshortestsolutionguideddecimationapproachtosparsehighdimensionallinearregression
AT sunyifan anadaptiveshortestsolutionguideddecimationapproachtosparsehighdimensionallinearregression
AT zhouhaijun anadaptiveshortestsolutionguideddecimationapproachtosparsehighdimensionallinearregression
AT yuxue adaptiveshortestsolutionguideddecimationapproachtosparsehighdimensionallinearregression
AT sunyifan adaptiveshortestsolutionguideddecimationapproachtosparsehighdimensionallinearregression
AT zhouhaijun adaptiveshortestsolutionguideddecimationapproachtosparsehighdimensionallinearregression