Cargando…
A phase transition for finding needles in nonlinear haystacks with LASSO artificial neural networks
To fit sparse linear associations, a LASSO sparsity inducing penalty with a single hyperparameter provably allows to recover the important features (needles) with high probability in certain regimes even if the sample size is smaller than the dimension of the input vector (haystack). More recently l...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9587964/ https://www.ncbi.nlm.nih.gov/pubmed/36299529 http://dx.doi.org/10.1007/s11222-022-10169-0 |
_version_ | 1784814020836982784 |
---|---|
author | Ma, Xiaoyu Sardy, Sylvain Hengartner, Nick Bobenko, Nikolai Lin, Yen Ting |
author_facet | Ma, Xiaoyu Sardy, Sylvain Hengartner, Nick Bobenko, Nikolai Lin, Yen Ting |
author_sort | Ma, Xiaoyu |
collection | PubMed |
description | To fit sparse linear associations, a LASSO sparsity inducing penalty with a single hyperparameter provably allows to recover the important features (needles) with high probability in certain regimes even if the sample size is smaller than the dimension of the input vector (haystack). More recently learners known as artificial neural networks (ANN) have shown great successes in many machine learning tasks, in particular fitting nonlinear associations. Small learning rate, stochastic gradient descent algorithm and large training set help to cope with the explosion in the number of parameters present in deep neural networks. Yet few ANN learners have been developed and studied to find needles in nonlinear haystacks. Driven by a single hyperparameter, our ANN learner, like for sparse linear associations, exhibits a phase transition in the probability of retrieving the needles, which we do not observe with other ANN learners. To select our penalty parameter, we generalize the universal threshold of Donoho and Johnstone (Biometrika 81(3):425–455, 1994) which is a better rule than the conservative (too many false detections) and expensive cross-validation. In the spirit of simulated annealing, we propose a warm-start sparsity inducing algorithm to solve the high-dimensional, non-convex and non-differentiable optimization problem. We perform simulated and real data Monte Carlo experiments to quantify the effectiveness of our approach. |
format | Online Article Text |
id | pubmed-9587964 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-95879642022-10-24 A phase transition for finding needles in nonlinear haystacks with LASSO artificial neural networks Ma, Xiaoyu Sardy, Sylvain Hengartner, Nick Bobenko, Nikolai Lin, Yen Ting Stat Comput OriginalPaper To fit sparse linear associations, a LASSO sparsity inducing penalty with a single hyperparameter provably allows to recover the important features (needles) with high probability in certain regimes even if the sample size is smaller than the dimension of the input vector (haystack). More recently learners known as artificial neural networks (ANN) have shown great successes in many machine learning tasks, in particular fitting nonlinear associations. Small learning rate, stochastic gradient descent algorithm and large training set help to cope with the explosion in the number of parameters present in deep neural networks. Yet few ANN learners have been developed and studied to find needles in nonlinear haystacks. Driven by a single hyperparameter, our ANN learner, like for sparse linear associations, exhibits a phase transition in the probability of retrieving the needles, which we do not observe with other ANN learners. To select our penalty parameter, we generalize the universal threshold of Donoho and Johnstone (Biometrika 81(3):425–455, 1994) which is a better rule than the conservative (too many false detections) and expensive cross-validation. In the spirit of simulated annealing, we propose a warm-start sparsity inducing algorithm to solve the high-dimensional, non-convex and non-differentiable optimization problem. We perform simulated and real data Monte Carlo experiments to quantify the effectiveness of our approach. Springer US 2022-10-22 2022 /pmc/articles/PMC9587964/ /pubmed/36299529 http://dx.doi.org/10.1007/s11222-022-10169-0 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | OriginalPaper Ma, Xiaoyu Sardy, Sylvain Hengartner, Nick Bobenko, Nikolai Lin, Yen Ting A phase transition for finding needles in nonlinear haystacks with LASSO artificial neural networks |
title | A phase transition for finding needles in nonlinear haystacks with LASSO artificial neural networks |
title_full | A phase transition for finding needles in nonlinear haystacks with LASSO artificial neural networks |
title_fullStr | A phase transition for finding needles in nonlinear haystacks with LASSO artificial neural networks |
title_full_unstemmed | A phase transition for finding needles in nonlinear haystacks with LASSO artificial neural networks |
title_short | A phase transition for finding needles in nonlinear haystacks with LASSO artificial neural networks |
title_sort | phase transition for finding needles in nonlinear haystacks with lasso artificial neural networks |
topic | OriginalPaper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9587964/ https://www.ncbi.nlm.nih.gov/pubmed/36299529 http://dx.doi.org/10.1007/s11222-022-10169-0 |
work_keys_str_mv | AT maxiaoyu aphasetransitionforfindingneedlesinnonlinearhaystackswithlassoartificialneuralnetworks AT sardysylvain aphasetransitionforfindingneedlesinnonlinearhaystackswithlassoartificialneuralnetworks AT hengartnernick aphasetransitionforfindingneedlesinnonlinearhaystackswithlassoartificialneuralnetworks AT bobenkonikolai aphasetransitionforfindingneedlesinnonlinearhaystackswithlassoartificialneuralnetworks AT linyenting aphasetransitionforfindingneedlesinnonlinearhaystackswithlassoartificialneuralnetworks AT maxiaoyu phasetransitionforfindingneedlesinnonlinearhaystackswithlassoartificialneuralnetworks AT sardysylvain phasetransitionforfindingneedlesinnonlinearhaystackswithlassoartificialneuralnetworks AT hengartnernick phasetransitionforfindingneedlesinnonlinearhaystackswithlassoartificialneuralnetworks AT bobenkonikolai phasetransitionforfindingneedlesinnonlinearhaystackswithlassoartificialneuralnetworks AT linyenting phasetransitionforfindingneedlesinnonlinearhaystackswithlassoartificialneuralnetworks |