Cargando…

Computing Leapfrog Regularization Paths with Applications to Large-Scale K-mer Logistic Regression

High-dimensional statistics deals with statistical inference when the number of parameters or features p exceeds the number of observations n (i.e., [Formula: see text]). In this case, the parameter space must be constrained either by regularization or by selecting a small subset of [Formula: see te...

Descripción completa

Detalles Bibliográficos
Autor principal: Benner, Philipp
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Mary Ann Liebert, Inc., publishers 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8219187/
https://www.ncbi.nlm.nih.gov/pubmed/33739865
http://dx.doi.org/10.1089/cmb.2020.0284
_version_ 1783710881772732416
author Benner, Philipp
author_facet Benner, Philipp
author_sort Benner, Philipp
collection PubMed
description High-dimensional statistics deals with statistical inference when the number of parameters or features p exceeds the number of observations n (i.e., [Formula: see text]). In this case, the parameter space must be constrained either by regularization or by selecting a small subset of [Formula: see text] features. Feature selection through [Formula: see text]-regularization combines the benefits of both approaches and has proven to yield good results in practice. However, the functional relation between the regularization strength [Formula: see text] and the number of selected features m is difficult to determine. Hence, parameters are typically estimated for all possible regularization strengths [Formula: see text]. These so-called regularization paths can be expensive to compute and most solutions may not even be of interest to the problem at hand. As an alternative, an algorithm is proposed that determines the [Formula: see text]-regularization strength [Formula: see text] iteratively for a fixed m. The algorithm can be used to compute leapfrog regularization paths by subsequently increasing m.
format Online
Article
Text
id pubmed-8219187
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Mary Ann Liebert, Inc., publishers
record_format MEDLINE/PubMed
spelling pubmed-82191872021-06-23 Computing Leapfrog Regularization Paths with Applications to Large-Scale K-mer Logistic Regression Benner, Philipp J Comput Biol Research Articles High-dimensional statistics deals with statistical inference when the number of parameters or features p exceeds the number of observations n (i.e., [Formula: see text]). In this case, the parameter space must be constrained either by regularization or by selecting a small subset of [Formula: see text] features. Feature selection through [Formula: see text]-regularization combines the benefits of both approaches and has proven to yield good results in practice. However, the functional relation between the regularization strength [Formula: see text] and the number of selected features m is difficult to determine. Hence, parameters are typically estimated for all possible regularization strengths [Formula: see text]. These so-called regularization paths can be expensive to compute and most solutions may not even be of interest to the problem at hand. As an alternative, an algorithm is proposed that determines the [Formula: see text]-regularization strength [Formula: see text] iteratively for a fixed m. The algorithm can be used to compute leapfrog regularization paths by subsequently increasing m. Mary Ann Liebert, Inc., publishers 2021-06-01 2021-06-14 /pmc/articles/PMC8219187/ /pubmed/33739865 http://dx.doi.org/10.1089/cmb.2020.0284 Text en © Philipp Benner 2021; Published by Mary Ann Liebert, Inc. https://creativecommons.org/licenses/by/4.0/This Open Access article is distributed under the terms of the Creative Commons License [CC-BY] (http://creativecommons.org/licenses/by/4.0 (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Benner, Philipp
Computing Leapfrog Regularization Paths with Applications to Large-Scale K-mer Logistic Regression
title Computing Leapfrog Regularization Paths with Applications to Large-Scale K-mer Logistic Regression
title_full Computing Leapfrog Regularization Paths with Applications to Large-Scale K-mer Logistic Regression
title_fullStr Computing Leapfrog Regularization Paths with Applications to Large-Scale K-mer Logistic Regression
title_full_unstemmed Computing Leapfrog Regularization Paths with Applications to Large-Scale K-mer Logistic Regression
title_short Computing Leapfrog Regularization Paths with Applications to Large-Scale K-mer Logistic Regression
title_sort computing leapfrog regularization paths with applications to large-scale k-mer logistic regression
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8219187/
https://www.ncbi.nlm.nih.gov/pubmed/33739865
http://dx.doi.org/10.1089/cmb.2020.0284
work_keys_str_mv AT bennerphilipp computingleapfrogregularizationpathswithapplicationstolargescalekmerlogisticregression