Cargando…

Regularization approaches in clinical biostatistics: A review of methods and their applications

A range of regularization approaches have been proposed in the data sciences to overcome overfitting, to exploit sparsity or to improve prediction. Using a broad definition of regularization, namely controlling model complexity by adding information in order to solve ill-posed problems or to prevent...

Descripción completa

Detalles Bibliográficos
Autores principales: Friedrich, Sarah, Groll, Andreas, Ickstadt, Katja, Kneib, Thomas, Pauly, Markus, Rahnenführer, Jörg, Friede, Tim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9896544/
https://www.ncbi.nlm.nih.gov/pubmed/36384320
http://dx.doi.org/10.1177/09622802221133557
_version_ 1784882074139754496
author Friedrich, Sarah
Groll, Andreas
Ickstadt, Katja
Kneib, Thomas
Pauly, Markus
Rahnenführer, Jörg
Friede, Tim
author_facet Friedrich, Sarah
Groll, Andreas
Ickstadt, Katja
Kneib, Thomas
Pauly, Markus
Rahnenführer, Jörg
Friede, Tim
author_sort Friedrich, Sarah
collection PubMed
description A range of regularization approaches have been proposed in the data sciences to overcome overfitting, to exploit sparsity or to improve prediction. Using a broad definition of regularization, namely controlling model complexity by adding information in order to solve ill-posed problems or to prevent overfitting, we review a range of approaches within this framework including penalization, early stopping, ensembling and model averaging. Aspects of their practical implementation are discussed including available R-packages and examples are provided. To assess the extent to which these approaches are used in medicine, we conducted a review of three general medical journals. It revealed that regularization approaches are rarely applied in practical clinical applications, with the exception of random effects models. Hence, we suggest a more frequent use of regularization approaches in medical research. In situations where also other approaches work well, the only downside of the regularization approaches is increased complexity in the conduct of the analyses which can pose challenges in terms of computational resources and expertise on the side of the data analyst. In our view, both can and should be overcome by investments in appropriate computing facilities and educational resources.
format Online
Article
Text
id pubmed-9896544
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-98965442023-02-04 Regularization approaches in clinical biostatistics: A review of methods and their applications Friedrich, Sarah Groll, Andreas Ickstadt, Katja Kneib, Thomas Pauly, Markus Rahnenführer, Jörg Friede, Tim Stat Methods Med Res Review Article A range of regularization approaches have been proposed in the data sciences to overcome overfitting, to exploit sparsity or to improve prediction. Using a broad definition of regularization, namely controlling model complexity by adding information in order to solve ill-posed problems or to prevent overfitting, we review a range of approaches within this framework including penalization, early stopping, ensembling and model averaging. Aspects of their practical implementation are discussed including available R-packages and examples are provided. To assess the extent to which these approaches are used in medicine, we conducted a review of three general medical journals. It revealed that regularization approaches are rarely applied in practical clinical applications, with the exception of random effects models. Hence, we suggest a more frequent use of regularization approaches in medical research. In situations where also other approaches work well, the only downside of the regularization approaches is increased complexity in the conduct of the analyses which can pose challenges in terms of computational resources and expertise on the side of the data analyst. In our view, both can and should be overcome by investments in appropriate computing facilities and educational resources. SAGE Publications 2022-11-16 2023-02 /pmc/articles/PMC9896544/ /pubmed/36384320 http://dx.doi.org/10.1177/09622802221133557 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Review Article
Friedrich, Sarah
Groll, Andreas
Ickstadt, Katja
Kneib, Thomas
Pauly, Markus
Rahnenführer, Jörg
Friede, Tim
Regularization approaches in clinical biostatistics: A review of methods and their applications
title Regularization approaches in clinical biostatistics: A review of methods and their applications
title_full Regularization approaches in clinical biostatistics: A review of methods and their applications
title_fullStr Regularization approaches in clinical biostatistics: A review of methods and their applications
title_full_unstemmed Regularization approaches in clinical biostatistics: A review of methods and their applications
title_short Regularization approaches in clinical biostatistics: A review of methods and their applications
title_sort regularization approaches in clinical biostatistics: a review of methods and their applications
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9896544/
https://www.ncbi.nlm.nih.gov/pubmed/36384320
http://dx.doi.org/10.1177/09622802221133557
work_keys_str_mv AT friedrichsarah regularizationapproachesinclinicalbiostatisticsareviewofmethodsandtheirapplications
AT grollandreas regularizationapproachesinclinicalbiostatisticsareviewofmethodsandtheirapplications
AT ickstadtkatja regularizationapproachesinclinicalbiostatisticsareviewofmethodsandtheirapplications
AT kneibthomas regularizationapproachesinclinicalbiostatisticsareviewofmethodsandtheirapplications
AT paulymarkus regularizationapproachesinclinicalbiostatisticsareviewofmethodsandtheirapplications
AT rahnenfuhrerjorg regularizationapproachesinclinicalbiostatisticsareviewofmethodsandtheirapplications
AT friedetim regularizationapproachesinclinicalbiostatisticsareviewofmethodsandtheirapplications