Cargando…

State of the art in selection of variables and functional forms in multivariable analysis—outstanding issues

BACKGROUND: How to select variables and identify functional forms for continuous variables is a key concern when creating a multivariable model. Ad hoc ‘traditional’ approaches to variable selection have been in use for at least 50 years. Similarly, methods for determining functional forms for conti...

Descripción completa

Detalles Bibliográficos
Autores principales: Sauerbrei, Willi, Perperoglou, Aris, Schmid, Matthias, Abrahamowicz, Michal, Becher, Heiko, Binder, Harald, Dunkler, Daniela, Harrell, Frank E., Royston, Patrick, Heinze, Georg
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7114804/
https://www.ncbi.nlm.nih.gov/pubmed/32266321
http://dx.doi.org/10.1186/s41512-020-00074-3
_version_ 1783513965646577664
author Sauerbrei, Willi
Perperoglou, Aris
Schmid, Matthias
Abrahamowicz, Michal
Becher, Heiko
Binder, Harald
Dunkler, Daniela
Harrell, Frank E.
Royston, Patrick
Heinze, Georg
author_facet Sauerbrei, Willi
Perperoglou, Aris
Schmid, Matthias
Abrahamowicz, Michal
Becher, Heiko
Binder, Harald
Dunkler, Daniela
Harrell, Frank E.
Royston, Patrick
Heinze, Georg
author_sort Sauerbrei, Willi
collection PubMed
description BACKGROUND: How to select variables and identify functional forms for continuous variables is a key concern when creating a multivariable model. Ad hoc ‘traditional’ approaches to variable selection have been in use for at least 50 years. Similarly, methods for determining functional forms for continuous variables were first suggested many years ago. More recently, many alternative approaches to address these two challenges have been proposed, but knowledge of their properties and meaningful comparisons between them are scarce. To define a state of the art and to provide evidence-supported guidance to researchers who have only a basic level of statistical knowledge, many outstanding issues in multivariable modelling remain. Our main aims are to identify and illustrate such gaps in the literature and present them at a moderate technical level to the wide community of practitioners, researchers and students of statistics. METHODS: We briefly discuss general issues in building descriptive regression models, strategies for variable selection, different ways of choosing functional forms for continuous variables and methods for combining the selection of variables and functions. We discuss two examples, taken from the medical literature, to illustrate problems in the practice of modelling. RESULTS: Our overview revealed that there is not yet enough evidence on which to base recommendations for the selection of variables and functional forms in multivariable analysis. Such evidence may come from comparisons between alternative methods. In particular, we highlight seven important topics that require further investigation and make suggestions for the direction of further research. CONCLUSIONS: Selection of variables and of functional forms are important topics in multivariable analysis. To define a state of the art and to provide evidence-supported guidance to researchers who have only a basic level of statistical knowledge, further comparative research is required.
format Online
Article
Text
id pubmed-7114804
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-71148042020-04-07 State of the art in selection of variables and functional forms in multivariable analysis—outstanding issues Sauerbrei, Willi Perperoglou, Aris Schmid, Matthias Abrahamowicz, Michal Becher, Heiko Binder, Harald Dunkler, Daniela Harrell, Frank E. Royston, Patrick Heinze, Georg Diagn Progn Res Commentary BACKGROUND: How to select variables and identify functional forms for continuous variables is a key concern when creating a multivariable model. Ad hoc ‘traditional’ approaches to variable selection have been in use for at least 50 years. Similarly, methods for determining functional forms for continuous variables were first suggested many years ago. More recently, many alternative approaches to address these two challenges have been proposed, but knowledge of their properties and meaningful comparisons between them are scarce. To define a state of the art and to provide evidence-supported guidance to researchers who have only a basic level of statistical knowledge, many outstanding issues in multivariable modelling remain. Our main aims are to identify and illustrate such gaps in the literature and present them at a moderate technical level to the wide community of practitioners, researchers and students of statistics. METHODS: We briefly discuss general issues in building descriptive regression models, strategies for variable selection, different ways of choosing functional forms for continuous variables and methods for combining the selection of variables and functions. We discuss two examples, taken from the medical literature, to illustrate problems in the practice of modelling. RESULTS: Our overview revealed that there is not yet enough evidence on which to base recommendations for the selection of variables and functional forms in multivariable analysis. Such evidence may come from comparisons between alternative methods. In particular, we highlight seven important topics that require further investigation and make suggestions for the direction of further research. CONCLUSIONS: Selection of variables and of functional forms are important topics in multivariable analysis. To define a state of the art and to provide evidence-supported guidance to researchers who have only a basic level of statistical knowledge, further comparative research is required. BioMed Central 2020-04-02 /pmc/articles/PMC7114804/ /pubmed/32266321 http://dx.doi.org/10.1186/s41512-020-00074-3 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Commentary
Sauerbrei, Willi
Perperoglou, Aris
Schmid, Matthias
Abrahamowicz, Michal
Becher, Heiko
Binder, Harald
Dunkler, Daniela
Harrell, Frank E.
Royston, Patrick
Heinze, Georg
State of the art in selection of variables and functional forms in multivariable analysis—outstanding issues
title State of the art in selection of variables and functional forms in multivariable analysis—outstanding issues
title_full State of the art in selection of variables and functional forms in multivariable analysis—outstanding issues
title_fullStr State of the art in selection of variables and functional forms in multivariable analysis—outstanding issues
title_full_unstemmed State of the art in selection of variables and functional forms in multivariable analysis—outstanding issues
title_short State of the art in selection of variables and functional forms in multivariable analysis—outstanding issues
title_sort state of the art in selection of variables and functional forms in multivariable analysis—outstanding issues
topic Commentary
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7114804/
https://www.ncbi.nlm.nih.gov/pubmed/32266321
http://dx.doi.org/10.1186/s41512-020-00074-3
work_keys_str_mv AT sauerbreiwilli stateoftheartinselectionofvariablesandfunctionalformsinmultivariableanalysisoutstandingissues
AT perperoglouaris stateoftheartinselectionofvariablesandfunctionalformsinmultivariableanalysisoutstandingissues
AT schmidmatthias stateoftheartinselectionofvariablesandfunctionalformsinmultivariableanalysisoutstandingissues
AT abrahamowiczmichal stateoftheartinselectionofvariablesandfunctionalformsinmultivariableanalysisoutstandingissues
AT becherheiko stateoftheartinselectionofvariablesandfunctionalformsinmultivariableanalysisoutstandingissues
AT binderharald stateoftheartinselectionofvariablesandfunctionalformsinmultivariableanalysisoutstandingissues
AT dunklerdaniela stateoftheartinselectionofvariablesandfunctionalformsinmultivariableanalysisoutstandingissues
AT harrellfranke stateoftheartinselectionofvariablesandfunctionalformsinmultivariableanalysisoutstandingissues
AT roystonpatrick stateoftheartinselectionofvariablesandfunctionalformsinmultivariableanalysisoutstandingissues
AT heinzegeorg stateoftheartinselectionofvariablesandfunctionalformsinmultivariableanalysisoutstandingissues
AT stateoftheartinselectionofvariablesandfunctionalformsinmultivariableanalysisoutstandingissues