Cargando…

Automatic Variable Selection Algorithms in Prognostic Factor Research in Neck Pain

This study aims to compare the variable selection strategies of different machine learning (ML) and statistical algorithms in the prognosis of neck pain (NP) recovery. A total of 3001 participants with NP were included. Three dichotomous outcomes of an improvement in NP, arm pain (AP), and disabilit...

Descripción completa

Detalles Bibliográficos
Autores principales: Liew, Bernard X. W., Kovacs, Francisco M., Rügamer, David, Royuela, Ana
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10573798/
https://www.ncbi.nlm.nih.gov/pubmed/37834877
http://dx.doi.org/10.3390/jcm12196232
_version_ 1785120544657506304
author Liew, Bernard X. W.
Kovacs, Francisco M.
Rügamer, David
Royuela, Ana
author_facet Liew, Bernard X. W.
Kovacs, Francisco M.
Rügamer, David
Royuela, Ana
author_sort Liew, Bernard X. W.
collection PubMed
description This study aims to compare the variable selection strategies of different machine learning (ML) and statistical algorithms in the prognosis of neck pain (NP) recovery. A total of 3001 participants with NP were included. Three dichotomous outcomes of an improvement in NP, arm pain (AP), and disability at 3 months follow-up were used. Twenty-five variables (twenty-eight parameters) were included as predictors. There were more parameters than variables, as some categorical variables had >2 levels. Eight modelling techniques were compared: stepwise regression based on unadjusted p values (stepP), on adjusted p values (stepPAdj), on Akaike information criterion (stepAIC), best subset regression (BestSubset) least absolute shrinkage and selection operator [LASSO], Minimax concave penalty (MCP), model-based boosting (mboost), and multivariate adaptive regression splines (MuARS). The algorithm that selected the fewest predictors was stepPAdj (number of predictors, p = 4 to 8). MuARS was the algorithm with the second fewest predictors selected (p = 9 to 14). The predictor selected by all algorithms with the largest coefficient magnitude was “having undergone a neuroreflexotherapy intervention” for NP (β = from 1.987 to 2.296) and AP (β = from 2.639 to 3.554), and “Imaging findings: spinal stenosis” (β = from −1.331 to −1.763) for disability. Stepwise regression based on adjusted p-values resulted in the sparsest models, which enhanced clinical interpretability. MuARS appears to provide the optimal balance between model sparsity whilst retaining high predictive performance across outcomes. Different algorithms produced similar performances but resulted in a different number of variables selected. Rather than relying on any single algorithm, confidence in the variable selection may be increased by using multiple algorithms.
format Online
Article
Text
id pubmed-10573798
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-105737982023-10-14 Automatic Variable Selection Algorithms in Prognostic Factor Research in Neck Pain Liew, Bernard X. W. Kovacs, Francisco M. Rügamer, David Royuela, Ana J Clin Med Article This study aims to compare the variable selection strategies of different machine learning (ML) and statistical algorithms in the prognosis of neck pain (NP) recovery. A total of 3001 participants with NP were included. Three dichotomous outcomes of an improvement in NP, arm pain (AP), and disability at 3 months follow-up were used. Twenty-five variables (twenty-eight parameters) were included as predictors. There were more parameters than variables, as some categorical variables had >2 levels. Eight modelling techniques were compared: stepwise regression based on unadjusted p values (stepP), on adjusted p values (stepPAdj), on Akaike information criterion (stepAIC), best subset regression (BestSubset) least absolute shrinkage and selection operator [LASSO], Minimax concave penalty (MCP), model-based boosting (mboost), and multivariate adaptive regression splines (MuARS). The algorithm that selected the fewest predictors was stepPAdj (number of predictors, p = 4 to 8). MuARS was the algorithm with the second fewest predictors selected (p = 9 to 14). The predictor selected by all algorithms with the largest coefficient magnitude was “having undergone a neuroreflexotherapy intervention” for NP (β = from 1.987 to 2.296) and AP (β = from 2.639 to 3.554), and “Imaging findings: spinal stenosis” (β = from −1.331 to −1.763) for disability. Stepwise regression based on adjusted p-values resulted in the sparsest models, which enhanced clinical interpretability. MuARS appears to provide the optimal balance between model sparsity whilst retaining high predictive performance across outcomes. Different algorithms produced similar performances but resulted in a different number of variables selected. Rather than relying on any single algorithm, confidence in the variable selection may be increased by using multiple algorithms. MDPI 2023-09-27 /pmc/articles/PMC10573798/ /pubmed/37834877 http://dx.doi.org/10.3390/jcm12196232 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Liew, Bernard X. W.
Kovacs, Francisco M.
Rügamer, David
Royuela, Ana
Automatic Variable Selection Algorithms in Prognostic Factor Research in Neck Pain
title Automatic Variable Selection Algorithms in Prognostic Factor Research in Neck Pain
title_full Automatic Variable Selection Algorithms in Prognostic Factor Research in Neck Pain
title_fullStr Automatic Variable Selection Algorithms in Prognostic Factor Research in Neck Pain
title_full_unstemmed Automatic Variable Selection Algorithms in Prognostic Factor Research in Neck Pain
title_short Automatic Variable Selection Algorithms in Prognostic Factor Research in Neck Pain
title_sort automatic variable selection algorithms in prognostic factor research in neck pain
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10573798/
https://www.ncbi.nlm.nih.gov/pubmed/37834877
http://dx.doi.org/10.3390/jcm12196232
work_keys_str_mv AT liewbernardxw automaticvariableselectionalgorithmsinprognosticfactorresearchinneckpain
AT kovacsfranciscom automaticvariableselectionalgorithmsinprognosticfactorresearchinneckpain
AT rugamerdavid automaticvariableselectionalgorithmsinprognosticfactorresearchinneckpain
AT royuelaana automaticvariableselectionalgorithmsinprognosticfactorresearchinneckpain