Cargando…

Empirical analyses and simulations showed that different machine and statistical learning methods had differing performance for predicting blood pressure

Machine learning is increasingly being used to predict clinical outcomes. Most comparisons of different methods have been based on empirical analyses in specific datasets. We used Monte Carlo simulations to determine when machine learning methods perform better than statistical learning methods in a...

Descripción completa

Detalles Bibliográficos
Autores principales:	Austin, Peter C., Harrell, Frank E., Lee, Douglas S., Steyerberg, Ewout W.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9166797/ https://www.ncbi.nlm.nih.gov/pubmed/35660759 http://dx.doi.org/10.1038/s41598-022-13015-5

_version_	1784720686210613248
author	Austin, Peter C. Harrell, Frank E. Lee, Douglas S. Steyerberg, Ewout W.
author_facet	Austin, Peter C. Harrell, Frank E. Lee, Douglas S. Steyerberg, Ewout W.
author_sort	Austin, Peter C.
collection	PubMed
description	Machine learning is increasingly being used to predict clinical outcomes. Most comparisons of different methods have been based on empirical analyses in specific datasets. We used Monte Carlo simulations to determine when machine learning methods perform better than statistical learning methods in a specific setting. We evaluated six learning methods: stochastic gradient boosting machines using trees as the base learners, random forests, artificial neural networks, the lasso, ridge regression, and linear regression estimated using ordinary least squares (OLS). Our simulations were informed by empirical analyses in patients with acute myocardial infarction (AMI) and congestive heart failure (CHF) and used six data-generating processes, each based on one of the six learning methods, to simulate continuous outcomes in the derivation and validation samples. The outcome was systolic blood pressure at hospital discharge, a continuous outcome. We applied the six learning methods in each of the simulated derivation samples and evaluated performance in the simulated validation samples. The primary observation was that neural networks tended to result in estimates with worse predictive accuracy than the other five methods in both disease samples and across all six data-generating processes. Boosted trees and OLS regression tended to perform well across a range of scenarios.
format	Online Article Text
id	pubmed-9166797
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-91667972022-06-05 Empirical analyses and simulations showed that different machine and statistical learning methods had differing performance for predicting blood pressure Austin, Peter C. Harrell, Frank E. Lee, Douglas S. Steyerberg, Ewout W. Sci Rep Article Machine learning is increasingly being used to predict clinical outcomes. Most comparisons of different methods have been based on empirical analyses in specific datasets. We used Monte Carlo simulations to determine when machine learning methods perform better than statistical learning methods in a specific setting. We evaluated six learning methods: stochastic gradient boosting machines using trees as the base learners, random forests, artificial neural networks, the lasso, ridge regression, and linear regression estimated using ordinary least squares (OLS). Our simulations were informed by empirical analyses in patients with acute myocardial infarction (AMI) and congestive heart failure (CHF) and used six data-generating processes, each based on one of the six learning methods, to simulate continuous outcomes in the derivation and validation samples. The outcome was systolic blood pressure at hospital discharge, a continuous outcome. We applied the six learning methods in each of the simulated derivation samples and evaluated performance in the simulated validation samples. The primary observation was that neural networks tended to result in estimates with worse predictive accuracy than the other five methods in both disease samples and across all six data-generating processes. Boosted trees and OLS regression tended to perform well across a range of scenarios. Nature Publishing Group UK 2022-06-03 /pmc/articles/PMC9166797/ /pubmed/35660759 http://dx.doi.org/10.1038/s41598-022-13015-5 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article Austin, Peter C. Harrell, Frank E. Lee, Douglas S. Steyerberg, Ewout W. Empirical analyses and simulations showed that different machine and statistical learning methods had differing performance for predicting blood pressure
title	Empirical analyses and simulations showed that different machine and statistical learning methods had differing performance for predicting blood pressure
title_full	Empirical analyses and simulations showed that different machine and statistical learning methods had differing performance for predicting blood pressure
title_fullStr	Empirical analyses and simulations showed that different machine and statistical learning methods had differing performance for predicting blood pressure
title_full_unstemmed	Empirical analyses and simulations showed that different machine and statistical learning methods had differing performance for predicting blood pressure
title_short	Empirical analyses and simulations showed that different machine and statistical learning methods had differing performance for predicting blood pressure
title_sort	empirical analyses and simulations showed that different machine and statistical learning methods had differing performance for predicting blood pressure
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9166797/ https://www.ncbi.nlm.nih.gov/pubmed/35660759 http://dx.doi.org/10.1038/s41598-022-13015-5
work_keys_str_mv	AT austinpeterc empiricalanalysesandsimulationsshowedthatdifferentmachineandstatisticallearningmethodshaddifferingperformanceforpredictingbloodpressure AT harrellfranke empiricalanalysesandsimulationsshowedthatdifferentmachineandstatisticallearningmethodshaddifferingperformanceforpredictingbloodpressure AT leedouglass empiricalanalysesandsimulationsshowedthatdifferentmachineandstatisticallearningmethodshaddifferingperformanceforpredictingbloodpressure AT steyerbergewoutw empiricalanalysesandsimulationsshowedthatdifferentmachineandstatisticallearningmethodshaddifferingperformanceforpredictingbloodpressure

Empirical analyses and simulations showed that different machine and statistical learning methods had differing performance for predicting blood pressure

Ejemplares similares