Cargando…

Random forest-based prediction of stroke outcome

We research into the clinical, biochemical and neuroimaging factors associated with the outcome of stroke patients to generate a predictive model using machine learning techniques for prediction of mortality and morbidity 3-months after admission. The dataset consisted of patients with ischemic stro...

Descripción completa

Detalles Bibliográficos
Autores principales: Fernandez-Lozano, Carlos, Hervella, Pablo, Mato-Abad, Virginia, Rodríguez-Yáñez, Manuel, Suárez-Garaboa, Sonia, López-Dequidt, Iria, Estany-Gestal, Ana, Sobrino, Tomás, Campos, Francisco, Castillo, José, Rodríguez-Yáñez, Santiago, Iglesias-Rey, Ramón
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8115135/
https://www.ncbi.nlm.nih.gov/pubmed/33980906
http://dx.doi.org/10.1038/s41598-021-89434-7
_version_ 1783691180089802752
author Fernandez-Lozano, Carlos
Hervella, Pablo
Mato-Abad, Virginia
Rodríguez-Yáñez, Manuel
Suárez-Garaboa, Sonia
López-Dequidt, Iria
Estany-Gestal, Ana
Sobrino, Tomás
Campos, Francisco
Castillo, José
Rodríguez-Yáñez, Santiago
Iglesias-Rey, Ramón
author_facet Fernandez-Lozano, Carlos
Hervella, Pablo
Mato-Abad, Virginia
Rodríguez-Yáñez, Manuel
Suárez-Garaboa, Sonia
López-Dequidt, Iria
Estany-Gestal, Ana
Sobrino, Tomás
Campos, Francisco
Castillo, José
Rodríguez-Yáñez, Santiago
Iglesias-Rey, Ramón
author_sort Fernandez-Lozano, Carlos
collection PubMed
description We research into the clinical, biochemical and neuroimaging factors associated with the outcome of stroke patients to generate a predictive model using machine learning techniques for prediction of mortality and morbidity 3-months after admission. The dataset consisted of patients with ischemic stroke (IS) and non-traumatic intracerebral hemorrhage (ICH) admitted to Stroke Unit of a European Tertiary Hospital prospectively registered. We identified the main variables for machine learning Random Forest (RF), generating a predictive model that can estimate patient mortality/morbidity according to the following groups: (1) IS + ICH, (2) IS, and (3) ICH. A total of 6022 patients were included: 4922 (mean age 71.9 ± 13.8 years) with IS and 1100 (mean age 73.3 ± 13.1 years) with ICH. NIHSS at 24, 48 h and axillary temperature at admission were the most important variables to consider for evolution of patients at 3-months. IS + ICH group was the most stable for mortality prediction [0.904 ± 0.025 of area under the receiver operating characteristics curve (AUC)]. IS group presented similar results, although variability between experiments was slightly higher (0.909 ± 0.032 of AUC). ICH group was the one in which RF had more problems to make adequate predictions (0.9837 vs. 0.7104 of AUC). There were no major differences between IS and IS + ICH groups according to morbidity prediction (0.738 and 0.755 of AUC) but, after checking normality with a Shapiro Wilk test with the null hypothesis that the data follow a normal distribution, it was rejected with W = 0.93546 (p-value < 2.2e−16). Conditions required for a parametric test do not hold, and we performed a paired Wilcoxon Test assuming the null hypothesis that all the groups have the same performance. The null hypothesis was rejected with a value < 2.2e−16, so there are statistical differences between IS and ICH groups. In conclusion, machine learning algorithms RF can be effectively used in stroke patients for long-term outcome prediction of mortality and morbidity.
format Online
Article
Text
id pubmed-8115135
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-81151352021-05-12 Random forest-based prediction of stroke outcome Fernandez-Lozano, Carlos Hervella, Pablo Mato-Abad, Virginia Rodríguez-Yáñez, Manuel Suárez-Garaboa, Sonia López-Dequidt, Iria Estany-Gestal, Ana Sobrino, Tomás Campos, Francisco Castillo, José Rodríguez-Yáñez, Santiago Iglesias-Rey, Ramón Sci Rep Article We research into the clinical, biochemical and neuroimaging factors associated with the outcome of stroke patients to generate a predictive model using machine learning techniques for prediction of mortality and morbidity 3-months after admission. The dataset consisted of patients with ischemic stroke (IS) and non-traumatic intracerebral hemorrhage (ICH) admitted to Stroke Unit of a European Tertiary Hospital prospectively registered. We identified the main variables for machine learning Random Forest (RF), generating a predictive model that can estimate patient mortality/morbidity according to the following groups: (1) IS + ICH, (2) IS, and (3) ICH. A total of 6022 patients were included: 4922 (mean age 71.9 ± 13.8 years) with IS and 1100 (mean age 73.3 ± 13.1 years) with ICH. NIHSS at 24, 48 h and axillary temperature at admission were the most important variables to consider for evolution of patients at 3-months. IS + ICH group was the most stable for mortality prediction [0.904 ± 0.025 of area under the receiver operating characteristics curve (AUC)]. IS group presented similar results, although variability between experiments was slightly higher (0.909 ± 0.032 of AUC). ICH group was the one in which RF had more problems to make adequate predictions (0.9837 vs. 0.7104 of AUC). There were no major differences between IS and IS + ICH groups according to morbidity prediction (0.738 and 0.755 of AUC) but, after checking normality with a Shapiro Wilk test with the null hypothesis that the data follow a normal distribution, it was rejected with W = 0.93546 (p-value < 2.2e−16). Conditions required for a parametric test do not hold, and we performed a paired Wilcoxon Test assuming the null hypothesis that all the groups have the same performance. The null hypothesis was rejected with a value < 2.2e−16, so there are statistical differences between IS and ICH groups. In conclusion, machine learning algorithms RF can be effectively used in stroke patients for long-term outcome prediction of mortality and morbidity. Nature Publishing Group UK 2021-05-12 /pmc/articles/PMC8115135/ /pubmed/33980906 http://dx.doi.org/10.1038/s41598-021-89434-7 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Fernandez-Lozano, Carlos
Hervella, Pablo
Mato-Abad, Virginia
Rodríguez-Yáñez, Manuel
Suárez-Garaboa, Sonia
López-Dequidt, Iria
Estany-Gestal, Ana
Sobrino, Tomás
Campos, Francisco
Castillo, José
Rodríguez-Yáñez, Santiago
Iglesias-Rey, Ramón
Random forest-based prediction of stroke outcome
title Random forest-based prediction of stroke outcome
title_full Random forest-based prediction of stroke outcome
title_fullStr Random forest-based prediction of stroke outcome
title_full_unstemmed Random forest-based prediction of stroke outcome
title_short Random forest-based prediction of stroke outcome
title_sort random forest-based prediction of stroke outcome
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8115135/
https://www.ncbi.nlm.nih.gov/pubmed/33980906
http://dx.doi.org/10.1038/s41598-021-89434-7
work_keys_str_mv AT fernandezlozanocarlos randomforestbasedpredictionofstrokeoutcome
AT hervellapablo randomforestbasedpredictionofstrokeoutcome
AT matoabadvirginia randomforestbasedpredictionofstrokeoutcome
AT rodriguezyanezmanuel randomforestbasedpredictionofstrokeoutcome
AT suarezgaraboasonia randomforestbasedpredictionofstrokeoutcome
AT lopezdequidtiria randomforestbasedpredictionofstrokeoutcome
AT estanygestalana randomforestbasedpredictionofstrokeoutcome
AT sobrinotomas randomforestbasedpredictionofstrokeoutcome
AT camposfrancisco randomforestbasedpredictionofstrokeoutcome
AT castillojose randomforestbasedpredictionofstrokeoutcome
AT rodriguezyanezsantiago randomforestbasedpredictionofstrokeoutcome
AT iglesiasreyramon randomforestbasedpredictionofstrokeoutcome