Cargando…
Random forest-based prediction of stroke outcome
We research into the clinical, biochemical and neuroimaging factors associated with the outcome of stroke patients to generate a predictive model using machine learning techniques for prediction of mortality and morbidity 3-months after admission. The dataset consisted of patients with ischemic stro...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8115135/ https://www.ncbi.nlm.nih.gov/pubmed/33980906 http://dx.doi.org/10.1038/s41598-021-89434-7 |
_version_ | 1783691180089802752 |
---|---|
author | Fernandez-Lozano, Carlos Hervella, Pablo Mato-Abad, Virginia Rodríguez-Yáñez, Manuel Suárez-Garaboa, Sonia López-Dequidt, Iria Estany-Gestal, Ana Sobrino, Tomás Campos, Francisco Castillo, José Rodríguez-Yáñez, Santiago Iglesias-Rey, Ramón |
author_facet | Fernandez-Lozano, Carlos Hervella, Pablo Mato-Abad, Virginia Rodríguez-Yáñez, Manuel Suárez-Garaboa, Sonia López-Dequidt, Iria Estany-Gestal, Ana Sobrino, Tomás Campos, Francisco Castillo, José Rodríguez-Yáñez, Santiago Iglesias-Rey, Ramón |
author_sort | Fernandez-Lozano, Carlos |
collection | PubMed |
description | We research into the clinical, biochemical and neuroimaging factors associated with the outcome of stroke patients to generate a predictive model using machine learning techniques for prediction of mortality and morbidity 3-months after admission. The dataset consisted of patients with ischemic stroke (IS) and non-traumatic intracerebral hemorrhage (ICH) admitted to Stroke Unit of a European Tertiary Hospital prospectively registered. We identified the main variables for machine learning Random Forest (RF), generating a predictive model that can estimate patient mortality/morbidity according to the following groups: (1) IS + ICH, (2) IS, and (3) ICH. A total of 6022 patients were included: 4922 (mean age 71.9 ± 13.8 years) with IS and 1100 (mean age 73.3 ± 13.1 years) with ICH. NIHSS at 24, 48 h and axillary temperature at admission were the most important variables to consider for evolution of patients at 3-months. IS + ICH group was the most stable for mortality prediction [0.904 ± 0.025 of area under the receiver operating characteristics curve (AUC)]. IS group presented similar results, although variability between experiments was slightly higher (0.909 ± 0.032 of AUC). ICH group was the one in which RF had more problems to make adequate predictions (0.9837 vs. 0.7104 of AUC). There were no major differences between IS and IS + ICH groups according to morbidity prediction (0.738 and 0.755 of AUC) but, after checking normality with a Shapiro Wilk test with the null hypothesis that the data follow a normal distribution, it was rejected with W = 0.93546 (p-value < 2.2e−16). Conditions required for a parametric test do not hold, and we performed a paired Wilcoxon Test assuming the null hypothesis that all the groups have the same performance. The null hypothesis was rejected with a value < 2.2e−16, so there are statistical differences between IS and ICH groups. In conclusion, machine learning algorithms RF can be effectively used in stroke patients for long-term outcome prediction of mortality and morbidity. |
format | Online Article Text |
id | pubmed-8115135 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-81151352021-05-12 Random forest-based prediction of stroke outcome Fernandez-Lozano, Carlos Hervella, Pablo Mato-Abad, Virginia Rodríguez-Yáñez, Manuel Suárez-Garaboa, Sonia López-Dequidt, Iria Estany-Gestal, Ana Sobrino, Tomás Campos, Francisco Castillo, José Rodríguez-Yáñez, Santiago Iglesias-Rey, Ramón Sci Rep Article We research into the clinical, biochemical and neuroimaging factors associated with the outcome of stroke patients to generate a predictive model using machine learning techniques for prediction of mortality and morbidity 3-months after admission. The dataset consisted of patients with ischemic stroke (IS) and non-traumatic intracerebral hemorrhage (ICH) admitted to Stroke Unit of a European Tertiary Hospital prospectively registered. We identified the main variables for machine learning Random Forest (RF), generating a predictive model that can estimate patient mortality/morbidity according to the following groups: (1) IS + ICH, (2) IS, and (3) ICH. A total of 6022 patients were included: 4922 (mean age 71.9 ± 13.8 years) with IS and 1100 (mean age 73.3 ± 13.1 years) with ICH. NIHSS at 24, 48 h and axillary temperature at admission were the most important variables to consider for evolution of patients at 3-months. IS + ICH group was the most stable for mortality prediction [0.904 ± 0.025 of area under the receiver operating characteristics curve (AUC)]. IS group presented similar results, although variability between experiments was slightly higher (0.909 ± 0.032 of AUC). ICH group was the one in which RF had more problems to make adequate predictions (0.9837 vs. 0.7104 of AUC). There were no major differences between IS and IS + ICH groups according to morbidity prediction (0.738 and 0.755 of AUC) but, after checking normality with a Shapiro Wilk test with the null hypothesis that the data follow a normal distribution, it was rejected with W = 0.93546 (p-value < 2.2e−16). Conditions required for a parametric test do not hold, and we performed a paired Wilcoxon Test assuming the null hypothesis that all the groups have the same performance. The null hypothesis was rejected with a value < 2.2e−16, so there are statistical differences between IS and ICH groups. In conclusion, machine learning algorithms RF can be effectively used in stroke patients for long-term outcome prediction of mortality and morbidity. Nature Publishing Group UK 2021-05-12 /pmc/articles/PMC8115135/ /pubmed/33980906 http://dx.doi.org/10.1038/s41598-021-89434-7 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Fernandez-Lozano, Carlos Hervella, Pablo Mato-Abad, Virginia Rodríguez-Yáñez, Manuel Suárez-Garaboa, Sonia López-Dequidt, Iria Estany-Gestal, Ana Sobrino, Tomás Campos, Francisco Castillo, José Rodríguez-Yáñez, Santiago Iglesias-Rey, Ramón Random forest-based prediction of stroke outcome |
title | Random forest-based prediction of stroke outcome |
title_full | Random forest-based prediction of stroke outcome |
title_fullStr | Random forest-based prediction of stroke outcome |
title_full_unstemmed | Random forest-based prediction of stroke outcome |
title_short | Random forest-based prediction of stroke outcome |
title_sort | random forest-based prediction of stroke outcome |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8115135/ https://www.ncbi.nlm.nih.gov/pubmed/33980906 http://dx.doi.org/10.1038/s41598-021-89434-7 |
work_keys_str_mv | AT fernandezlozanocarlos randomforestbasedpredictionofstrokeoutcome AT hervellapablo randomforestbasedpredictionofstrokeoutcome AT matoabadvirginia randomforestbasedpredictionofstrokeoutcome AT rodriguezyanezmanuel randomforestbasedpredictionofstrokeoutcome AT suarezgaraboasonia randomforestbasedpredictionofstrokeoutcome AT lopezdequidtiria randomforestbasedpredictionofstrokeoutcome AT estanygestalana randomforestbasedpredictionofstrokeoutcome AT sobrinotomas randomforestbasedpredictionofstrokeoutcome AT camposfrancisco randomforestbasedpredictionofstrokeoutcome AT castillojose randomforestbasedpredictionofstrokeoutcome AT rodriguezyanezsantiago randomforestbasedpredictionofstrokeoutcome AT iglesiasreyramon randomforestbasedpredictionofstrokeoutcome |