Cargando…

Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score

Current stroke risk assessment tools presume the impact of risk factors is linear and cumulative. However, both novel risk factors and their interplay influencing stroke incidence are difficult to reveal using traditional additive models. The goal of this study was to improve upon the established Re...

Descripción completa

Detalles Bibliográficos
Autores principales: Orfanoudaki, Agni, Chesley, Emma, Cadisch, Christian, Stein, Barry, Nouh, Amre, Alberts, Mark J., Bertsimas, Dimitris
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7241753/
https://www.ncbi.nlm.nih.gov/pubmed/32437368
http://dx.doi.org/10.1371/journal.pone.0232414
_version_ 1783537121511866368
author Orfanoudaki, Agni
Chesley, Emma
Cadisch, Christian
Stein, Barry
Nouh, Amre
Alberts, Mark J.
Bertsimas, Dimitris
author_facet Orfanoudaki, Agni
Chesley, Emma
Cadisch, Christian
Stein, Barry
Nouh, Amre
Alberts, Mark J.
Bertsimas, Dimitris
author_sort Orfanoudaki, Agni
collection PubMed
description Current stroke risk assessment tools presume the impact of risk factors is linear and cumulative. However, both novel risk factors and their interplay influencing stroke incidence are difficult to reveal using traditional additive models. The goal of this study was to improve upon the established Revised Framingham Stroke Risk Score and design an interactive Non-Linear Stroke Risk Score. Leveraging machine learning algorithms, our work aimed at increasing the accuracy of event prediction and uncovering new relationships in an interpretable fashion. A two-phase approach was used to create our stroke risk prediction score. First, clinical examinations of the Framingham offspring cohort were utilized as the training dataset for the predictive model. Optimal Classification Trees were used to develop a tree-based model to predict 10-year risk of stroke. Unlike classical methods, this algorithm adaptively changes the splits on the independent variables, introducing non-linear interactions among them. Second, the model was validated with a multi-ethnicity cohort from the Boston Medical Center. Our stroke risk score suggests a key dichotomy between patients with history of cardiovascular disease and the rest of the population. While it agrees with known findings, it also identified 23 unique stroke risk profiles and highlighted new non-linear relationships; such as the role of T-wave abnormality on electrocardiography and hematocrit levels in a patient’s risk profile. Our results suggested that the non-linear approach significantly improves upon the baseline in the c-statistic (training 87.43% (CI 0.85–0.90) vs. 73.74% (CI 0.70–0.76); validation 75.29% (CI 0.74–0.76) vs 65.93% (CI 0.64–0.67), even in multi-ethnicity populations. The clinical implications of the new risk score include prioritization of risk factor modification and personalized care at the patient level with improved targeting of interventions for stroke prevention.
format Online
Article
Text
id pubmed-7241753
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-72417532020-06-03 Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score Orfanoudaki, Agni Chesley, Emma Cadisch, Christian Stein, Barry Nouh, Amre Alberts, Mark J. Bertsimas, Dimitris PLoS One Research Article Current stroke risk assessment tools presume the impact of risk factors is linear and cumulative. However, both novel risk factors and their interplay influencing stroke incidence are difficult to reveal using traditional additive models. The goal of this study was to improve upon the established Revised Framingham Stroke Risk Score and design an interactive Non-Linear Stroke Risk Score. Leveraging machine learning algorithms, our work aimed at increasing the accuracy of event prediction and uncovering new relationships in an interpretable fashion. A two-phase approach was used to create our stroke risk prediction score. First, clinical examinations of the Framingham offspring cohort were utilized as the training dataset for the predictive model. Optimal Classification Trees were used to develop a tree-based model to predict 10-year risk of stroke. Unlike classical methods, this algorithm adaptively changes the splits on the independent variables, introducing non-linear interactions among them. Second, the model was validated with a multi-ethnicity cohort from the Boston Medical Center. Our stroke risk score suggests a key dichotomy between patients with history of cardiovascular disease and the rest of the population. While it agrees with known findings, it also identified 23 unique stroke risk profiles and highlighted new non-linear relationships; such as the role of T-wave abnormality on electrocardiography and hematocrit levels in a patient’s risk profile. Our results suggested that the non-linear approach significantly improves upon the baseline in the c-statistic (training 87.43% (CI 0.85–0.90) vs. 73.74% (CI 0.70–0.76); validation 75.29% (CI 0.74–0.76) vs 65.93% (CI 0.64–0.67), even in multi-ethnicity populations. The clinical implications of the new risk score include prioritization of risk factor modification and personalized care at the patient level with improved targeting of interventions for stroke prevention. Public Library of Science 2020-05-21 /pmc/articles/PMC7241753/ /pubmed/32437368 http://dx.doi.org/10.1371/journal.pone.0232414 Text en © 2020 Orfanoudaki et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Orfanoudaki, Agni
Chesley, Emma
Cadisch, Christian
Stein, Barry
Nouh, Amre
Alberts, Mark J.
Bertsimas, Dimitris
Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score
title Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score
title_full Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score
title_fullStr Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score
title_full_unstemmed Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score
title_short Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score
title_sort machine learning provides evidence that stroke risk is not linear: the non-linear framingham stroke risk score
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7241753/
https://www.ncbi.nlm.nih.gov/pubmed/32437368
http://dx.doi.org/10.1371/journal.pone.0232414
work_keys_str_mv AT orfanoudakiagni machinelearningprovidesevidencethatstrokeriskisnotlinearthenonlinearframinghamstrokeriskscore
AT chesleyemma machinelearningprovidesevidencethatstrokeriskisnotlinearthenonlinearframinghamstrokeriskscore
AT cadischchristian machinelearningprovidesevidencethatstrokeriskisnotlinearthenonlinearframinghamstrokeriskscore
AT steinbarry machinelearningprovidesevidencethatstrokeriskisnotlinearthenonlinearframinghamstrokeriskscore
AT nouhamre machinelearningprovidesevidencethatstrokeriskisnotlinearthenonlinearframinghamstrokeriskscore
AT albertsmarkj machinelearningprovidesevidencethatstrokeriskisnotlinearthenonlinearframinghamstrokeriskscore
AT bertsimasdimitris machinelearningprovidesevidencethatstrokeriskisnotlinearthenonlinearframinghamstrokeriskscore