Cargando…

The First Predictor of Impending Stroke: An Internet Search-Based Algorithm Blind to Established Cardiometabolic Risk, Outperforms Classical Risk Factor-Based Calculators

Cerebrovascular disease is a leading cause of mortality and disability and an immense global burden. It is partly related to aging in a metabolic syndrome-promoting environment. Prevention strategies are insufficient: they rely on intermittent screening in predominantly high-risk individuals, while...

Descripción completa

Detalles Bibliográficos
Autores principales: Shaklai, Sigal, Gilad-Bachrach, Ran, Yom-Tov, Elad, Stern, Naftali
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8090271/
http://dx.doi.org/10.1210/jendso/bvab048.610
_version_ 1783687242984718336
author Shaklai, Sigal
Gilad-Bachrach, Ran
Yom-Tov, Elad
Stern, Naftali
author_facet Shaklai, Sigal
Gilad-Bachrach, Ran
Yom-Tov, Elad
Stern, Naftali
author_sort Shaklai, Sigal
collection PubMed
description Cerebrovascular disease is a leading cause of mortality and disability and an immense global burden. It is partly related to aging in a metabolic syndrome-promoting environment. Prevention strategies are insufficient: they rely on intermittent screening in predominantly high-risk individuals, while most cases occur in the intermediate risk population. Screening algorithms like the Framingham Risk Score predict events in the next decade using traditional risk factors (age, diabetes, hypertension, obesity, hyperlipidemia, atrial fibrillation and smoking). No current tool can predict a near/impending stroke in the next few weeks/months. This could provide a time window for urgent preventive measures (anticoagulation, hypertension control or lipid lowering). Algorithms analyzing daily computer usage, examining motor, language and executive functions, correlate with standard cognitive tests. Covert cerebrovascular disease is linked to subtle cognitive and motor deficits and increased risk for stroke. We examined whether this could be harnessed to predict an imminent stroke. We developed an algorithm based on internet search queries to identify people at increased risk for a near stroke event. The algorithm was entirely blind to traditional cardiovascular risk factors or risks for cognitive decline. We analyzed queries submitted to the Bing search engine by 285 people for which a stroke event was identified and 1195 controls, with regards to attributes previously shown to reflect cognitive function. Controls included random people 60 years and above or similar aged individuals who queried for one of eight control conditions: myocardial infarction, atrial fibrillation, hypertension, migraine, B12 deficiency, depression, hypothyroidism and surgery. We used a random forest model with 1000 trees to distinguish the patient cohort from each of the control cohorts. Ten-fold cross-validation at the user level was used to reduce the likelihood of overfitting. All processing was conducted using Matlab 2019. Results: Our model performed well against all controls with area under the curve of the receiver operating curve (AUC) of 0.985 or higher and a true positive rate (at 1% false positive rate) above 80% for separating patients from each of the controls. The predictive power rose as the stroke date approached and if data was acquired beginning 120 days prior to the event. Good prediction accuracy was also obtained for a prospective cohort of users collected one year later. We propose that impending stroke can be identified through alterations in internet usage patterns that reflect cognitive function, with high predictive power (AUC exceeding 0.985 for a near event in comparison to an AUC of ~0.71 in the Framingham Stroke Risk Score for a 10 year event). This could potentially be applied inexpensively, continuously and on a large scale to willing individuals with the aim of reducing stroke events.
format Online
Article
Text
id pubmed-8090271
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-80902712021-05-06 The First Predictor of Impending Stroke: An Internet Search-Based Algorithm Blind to Established Cardiometabolic Risk, Outperforms Classical Risk Factor-Based Calculators Shaklai, Sigal Gilad-Bachrach, Ran Yom-Tov, Elad Stern, Naftali J Endocr Soc Cardiovascular Endocrinology Cerebrovascular disease is a leading cause of mortality and disability and an immense global burden. It is partly related to aging in a metabolic syndrome-promoting environment. Prevention strategies are insufficient: they rely on intermittent screening in predominantly high-risk individuals, while most cases occur in the intermediate risk population. Screening algorithms like the Framingham Risk Score predict events in the next decade using traditional risk factors (age, diabetes, hypertension, obesity, hyperlipidemia, atrial fibrillation and smoking). No current tool can predict a near/impending stroke in the next few weeks/months. This could provide a time window for urgent preventive measures (anticoagulation, hypertension control or lipid lowering). Algorithms analyzing daily computer usage, examining motor, language and executive functions, correlate with standard cognitive tests. Covert cerebrovascular disease is linked to subtle cognitive and motor deficits and increased risk for stroke. We examined whether this could be harnessed to predict an imminent stroke. We developed an algorithm based on internet search queries to identify people at increased risk for a near stroke event. The algorithm was entirely blind to traditional cardiovascular risk factors or risks for cognitive decline. We analyzed queries submitted to the Bing search engine by 285 people for which a stroke event was identified and 1195 controls, with regards to attributes previously shown to reflect cognitive function. Controls included random people 60 years and above or similar aged individuals who queried for one of eight control conditions: myocardial infarction, atrial fibrillation, hypertension, migraine, B12 deficiency, depression, hypothyroidism and surgery. We used a random forest model with 1000 trees to distinguish the patient cohort from each of the control cohorts. Ten-fold cross-validation at the user level was used to reduce the likelihood of overfitting. All processing was conducted using Matlab 2019. Results: Our model performed well against all controls with area under the curve of the receiver operating curve (AUC) of 0.985 or higher and a true positive rate (at 1% false positive rate) above 80% for separating patients from each of the controls. The predictive power rose as the stroke date approached and if data was acquired beginning 120 days prior to the event. Good prediction accuracy was also obtained for a prospective cohort of users collected one year later. We propose that impending stroke can be identified through alterations in internet usage patterns that reflect cognitive function, with high predictive power (AUC exceeding 0.985 for a near event in comparison to an AUC of ~0.71 in the Framingham Stroke Risk Score for a 10 year event). This could potentially be applied inexpensively, continuously and on a large scale to willing individuals with the aim of reducing stroke events. Oxford University Press 2021-05-03 /pmc/articles/PMC8090271/ http://dx.doi.org/10.1210/jendso/bvab048.610 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the Endocrine Society. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) ), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Cardiovascular Endocrinology
Shaklai, Sigal
Gilad-Bachrach, Ran
Yom-Tov, Elad
Stern, Naftali
The First Predictor of Impending Stroke: An Internet Search-Based Algorithm Blind to Established Cardiometabolic Risk, Outperforms Classical Risk Factor-Based Calculators
title The First Predictor of Impending Stroke: An Internet Search-Based Algorithm Blind to Established Cardiometabolic Risk, Outperforms Classical Risk Factor-Based Calculators
title_full The First Predictor of Impending Stroke: An Internet Search-Based Algorithm Blind to Established Cardiometabolic Risk, Outperforms Classical Risk Factor-Based Calculators
title_fullStr The First Predictor of Impending Stroke: An Internet Search-Based Algorithm Blind to Established Cardiometabolic Risk, Outperforms Classical Risk Factor-Based Calculators
title_full_unstemmed The First Predictor of Impending Stroke: An Internet Search-Based Algorithm Blind to Established Cardiometabolic Risk, Outperforms Classical Risk Factor-Based Calculators
title_short The First Predictor of Impending Stroke: An Internet Search-Based Algorithm Blind to Established Cardiometabolic Risk, Outperforms Classical Risk Factor-Based Calculators
title_sort first predictor of impending stroke: an internet search-based algorithm blind to established cardiometabolic risk, outperforms classical risk factor-based calculators
topic Cardiovascular Endocrinology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8090271/
http://dx.doi.org/10.1210/jendso/bvab048.610
work_keys_str_mv AT shaklaisigal thefirstpredictorofimpendingstrokeaninternetsearchbasedalgorithmblindtoestablishedcardiometabolicriskoutperformsclassicalriskfactorbasedcalculators
AT giladbachrachran thefirstpredictorofimpendingstrokeaninternetsearchbasedalgorithmblindtoestablishedcardiometabolicriskoutperformsclassicalriskfactorbasedcalculators
AT yomtovelad thefirstpredictorofimpendingstrokeaninternetsearchbasedalgorithmblindtoestablishedcardiometabolicriskoutperformsclassicalriskfactorbasedcalculators
AT sternnaftali thefirstpredictorofimpendingstrokeaninternetsearchbasedalgorithmblindtoestablishedcardiometabolicriskoutperformsclassicalriskfactorbasedcalculators
AT shaklaisigal firstpredictorofimpendingstrokeaninternetsearchbasedalgorithmblindtoestablishedcardiometabolicriskoutperformsclassicalriskfactorbasedcalculators
AT giladbachrachran firstpredictorofimpendingstrokeaninternetsearchbasedalgorithmblindtoestablishedcardiometabolicriskoutperformsclassicalriskfactorbasedcalculators
AT yomtovelad firstpredictorofimpendingstrokeaninternetsearchbasedalgorithmblindtoestablishedcardiometabolicriskoutperformsclassicalriskfactorbasedcalculators
AT sternnaftali firstpredictorofimpendingstrokeaninternetsearchbasedalgorithmblindtoestablishedcardiometabolicriskoutperformsclassicalriskfactorbasedcalculators