Cargando…

Predicting atrial fibrillation in primary care using machine learning

BACKGROUND: Atrial fibrillation (AF) is the most common sustained heart arrhythmia. However, as many cases are asymptomatic, a large proportion of patients remain undiagnosed until serious complications arise. Efficient, cost-effective detection of the undiagnosed may be supported by risk-prediction...

Descripción completa

Detalles Bibliográficos
Autores principales: Hill, Nathan R., Ayoubkhani, Daniel, McEwan, Phil, Sugrue, Daniel M., Farooqui, Usman, Lister, Steven, Lumley, Matthew, Bakhai, Ameet, Cohen, Alexander T., O’Neill, Mark, Clifton, David, Gordon, Jason
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6824570/
https://www.ncbi.nlm.nih.gov/pubmed/31675367
http://dx.doi.org/10.1371/journal.pone.0224582
_version_ 1783464754363236352
author Hill, Nathan R.
Ayoubkhani, Daniel
McEwan, Phil
Sugrue, Daniel M.
Farooqui, Usman
Lister, Steven
Lumley, Matthew
Bakhai, Ameet
Cohen, Alexander T.
O’Neill, Mark
Clifton, David
Gordon, Jason
author_facet Hill, Nathan R.
Ayoubkhani, Daniel
McEwan, Phil
Sugrue, Daniel M.
Farooqui, Usman
Lister, Steven
Lumley, Matthew
Bakhai, Ameet
Cohen, Alexander T.
O’Neill, Mark
Clifton, David
Gordon, Jason
author_sort Hill, Nathan R.
collection PubMed
description BACKGROUND: Atrial fibrillation (AF) is the most common sustained heart arrhythmia. However, as many cases are asymptomatic, a large proportion of patients remain undiagnosed until serious complications arise. Efficient, cost-effective detection of the undiagnosed may be supported by risk-prediction models relating patient factors to AF risk. However, there exists a need for an implementable risk model that is contemporaneous and informed by routinely collected patient data, reflecting the real-world pathology of AF. METHODS: This study sought to develop and evaluate novel and conventional statistical and machine learning models for risk-predication of AF. This was a retrospective, cohort study of adults (aged ≥30 years) without a history of AF, listed on the Clinical Practice Research Datalink, from January 2006 to December 2016. Models evaluated included published risk models (Framingham, ARIC, CHARGE-AF), machine learning models, which evaluated baseline and time-updated information (neural network, LASSO, random forests, support vector machines), and Cox regression. RESULTS: Analysis of 2,994,837 individuals (3.2% AF) identified time-varying neural networks as the optimal model achieving an AUROC of 0.827 vs. 0.725, with number needed to screen of 9 vs. 13 patients at 75% sensitivity, when compared with the best existing model CHARGE-AF. The optimal model confirmed known baseline risk factors (age, previous cardiovascular disease, antihypertensive medication usage) and identified additional time-varying predictors (proximity of cardiovascular events, body mass index (both levels and changes), pulse pressure, and the frequency of blood pressure measurements). CONCLUSION: The optimal time-varying machine learning model exhibited greater predictive performance than existing AF risk models and reflected known and new patient risk factors for AF.
format Online
Article
Text
id pubmed-6824570
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-68245702019-11-12 Predicting atrial fibrillation in primary care using machine learning Hill, Nathan R. Ayoubkhani, Daniel McEwan, Phil Sugrue, Daniel M. Farooqui, Usman Lister, Steven Lumley, Matthew Bakhai, Ameet Cohen, Alexander T. O’Neill, Mark Clifton, David Gordon, Jason PLoS One Research Article BACKGROUND: Atrial fibrillation (AF) is the most common sustained heart arrhythmia. However, as many cases are asymptomatic, a large proportion of patients remain undiagnosed until serious complications arise. Efficient, cost-effective detection of the undiagnosed may be supported by risk-prediction models relating patient factors to AF risk. However, there exists a need for an implementable risk model that is contemporaneous and informed by routinely collected patient data, reflecting the real-world pathology of AF. METHODS: This study sought to develop and evaluate novel and conventional statistical and machine learning models for risk-predication of AF. This was a retrospective, cohort study of adults (aged ≥30 years) without a history of AF, listed on the Clinical Practice Research Datalink, from January 2006 to December 2016. Models evaluated included published risk models (Framingham, ARIC, CHARGE-AF), machine learning models, which evaluated baseline and time-updated information (neural network, LASSO, random forests, support vector machines), and Cox regression. RESULTS: Analysis of 2,994,837 individuals (3.2% AF) identified time-varying neural networks as the optimal model achieving an AUROC of 0.827 vs. 0.725, with number needed to screen of 9 vs. 13 patients at 75% sensitivity, when compared with the best existing model CHARGE-AF. The optimal model confirmed known baseline risk factors (age, previous cardiovascular disease, antihypertensive medication usage) and identified additional time-varying predictors (proximity of cardiovascular events, body mass index (both levels and changes), pulse pressure, and the frequency of blood pressure measurements). CONCLUSION: The optimal time-varying machine learning model exhibited greater predictive performance than existing AF risk models and reflected known and new patient risk factors for AF. Public Library of Science 2019-11-01 /pmc/articles/PMC6824570/ /pubmed/31675367 http://dx.doi.org/10.1371/journal.pone.0224582 Text en © 2019 Hill et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Hill, Nathan R.
Ayoubkhani, Daniel
McEwan, Phil
Sugrue, Daniel M.
Farooqui, Usman
Lister, Steven
Lumley, Matthew
Bakhai, Ameet
Cohen, Alexander T.
O’Neill, Mark
Clifton, David
Gordon, Jason
Predicting atrial fibrillation in primary care using machine learning
title Predicting atrial fibrillation in primary care using machine learning
title_full Predicting atrial fibrillation in primary care using machine learning
title_fullStr Predicting atrial fibrillation in primary care using machine learning
title_full_unstemmed Predicting atrial fibrillation in primary care using machine learning
title_short Predicting atrial fibrillation in primary care using machine learning
title_sort predicting atrial fibrillation in primary care using machine learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6824570/
https://www.ncbi.nlm.nih.gov/pubmed/31675367
http://dx.doi.org/10.1371/journal.pone.0224582
work_keys_str_mv AT hillnathanr predictingatrialfibrillationinprimarycareusingmachinelearning
AT ayoubkhanidaniel predictingatrialfibrillationinprimarycareusingmachinelearning
AT mcewanphil predictingatrialfibrillationinprimarycareusingmachinelearning
AT sugruedanielm predictingatrialfibrillationinprimarycareusingmachinelearning
AT farooquiusman predictingatrialfibrillationinprimarycareusingmachinelearning
AT listersteven predictingatrialfibrillationinprimarycareusingmachinelearning
AT lumleymatthew predictingatrialfibrillationinprimarycareusingmachinelearning
AT bakhaiameet predictingatrialfibrillationinprimarycareusingmachinelearning
AT cohenalexandert predictingatrialfibrillationinprimarycareusingmachinelearning
AT oneillmark predictingatrialfibrillationinprimarycareusingmachinelearning
AT cliftondavid predictingatrialfibrillationinprimarycareusingmachinelearning
AT gordonjason predictingatrialfibrillationinprimarycareusingmachinelearning