Cargando…

Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups

BACKGROUND: The promise of precision population health includes the ability to use robust patient data to tailor prevention and care to specific groups. Advanced analytics may allow for automated detection of clinically informative subgroups that account for clinical, genetic, and environmental vari...

Descripción completa

Detalles Bibliográficos
Autores principales: Flores, Alyssa M., Schuler, Alejandro, Eberhard, Anne Verena, Olin, Jeffrey W., Cooke, John P., Leeper, Nicholas J., Shah, Nigam H., Ross, Elsie G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9075403/
https://www.ncbi.nlm.nih.gov/pubmed/34845917
http://dx.doi.org/10.1161/JAHA.121.021976
_version_ 1784701676575260672
author Flores, Alyssa M.
Schuler, Alejandro
Eberhard, Anne Verena
Olin, Jeffrey W.
Cooke, John P.
Leeper, Nicholas J.
Shah, Nigam H.
Ross, Elsie G.
author_facet Flores, Alyssa M.
Schuler, Alejandro
Eberhard, Anne Verena
Olin, Jeffrey W.
Cooke, John P.
Leeper, Nicholas J.
Shah, Nigam H.
Ross, Elsie G.
author_sort Flores, Alyssa M.
collection PubMed
description BACKGROUND: The promise of precision population health includes the ability to use robust patient data to tailor prevention and care to specific groups. Advanced analytics may allow for automated detection of clinically informative subgroups that account for clinical, genetic, and environmental variability. This study sought to evaluate whether unsupervised machine learning approaches could interpret heterogeneous and missing clinical data to discover clinically important coronary artery disease subgroups. METHODS AND RESULTS: The Genetic Determinants of Peripheral Arterial Disease study is a prospective cohort that includes individuals with newly diagnosed and/or symptomatic coronary artery disease. We applied generalized low rank modeling and K‐means cluster analysis using 155 phenotypic and genetic variables from 1329 participants. Cox proportional hazard models were used to examine associations between clusters and major adverse cardiovascular and cerebrovascular events and all‐cause mortality. We then compared performance of risk stratification based on clusters and the American College of Cardiology/American Heart Association pooled cohort equations. Unsupervised analysis identified 4 phenotypically and prognostically distinct clusters. All‐cause mortality was highest in cluster 1 (oldest/most comorbid; 26%), whereas major adverse cardiovascular and cerebrovascular event rates were highest in cluster 2 (youngest/multiethnic; 41%). Cluster 4 (middle‐aged/healthiest behaviors) experienced more incident major adverse cardiovascular and cerebrovascular events (30%) than cluster 3 (middle‐aged/lowest medication adherence; 23%), despite apparently similar risk factor and lifestyle profiles. In comparison with the pooled cohort equations, cluster membership was more informative for risk assessment of myocardial infarction, stroke, and mortality. CONCLUSIONS: Unsupervised clustering identified 4 unique coronary artery disease subgroups with distinct clinical trajectories. Flexible unsupervised machine learning algorithms offer the ability to meaningfully process heterogeneous patient data and provide sharper insights into disease characterization and risk assessment. REGISTRATION: URL: https://www.clinicaltrials.gov; Unique identifier: NCT00380185.
format Online
Article
Text
id pubmed-9075403
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-90754032022-05-10 Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups Flores, Alyssa M. Schuler, Alejandro Eberhard, Anne Verena Olin, Jeffrey W. Cooke, John P. Leeper, Nicholas J. Shah, Nigam H. Ross, Elsie G. J Am Heart Assoc Original Research BACKGROUND: The promise of precision population health includes the ability to use robust patient data to tailor prevention and care to specific groups. Advanced analytics may allow for automated detection of clinically informative subgroups that account for clinical, genetic, and environmental variability. This study sought to evaluate whether unsupervised machine learning approaches could interpret heterogeneous and missing clinical data to discover clinically important coronary artery disease subgroups. METHODS AND RESULTS: The Genetic Determinants of Peripheral Arterial Disease study is a prospective cohort that includes individuals with newly diagnosed and/or symptomatic coronary artery disease. We applied generalized low rank modeling and K‐means cluster analysis using 155 phenotypic and genetic variables from 1329 participants. Cox proportional hazard models were used to examine associations between clusters and major adverse cardiovascular and cerebrovascular events and all‐cause mortality. We then compared performance of risk stratification based on clusters and the American College of Cardiology/American Heart Association pooled cohort equations. Unsupervised analysis identified 4 phenotypically and prognostically distinct clusters. All‐cause mortality was highest in cluster 1 (oldest/most comorbid; 26%), whereas major adverse cardiovascular and cerebrovascular event rates were highest in cluster 2 (youngest/multiethnic; 41%). Cluster 4 (middle‐aged/healthiest behaviors) experienced more incident major adverse cardiovascular and cerebrovascular events (30%) than cluster 3 (middle‐aged/lowest medication adherence; 23%), despite apparently similar risk factor and lifestyle profiles. In comparison with the pooled cohort equations, cluster membership was more informative for risk assessment of myocardial infarction, stroke, and mortality. CONCLUSIONS: Unsupervised clustering identified 4 unique coronary artery disease subgroups with distinct clinical trajectories. Flexible unsupervised machine learning algorithms offer the ability to meaningfully process heterogeneous patient data and provide sharper insights into disease characterization and risk assessment. REGISTRATION: URL: https://www.clinicaltrials.gov; Unique identifier: NCT00380185. John Wiley and Sons Inc. 2021-11-30 /pmc/articles/PMC9075403/ /pubmed/34845917 http://dx.doi.org/10.1161/JAHA.121.021976 Text en © 2021 The Authors. Published on behalf of the American Heart Association, Inc., by Wiley. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle Original Research
Flores, Alyssa M.
Schuler, Alejandro
Eberhard, Anne Verena
Olin, Jeffrey W.
Cooke, John P.
Leeper, Nicholas J.
Shah, Nigam H.
Ross, Elsie G.
Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups
title Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups
title_full Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups
title_fullStr Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups
title_full_unstemmed Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups
title_short Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups
title_sort unsupervised learning for automated detection of coronary artery disease subgroups
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9075403/
https://www.ncbi.nlm.nih.gov/pubmed/34845917
http://dx.doi.org/10.1161/JAHA.121.021976
work_keys_str_mv AT floresalyssam unsupervisedlearningforautomateddetectionofcoronaryarterydiseasesubgroups
AT schuleralejandro unsupervisedlearningforautomateddetectionofcoronaryarterydiseasesubgroups
AT eberhardanneverena unsupervisedlearningforautomateddetectionofcoronaryarterydiseasesubgroups
AT olinjeffreyw unsupervisedlearningforautomateddetectionofcoronaryarterydiseasesubgroups
AT cookejohnp unsupervisedlearningforautomateddetectionofcoronaryarterydiseasesubgroups
AT leepernicholasj unsupervisedlearningforautomateddetectionofcoronaryarterydiseasesubgroups
AT shahnigamh unsupervisedlearningforautomateddetectionofcoronaryarterydiseasesubgroups
AT rosselsieg unsupervisedlearningforautomateddetectionofcoronaryarterydiseasesubgroups