Cargando…
Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups
BACKGROUND: The promise of precision population health includes the ability to use robust patient data to tailor prevention and care to specific groups. Advanced analytics may allow for automated detection of clinically informative subgroups that account for clinical, genetic, and environmental vari...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9075403/ https://www.ncbi.nlm.nih.gov/pubmed/34845917 http://dx.doi.org/10.1161/JAHA.121.021976 |
_version_ | 1784701676575260672 |
---|---|
author | Flores, Alyssa M. Schuler, Alejandro Eberhard, Anne Verena Olin, Jeffrey W. Cooke, John P. Leeper, Nicholas J. Shah, Nigam H. Ross, Elsie G. |
author_facet | Flores, Alyssa M. Schuler, Alejandro Eberhard, Anne Verena Olin, Jeffrey W. Cooke, John P. Leeper, Nicholas J. Shah, Nigam H. Ross, Elsie G. |
author_sort | Flores, Alyssa M. |
collection | PubMed |
description | BACKGROUND: The promise of precision population health includes the ability to use robust patient data to tailor prevention and care to specific groups. Advanced analytics may allow for automated detection of clinically informative subgroups that account for clinical, genetic, and environmental variability. This study sought to evaluate whether unsupervised machine learning approaches could interpret heterogeneous and missing clinical data to discover clinically important coronary artery disease subgroups. METHODS AND RESULTS: The Genetic Determinants of Peripheral Arterial Disease study is a prospective cohort that includes individuals with newly diagnosed and/or symptomatic coronary artery disease. We applied generalized low rank modeling and K‐means cluster analysis using 155 phenotypic and genetic variables from 1329 participants. Cox proportional hazard models were used to examine associations between clusters and major adverse cardiovascular and cerebrovascular events and all‐cause mortality. We then compared performance of risk stratification based on clusters and the American College of Cardiology/American Heart Association pooled cohort equations. Unsupervised analysis identified 4 phenotypically and prognostically distinct clusters. All‐cause mortality was highest in cluster 1 (oldest/most comorbid; 26%), whereas major adverse cardiovascular and cerebrovascular event rates were highest in cluster 2 (youngest/multiethnic; 41%). Cluster 4 (middle‐aged/healthiest behaviors) experienced more incident major adverse cardiovascular and cerebrovascular events (30%) than cluster 3 (middle‐aged/lowest medication adherence; 23%), despite apparently similar risk factor and lifestyle profiles. In comparison with the pooled cohort equations, cluster membership was more informative for risk assessment of myocardial infarction, stroke, and mortality. CONCLUSIONS: Unsupervised clustering identified 4 unique coronary artery disease subgroups with distinct clinical trajectories. Flexible unsupervised machine learning algorithms offer the ability to meaningfully process heterogeneous patient data and provide sharper insights into disease characterization and risk assessment. REGISTRATION: URL: https://www.clinicaltrials.gov; Unique identifier: NCT00380185. |
format | Online Article Text |
id | pubmed-9075403 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-90754032022-05-10 Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups Flores, Alyssa M. Schuler, Alejandro Eberhard, Anne Verena Olin, Jeffrey W. Cooke, John P. Leeper, Nicholas J. Shah, Nigam H. Ross, Elsie G. J Am Heart Assoc Original Research BACKGROUND: The promise of precision population health includes the ability to use robust patient data to tailor prevention and care to specific groups. Advanced analytics may allow for automated detection of clinically informative subgroups that account for clinical, genetic, and environmental variability. This study sought to evaluate whether unsupervised machine learning approaches could interpret heterogeneous and missing clinical data to discover clinically important coronary artery disease subgroups. METHODS AND RESULTS: The Genetic Determinants of Peripheral Arterial Disease study is a prospective cohort that includes individuals with newly diagnosed and/or symptomatic coronary artery disease. We applied generalized low rank modeling and K‐means cluster analysis using 155 phenotypic and genetic variables from 1329 participants. Cox proportional hazard models were used to examine associations between clusters and major adverse cardiovascular and cerebrovascular events and all‐cause mortality. We then compared performance of risk stratification based on clusters and the American College of Cardiology/American Heart Association pooled cohort equations. Unsupervised analysis identified 4 phenotypically and prognostically distinct clusters. All‐cause mortality was highest in cluster 1 (oldest/most comorbid; 26%), whereas major adverse cardiovascular and cerebrovascular event rates were highest in cluster 2 (youngest/multiethnic; 41%). Cluster 4 (middle‐aged/healthiest behaviors) experienced more incident major adverse cardiovascular and cerebrovascular events (30%) than cluster 3 (middle‐aged/lowest medication adherence; 23%), despite apparently similar risk factor and lifestyle profiles. In comparison with the pooled cohort equations, cluster membership was more informative for risk assessment of myocardial infarction, stroke, and mortality. CONCLUSIONS: Unsupervised clustering identified 4 unique coronary artery disease subgroups with distinct clinical trajectories. Flexible unsupervised machine learning algorithms offer the ability to meaningfully process heterogeneous patient data and provide sharper insights into disease characterization and risk assessment. REGISTRATION: URL: https://www.clinicaltrials.gov; Unique identifier: NCT00380185. John Wiley and Sons Inc. 2021-11-30 /pmc/articles/PMC9075403/ /pubmed/34845917 http://dx.doi.org/10.1161/JAHA.121.021976 Text en © 2021 The Authors. Published on behalf of the American Heart Association, Inc., by Wiley. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made. |
spellingShingle | Original Research Flores, Alyssa M. Schuler, Alejandro Eberhard, Anne Verena Olin, Jeffrey W. Cooke, John P. Leeper, Nicholas J. Shah, Nigam H. Ross, Elsie G. Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups |
title | Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups |
title_full | Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups |
title_fullStr | Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups |
title_full_unstemmed | Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups |
title_short | Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups |
title_sort | unsupervised learning for automated detection of coronary artery disease subgroups |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9075403/ https://www.ncbi.nlm.nih.gov/pubmed/34845917 http://dx.doi.org/10.1161/JAHA.121.021976 |
work_keys_str_mv | AT floresalyssam unsupervisedlearningforautomateddetectionofcoronaryarterydiseasesubgroups AT schuleralejandro unsupervisedlearningforautomateddetectionofcoronaryarterydiseasesubgroups AT eberhardanneverena unsupervisedlearningforautomateddetectionofcoronaryarterydiseasesubgroups AT olinjeffreyw unsupervisedlearningforautomateddetectionofcoronaryarterydiseasesubgroups AT cookejohnp unsupervisedlearningforautomateddetectionofcoronaryarterydiseasesubgroups AT leepernicholasj unsupervisedlearningforautomateddetectionofcoronaryarterydiseasesubgroups AT shahnigamh unsupervisedlearningforautomateddetectionofcoronaryarterydiseasesubgroups AT rosselsieg unsupervisedlearningforautomateddetectionofcoronaryarterydiseasesubgroups |