Cargando…
Machine Learning Approach to Classify Cardiovascular Disease in Patients With Nonalcoholic Fatty Liver Disease in the UK Biobank Cohort
BACKGROUND: Nonalcoholic fatty liver disease (NAFLD) is the most prevalent liver disease worldwide. Cardiovascular disease (CVD) is the leading cause of mortality among patients with NAFLD. The aim of our study was to develop a machine learning algorithm integrating clinical, lifestyle, and genetic...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9075189/ https://www.ncbi.nlm.nih.gov/pubmed/34927450 http://dx.doi.org/10.1161/JAHA.121.022576 |
_version_ | 1784701625002098688 |
---|---|
author | Sharma, Divya Gotlieb, Neta Farkouh, Michael E. Patel, Keyur Xu, Wei Bhat, Mamatha |
author_facet | Sharma, Divya Gotlieb, Neta Farkouh, Michael E. Patel, Keyur Xu, Wei Bhat, Mamatha |
author_sort | Sharma, Divya |
collection | PubMed |
description | BACKGROUND: Nonalcoholic fatty liver disease (NAFLD) is the most prevalent liver disease worldwide. Cardiovascular disease (CVD) is the leading cause of mortality among patients with NAFLD. The aim of our study was to develop a machine learning algorithm integrating clinical, lifestyle, and genetic risk factors to identify CVD in patients with NAFLD. METHODS AND RESULTS: We created a cohort of patients with NAFLD from the UK Biobank, diagnosed according to proton density fat fraction from magnetic resonance imaging data sets. A total of 400 patients with NAFLD with subclinical atherosclerosis or clinical CVD, defined by disease codes, constituted cases and 446 NAFLD cases with no CVD constituted controls. We evaluated 7 different supervised machine learning approaches on clinical, lifestyle, and genetic variables for identifying CVD in patients with NAFLD. The most significant clinical and lifestyle variables observed by the predictive modeling were age (59 years [54.00–63.00 years]), hypertension (145 mm Hg [134.0–156.0 mm Hg] and 85 mm Hg [79.00–93.00 mm Hg]), waist circumference (98 cm [95.00–105.00 cm]), and sedentary lifestyle, defined as time spent watching TV >4 h/d. In the genetic data, single‐nucleotide polymorphisms in IL16 and ANKLE1 gene were most significant. Our proposed ensemble‐based integrative machine learning model achieved an area under the curve of 0.849 using the random forest modeling for CVD prediction. CONCLUSIONS: We propose a machine learning algorithm that identifies CVD in patients with NAFLD through integration of significant clinical, lifestyle, and genetic risk factors. These patients with NAFLD at higher risk of CVD should be flagged for screening and aggressive treatment of their cardiometabolic risk factors to prevent cardiovascular morbidity and mortality. |
format | Online Article Text |
id | pubmed-9075189 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-90751892022-05-10 Machine Learning Approach to Classify Cardiovascular Disease in Patients With Nonalcoholic Fatty Liver Disease in the UK Biobank Cohort Sharma, Divya Gotlieb, Neta Farkouh, Michael E. Patel, Keyur Xu, Wei Bhat, Mamatha J Am Heart Assoc Original Research BACKGROUND: Nonalcoholic fatty liver disease (NAFLD) is the most prevalent liver disease worldwide. Cardiovascular disease (CVD) is the leading cause of mortality among patients with NAFLD. The aim of our study was to develop a machine learning algorithm integrating clinical, lifestyle, and genetic risk factors to identify CVD in patients with NAFLD. METHODS AND RESULTS: We created a cohort of patients with NAFLD from the UK Biobank, diagnosed according to proton density fat fraction from magnetic resonance imaging data sets. A total of 400 patients with NAFLD with subclinical atherosclerosis or clinical CVD, defined by disease codes, constituted cases and 446 NAFLD cases with no CVD constituted controls. We evaluated 7 different supervised machine learning approaches on clinical, lifestyle, and genetic variables for identifying CVD in patients with NAFLD. The most significant clinical and lifestyle variables observed by the predictive modeling were age (59 years [54.00–63.00 years]), hypertension (145 mm Hg [134.0–156.0 mm Hg] and 85 mm Hg [79.00–93.00 mm Hg]), waist circumference (98 cm [95.00–105.00 cm]), and sedentary lifestyle, defined as time spent watching TV >4 h/d. In the genetic data, single‐nucleotide polymorphisms in IL16 and ANKLE1 gene were most significant. Our proposed ensemble‐based integrative machine learning model achieved an area under the curve of 0.849 using the random forest modeling for CVD prediction. CONCLUSIONS: We propose a machine learning algorithm that identifies CVD in patients with NAFLD through integration of significant clinical, lifestyle, and genetic risk factors. These patients with NAFLD at higher risk of CVD should be flagged for screening and aggressive treatment of their cardiometabolic risk factors to prevent cardiovascular morbidity and mortality. John Wiley and Sons Inc. 2021-12-20 /pmc/articles/PMC9075189/ /pubmed/34927450 http://dx.doi.org/10.1161/JAHA.121.022576 Text en © 2021 The Authors. Published on behalf of the American Heart Association, Inc., by Wiley. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes. |
spellingShingle | Original Research Sharma, Divya Gotlieb, Neta Farkouh, Michael E. Patel, Keyur Xu, Wei Bhat, Mamatha Machine Learning Approach to Classify Cardiovascular Disease in Patients With Nonalcoholic Fatty Liver Disease in the UK Biobank Cohort |
title | Machine Learning Approach to Classify Cardiovascular Disease in Patients With Nonalcoholic Fatty Liver Disease in the UK Biobank Cohort |
title_full | Machine Learning Approach to Classify Cardiovascular Disease in Patients With Nonalcoholic Fatty Liver Disease in the UK Biobank Cohort |
title_fullStr | Machine Learning Approach to Classify Cardiovascular Disease in Patients With Nonalcoholic Fatty Liver Disease in the UK Biobank Cohort |
title_full_unstemmed | Machine Learning Approach to Classify Cardiovascular Disease in Patients With Nonalcoholic Fatty Liver Disease in the UK Biobank Cohort |
title_short | Machine Learning Approach to Classify Cardiovascular Disease in Patients With Nonalcoholic Fatty Liver Disease in the UK Biobank Cohort |
title_sort | machine learning approach to classify cardiovascular disease in patients with nonalcoholic fatty liver disease in the uk biobank cohort |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9075189/ https://www.ncbi.nlm.nih.gov/pubmed/34927450 http://dx.doi.org/10.1161/JAHA.121.022576 |
work_keys_str_mv | AT sharmadivya machinelearningapproachtoclassifycardiovasculardiseaseinpatientswithnonalcoholicfattyliverdiseaseintheukbiobankcohort AT gotliebneta machinelearningapproachtoclassifycardiovasculardiseaseinpatientswithnonalcoholicfattyliverdiseaseintheukbiobankcohort AT farkouhmichaele machinelearningapproachtoclassifycardiovasculardiseaseinpatientswithnonalcoholicfattyliverdiseaseintheukbiobankcohort AT patelkeyur machinelearningapproachtoclassifycardiovasculardiseaseinpatientswithnonalcoholicfattyliverdiseaseintheukbiobankcohort AT xuwei machinelearningapproachtoclassifycardiovasculardiseaseinpatientswithnonalcoholicfattyliverdiseaseintheukbiobankcohort AT bhatmamatha machinelearningapproachtoclassifycardiovasculardiseaseinpatientswithnonalcoholicfattyliverdiseaseintheukbiobankcohort |