Cargando…
Comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general Chinese population
BACKGROUND: Accurately predicting the risk of atherosclerotic cardiovascular disease (ASCVD) is crucial for implementing individualized prevention strategies and improving patient outcomes. Our objective is to develop machine learning (ML)-based models for predicting ASCVD risk in a prospective Chin...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10367272/ https://www.ncbi.nlm.nih.gov/pubmed/37488520 http://dx.doi.org/10.1186/s12911-023-02242-z |
_version_ | 1785077352672264192 |
---|---|
author | Fan, Zihao Du, Zhi Fu, Jinrong Zhou, Ying Zhang, Pengyu Shi, Chuning Sun, Yingxian |
author_facet | Fan, Zihao Du, Zhi Fu, Jinrong Zhou, Ying Zhang, Pengyu Shi, Chuning Sun, Yingxian |
author_sort | Fan, Zihao |
collection | PubMed |
description | BACKGROUND: Accurately predicting the risk of atherosclerotic cardiovascular disease (ASCVD) is crucial for implementing individualized prevention strategies and improving patient outcomes. Our objective is to develop machine learning (ML)-based models for predicting ASCVD risk in a prospective Chinese population and compare their performance with conventional regression models. METHODS: A hybrid dataset consisting of 551 features was used, including 98 demographic, behavioral, and psychological features, 444 Electrocardiograph (ECG) features, and 9 Echocardiography (Echo) features. Seven machine learning (ML)-based models were trained, validated, and tested after selecting the 30 most informative features. We compared the discrimination, calibration, net benefit, and net reclassification improvement (NRI) of the ML models with those of conventional ASCVD risk calculators, such as the Pooled Cohort Equations (PCE) and Prediction for ASCVD Risk in China (China-PAR). RESULTS: The study included 9,609 participants (mean age 53.4 ± 10.4 years, 53.7% female), and during a median follow-up of 4.7 years, 431 (4.5%) participants developed ASCVD. In the testing set, the final ML-based ANN model outperformed PCE, China-PAR, recalibrated PCE, and recalibrated China-PAR in predicting ASCVD. This was demonstrated by the model's higher area under the curve (AUC) of 0.800, compared to 0.777, 0.780, 0.779, and 0.779 for the other models, respectively. Additionally, the model had a lower Hosmer–Lemeshow χ2 of 9.1, compared to 37.3, 67.6, 126.6, and 18.6 for the other models. The net benefit at a threshold of 5% was also higher for the ML-based ANN model at 0.017, compared to 0.016, 0.013, 0.017, and 0.016 for the other models, respectively. Furthermore, the NRI was 0.089 for the ML-based ANN model, while it was 0.355, 0.098, and 0.088 for PCE, China-PAR, and recalibrated PCE, respectively. CONCLUSIONS: Compared to conventional regression ASCVD risk calculators, such as PCE and China-PAR, the ANN prediction model may help optimize identification of individuals at heightened cardiovascular risk by flexibly incorporating a wider range of potential predictors. The findings may help guide clinical decision-making and ultimately contribute to ASCVD prevention and management. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02242-z. |
format | Online Article Text |
id | pubmed-10367272 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-103672722023-07-26 Comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general Chinese population Fan, Zihao Du, Zhi Fu, Jinrong Zhou, Ying Zhang, Pengyu Shi, Chuning Sun, Yingxian BMC Med Inform Decis Mak Research BACKGROUND: Accurately predicting the risk of atherosclerotic cardiovascular disease (ASCVD) is crucial for implementing individualized prevention strategies and improving patient outcomes. Our objective is to develop machine learning (ML)-based models for predicting ASCVD risk in a prospective Chinese population and compare their performance with conventional regression models. METHODS: A hybrid dataset consisting of 551 features was used, including 98 demographic, behavioral, and psychological features, 444 Electrocardiograph (ECG) features, and 9 Echocardiography (Echo) features. Seven machine learning (ML)-based models were trained, validated, and tested after selecting the 30 most informative features. We compared the discrimination, calibration, net benefit, and net reclassification improvement (NRI) of the ML models with those of conventional ASCVD risk calculators, such as the Pooled Cohort Equations (PCE) and Prediction for ASCVD Risk in China (China-PAR). RESULTS: The study included 9,609 participants (mean age 53.4 ± 10.4 years, 53.7% female), and during a median follow-up of 4.7 years, 431 (4.5%) participants developed ASCVD. In the testing set, the final ML-based ANN model outperformed PCE, China-PAR, recalibrated PCE, and recalibrated China-PAR in predicting ASCVD. This was demonstrated by the model's higher area under the curve (AUC) of 0.800, compared to 0.777, 0.780, 0.779, and 0.779 for the other models, respectively. Additionally, the model had a lower Hosmer–Lemeshow χ2 of 9.1, compared to 37.3, 67.6, 126.6, and 18.6 for the other models. The net benefit at a threshold of 5% was also higher for the ML-based ANN model at 0.017, compared to 0.016, 0.013, 0.017, and 0.016 for the other models, respectively. Furthermore, the NRI was 0.089 for the ML-based ANN model, while it was 0.355, 0.098, and 0.088 for PCE, China-PAR, and recalibrated PCE, respectively. CONCLUSIONS: Compared to conventional regression ASCVD risk calculators, such as PCE and China-PAR, the ANN prediction model may help optimize identification of individuals at heightened cardiovascular risk by flexibly incorporating a wider range of potential predictors. The findings may help guide clinical decision-making and ultimately contribute to ASCVD prevention and management. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02242-z. BioMed Central 2023-07-24 /pmc/articles/PMC10367272/ /pubmed/37488520 http://dx.doi.org/10.1186/s12911-023-02242-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Fan, Zihao Du, Zhi Fu, Jinrong Zhou, Ying Zhang, Pengyu Shi, Chuning Sun, Yingxian Comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general Chinese population |
title | Comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general Chinese population |
title_full | Comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general Chinese population |
title_fullStr | Comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general Chinese population |
title_full_unstemmed | Comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general Chinese population |
title_short | Comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general Chinese population |
title_sort | comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general chinese population |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10367272/ https://www.ncbi.nlm.nih.gov/pubmed/37488520 http://dx.doi.org/10.1186/s12911-023-02242-z |
work_keys_str_mv | AT fanzihao comparingtheperformanceofmachinelearningandconventionalmodelsforpredictingatheroscleroticcardiovasculardiseaseinageneralchinesepopulation AT duzhi comparingtheperformanceofmachinelearningandconventionalmodelsforpredictingatheroscleroticcardiovasculardiseaseinageneralchinesepopulation AT fujinrong comparingtheperformanceofmachinelearningandconventionalmodelsforpredictingatheroscleroticcardiovasculardiseaseinageneralchinesepopulation AT zhouying comparingtheperformanceofmachinelearningandconventionalmodelsforpredictingatheroscleroticcardiovasculardiseaseinageneralchinesepopulation AT zhangpengyu comparingtheperformanceofmachinelearningandconventionalmodelsforpredictingatheroscleroticcardiovasculardiseaseinageneralchinesepopulation AT shichuning comparingtheperformanceofmachinelearningandconventionalmodelsforpredictingatheroscleroticcardiovasculardiseaseinageneralchinesepopulation AT sunyingxian comparingtheperformanceofmachinelearningandconventionalmodelsforpredictingatheroscleroticcardiovasculardiseaseinageneralchinesepopulation |