Cargando…

Comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general Chinese population

BACKGROUND: Accurately predicting the risk of atherosclerotic cardiovascular disease (ASCVD) is crucial for implementing individualized prevention strategies and improving patient outcomes. Our objective is to develop machine learning (ML)-based models for predicting ASCVD risk in a prospective Chin...

Descripción completa

Detalles Bibliográficos
Autores principales: Fan, Zihao, Du, Zhi, Fu, Jinrong, Zhou, Ying, Zhang, Pengyu, Shi, Chuning, Sun, Yingxian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10367272/
https://www.ncbi.nlm.nih.gov/pubmed/37488520
http://dx.doi.org/10.1186/s12911-023-02242-z
_version_ 1785077352672264192
author Fan, Zihao
Du, Zhi
Fu, Jinrong
Zhou, Ying
Zhang, Pengyu
Shi, Chuning
Sun, Yingxian
author_facet Fan, Zihao
Du, Zhi
Fu, Jinrong
Zhou, Ying
Zhang, Pengyu
Shi, Chuning
Sun, Yingxian
author_sort Fan, Zihao
collection PubMed
description BACKGROUND: Accurately predicting the risk of atherosclerotic cardiovascular disease (ASCVD) is crucial for implementing individualized prevention strategies and improving patient outcomes. Our objective is to develop machine learning (ML)-based models for predicting ASCVD risk in a prospective Chinese population and compare their performance with conventional regression models. METHODS: A hybrid dataset consisting of 551 features was used, including 98 demographic, behavioral, and psychological features, 444 Electrocardiograph (ECG) features, and 9 Echocardiography (Echo) features. Seven machine learning (ML)-based models were trained, validated, and tested after selecting the 30 most informative features. We compared the discrimination, calibration, net benefit, and net reclassification improvement (NRI) of the ML models with those of conventional ASCVD risk calculators, such as the Pooled Cohort Equations (PCE) and Prediction for ASCVD Risk in China (China-PAR). RESULTS: The study included 9,609 participants (mean age 53.4 ± 10.4 years, 53.7% female), and during a median follow-up of 4.7 years, 431 (4.5%) participants developed ASCVD. In the testing set, the final ML-based ANN model outperformed PCE, China-PAR, recalibrated PCE, and recalibrated China-PAR in predicting ASCVD. This was demonstrated by the model's higher area under the curve (AUC) of 0.800, compared to 0.777, 0.780, 0.779, and 0.779 for the other models, respectively. Additionally, the model had a lower Hosmer–Lemeshow χ2 of 9.1, compared to 37.3, 67.6, 126.6, and 18.6 for the other models. The net benefit at a threshold of 5% was also higher for the ML-based ANN model at 0.017, compared to 0.016, 0.013, 0.017, and 0.016 for the other models, respectively. Furthermore, the NRI was 0.089 for the ML-based ANN model, while it was 0.355, 0.098, and 0.088 for PCE, China-PAR, and recalibrated PCE, respectively. CONCLUSIONS: Compared to conventional regression ASCVD risk calculators, such as PCE and China-PAR, the ANN prediction model may help optimize identification of individuals at heightened cardiovascular risk by flexibly incorporating a wider range of potential predictors. The findings may help guide clinical decision-making and ultimately contribute to ASCVD prevention and management. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02242-z.
format Online
Article
Text
id pubmed-10367272
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-103672722023-07-26 Comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general Chinese population Fan, Zihao Du, Zhi Fu, Jinrong Zhou, Ying Zhang, Pengyu Shi, Chuning Sun, Yingxian BMC Med Inform Decis Mak Research BACKGROUND: Accurately predicting the risk of atherosclerotic cardiovascular disease (ASCVD) is crucial for implementing individualized prevention strategies and improving patient outcomes. Our objective is to develop machine learning (ML)-based models for predicting ASCVD risk in a prospective Chinese population and compare their performance with conventional regression models. METHODS: A hybrid dataset consisting of 551 features was used, including 98 demographic, behavioral, and psychological features, 444 Electrocardiograph (ECG) features, and 9 Echocardiography (Echo) features. Seven machine learning (ML)-based models were trained, validated, and tested after selecting the 30 most informative features. We compared the discrimination, calibration, net benefit, and net reclassification improvement (NRI) of the ML models with those of conventional ASCVD risk calculators, such as the Pooled Cohort Equations (PCE) and Prediction for ASCVD Risk in China (China-PAR). RESULTS: The study included 9,609 participants (mean age 53.4 ± 10.4 years, 53.7% female), and during a median follow-up of 4.7 years, 431 (4.5%) participants developed ASCVD. In the testing set, the final ML-based ANN model outperformed PCE, China-PAR, recalibrated PCE, and recalibrated China-PAR in predicting ASCVD. This was demonstrated by the model's higher area under the curve (AUC) of 0.800, compared to 0.777, 0.780, 0.779, and 0.779 for the other models, respectively. Additionally, the model had a lower Hosmer–Lemeshow χ2 of 9.1, compared to 37.3, 67.6, 126.6, and 18.6 for the other models. The net benefit at a threshold of 5% was also higher for the ML-based ANN model at 0.017, compared to 0.016, 0.013, 0.017, and 0.016 for the other models, respectively. Furthermore, the NRI was 0.089 for the ML-based ANN model, while it was 0.355, 0.098, and 0.088 for PCE, China-PAR, and recalibrated PCE, respectively. CONCLUSIONS: Compared to conventional regression ASCVD risk calculators, such as PCE and China-PAR, the ANN prediction model may help optimize identification of individuals at heightened cardiovascular risk by flexibly incorporating a wider range of potential predictors. The findings may help guide clinical decision-making and ultimately contribute to ASCVD prevention and management. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02242-z. BioMed Central 2023-07-24 /pmc/articles/PMC10367272/ /pubmed/37488520 http://dx.doi.org/10.1186/s12911-023-02242-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Fan, Zihao
Du, Zhi
Fu, Jinrong
Zhou, Ying
Zhang, Pengyu
Shi, Chuning
Sun, Yingxian
Comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general Chinese population
title Comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general Chinese population
title_full Comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general Chinese population
title_fullStr Comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general Chinese population
title_full_unstemmed Comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general Chinese population
title_short Comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general Chinese population
title_sort comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general chinese population
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10367272/
https://www.ncbi.nlm.nih.gov/pubmed/37488520
http://dx.doi.org/10.1186/s12911-023-02242-z
work_keys_str_mv AT fanzihao comparingtheperformanceofmachinelearningandconventionalmodelsforpredictingatheroscleroticcardiovasculardiseaseinageneralchinesepopulation
AT duzhi comparingtheperformanceofmachinelearningandconventionalmodelsforpredictingatheroscleroticcardiovasculardiseaseinageneralchinesepopulation
AT fujinrong comparingtheperformanceofmachinelearningandconventionalmodelsforpredictingatheroscleroticcardiovasculardiseaseinageneralchinesepopulation
AT zhouying comparingtheperformanceofmachinelearningandconventionalmodelsforpredictingatheroscleroticcardiovasculardiseaseinageneralchinesepopulation
AT zhangpengyu comparingtheperformanceofmachinelearningandconventionalmodelsforpredictingatheroscleroticcardiovasculardiseaseinageneralchinesepopulation
AT shichuning comparingtheperformanceofmachinelearningandconventionalmodelsforpredictingatheroscleroticcardiovasculardiseaseinageneralchinesepopulation
AT sunyingxian comparingtheperformanceofmachinelearningandconventionalmodelsforpredictingatheroscleroticcardiovasculardiseaseinageneralchinesepopulation