Cargando…

Study on the risk of coronary heart disease in middle-aged and young people based on machine learning methods: a retrospective cohort study

OBJECTIVE: To identify coronary heart disease risk factors in young and middle-aged persons and develop a tailored risk prediction model. METHODS: A retrospective cohort study was used in this research. From January 2017 to January 2020, 553 patients in the Department of Cardiology at a tertiary hos...

Descripción completa

Detalles Bibliográficos
Autores principales: Cao, Jiaoyu, Zhang, Lixiang, Ma, Likun, Zhou, Xiaojuan, Yang, Beibei, Wang, Wenjing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9653049/
https://www.ncbi.nlm.nih.gov/pubmed/36389421
http://dx.doi.org/10.7717/peerj.14078
_version_ 1784828604444573696
author Cao, Jiaoyu
Zhang, Lixiang
Ma, Likun
Zhou, Xiaojuan
Yang, Beibei
Wang, Wenjing
author_facet Cao, Jiaoyu
Zhang, Lixiang
Ma, Likun
Zhou, Xiaojuan
Yang, Beibei
Wang, Wenjing
author_sort Cao, Jiaoyu
collection PubMed
description OBJECTIVE: To identify coronary heart disease risk factors in young and middle-aged persons and develop a tailored risk prediction model. METHODS: A retrospective cohort study was used in this research. From January 2017 to January 2020, 553 patients in the Department of Cardiology at a tertiary hospital in Anhui Province were chosen as research subjects. The research subjects were separated into two groups based on the results of coronary angiography performed during hospitalization (n = 201) and non-coronary heart disease (n = 352). R software (R 3.6.1) was used to analyze the clinical data of the two groups. A logistic regression prediction model and three machine learning models, including BP neural network, Extreme gradient boosting (XGBoost), and random forest, were built, and the best prediction model was chosen based on the relevant parameters of the different machine learning models. RESULTS: Univariate analysis identified a total of 24 indexes with statistically significant differences between coronary heart disease and non-coronary heart disease groups, which were incorporated in the logistic regression model and three machine learning models. The AUCs of the test set in the logistic regression prediction model, BP neural network model, random forest model, and XGBoost model were 0.829, 0.795, 0.928, and 0.940, respectively, and the F1 scores were 0.634, 0.606, 0.846, and 0.887, indicating that the XGBoost model’s prediction value was the best. CONCLUSION: The XGBoost model, which is based on coronary heart disease risk factors in young and middle-aged people, has a high risk prediction efficiency for coronary heart disease in young and middle-aged people and can help clinical medical staff screen young and middle-aged people at high risk of coronary heart disease in clinical practice.
format Online
Article
Text
id pubmed-9653049
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-96530492022-11-15 Study on the risk of coronary heart disease in middle-aged and young people based on machine learning methods: a retrospective cohort study Cao, Jiaoyu Zhang, Lixiang Ma, Likun Zhou, Xiaojuan Yang, Beibei Wang, Wenjing PeerJ Cardiology OBJECTIVE: To identify coronary heart disease risk factors in young and middle-aged persons and develop a tailored risk prediction model. METHODS: A retrospective cohort study was used in this research. From January 2017 to January 2020, 553 patients in the Department of Cardiology at a tertiary hospital in Anhui Province were chosen as research subjects. The research subjects were separated into two groups based on the results of coronary angiography performed during hospitalization (n = 201) and non-coronary heart disease (n = 352). R software (R 3.6.1) was used to analyze the clinical data of the two groups. A logistic regression prediction model and three machine learning models, including BP neural network, Extreme gradient boosting (XGBoost), and random forest, were built, and the best prediction model was chosen based on the relevant parameters of the different machine learning models. RESULTS: Univariate analysis identified a total of 24 indexes with statistically significant differences between coronary heart disease and non-coronary heart disease groups, which were incorporated in the logistic regression model and three machine learning models. The AUCs of the test set in the logistic regression prediction model, BP neural network model, random forest model, and XGBoost model were 0.829, 0.795, 0.928, and 0.940, respectively, and the F1 scores were 0.634, 0.606, 0.846, and 0.887, indicating that the XGBoost model’s prediction value was the best. CONCLUSION: The XGBoost model, which is based on coronary heart disease risk factors in young and middle-aged people, has a high risk prediction efficiency for coronary heart disease in young and middle-aged people and can help clinical medical staff screen young and middle-aged people at high risk of coronary heart disease in clinical practice. PeerJ Inc. 2022-11-09 /pmc/articles/PMC9653049/ /pubmed/36389421 http://dx.doi.org/10.7717/peerj.14078 Text en © 2022 Cao et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Cardiology
Cao, Jiaoyu
Zhang, Lixiang
Ma, Likun
Zhou, Xiaojuan
Yang, Beibei
Wang, Wenjing
Study on the risk of coronary heart disease in middle-aged and young people based on machine learning methods: a retrospective cohort study
title Study on the risk of coronary heart disease in middle-aged and young people based on machine learning methods: a retrospective cohort study
title_full Study on the risk of coronary heart disease in middle-aged and young people based on machine learning methods: a retrospective cohort study
title_fullStr Study on the risk of coronary heart disease in middle-aged and young people based on machine learning methods: a retrospective cohort study
title_full_unstemmed Study on the risk of coronary heart disease in middle-aged and young people based on machine learning methods: a retrospective cohort study
title_short Study on the risk of coronary heart disease in middle-aged and young people based on machine learning methods: a retrospective cohort study
title_sort study on the risk of coronary heart disease in middle-aged and young people based on machine learning methods: a retrospective cohort study
topic Cardiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9653049/
https://www.ncbi.nlm.nih.gov/pubmed/36389421
http://dx.doi.org/10.7717/peerj.14078
work_keys_str_mv AT caojiaoyu studyontheriskofcoronaryheartdiseaseinmiddleagedandyoungpeoplebasedonmachinelearningmethodsaretrospectivecohortstudy
AT zhanglixiang studyontheriskofcoronaryheartdiseaseinmiddleagedandyoungpeoplebasedonmachinelearningmethodsaretrospectivecohortstudy
AT malikun studyontheriskofcoronaryheartdiseaseinmiddleagedandyoungpeoplebasedonmachinelearningmethodsaretrospectivecohortstudy
AT zhouxiaojuan studyontheriskofcoronaryheartdiseaseinmiddleagedandyoungpeoplebasedonmachinelearningmethodsaretrospectivecohortstudy
AT yangbeibei studyontheriskofcoronaryheartdiseaseinmiddleagedandyoungpeoplebasedonmachinelearningmethodsaretrospectivecohortstudy
AT wangwenjing studyontheriskofcoronaryheartdiseaseinmiddleagedandyoungpeoplebasedonmachinelearningmethodsaretrospectivecohortstudy