Cargando…
Utilizing machine learning algorithms for the prediction of carotid artery plaques in a Chinese population
Background: Ischemic stroke is a significant global health issue, imposing substantial social and economic burdens. Carotid artery plaques (CAP) serve as an important risk factor for stroke, and early screening can effectively reduce stroke incidence. However, China lacks nationwide data on carotid...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10657816/ https://www.ncbi.nlm.nih.gov/pubmed/38028761 http://dx.doi.org/10.3389/fphys.2023.1295371 |
_version_ | 1785137307036155904 |
---|---|
author | Weng, Shuwei Chen, Jin Ding, Chen Hu, Die Liu, Wenwu Yang, Yanyi Peng, Daoquan |
author_facet | Weng, Shuwei Chen, Jin Ding, Chen Hu, Die Liu, Wenwu Yang, Yanyi Peng, Daoquan |
author_sort | Weng, Shuwei |
collection | PubMed |
description | Background: Ischemic stroke is a significant global health issue, imposing substantial social and economic burdens. Carotid artery plaques (CAP) serve as an important risk factor for stroke, and early screening can effectively reduce stroke incidence. However, China lacks nationwide data on carotid artery plaques. Machine learning (ML) can offer an economically efficient screening method. This study aimed to develop ML models using routine health examinations and blood markers to predict the occurrence of carotid artery plaques. Methods: This study included data from 5,211 participants aged 18–70, encompassing health check-ups and biochemical indicators. Among them, 1,164 participants were diagnosed with carotid artery plaques through carotid ultrasound. We constructed six ML models by employing feature selection with elastic net regression, selecting 13 indicators. Model performance was evaluated using accuracy, sensitivity, specificity, Positive Predictive Value (PPV), Negative Predictive Value (NPV), F1 score, kappa value, and Area Under the Curve (AUC) value. Feature importance was assessed by calculating the root mean square error (RMSE) loss after permutations for each variable in every model. Results: Among all six ML models, LightGBM achieved the highest accuracy at 91.8%. Feature importance analysis revealed that age, Low-Density Lipoprotein Cholesterol (LDL-c), and systolic blood pressure were important predictive factors in the models. Conclusion: LightGBM can effectively predict the occurrence of carotid artery plaques using demographic information, physical examination data and biochemistry data. |
format | Online Article Text |
id | pubmed-10657816 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-106578162023-11-06 Utilizing machine learning algorithms for the prediction of carotid artery plaques in a Chinese population Weng, Shuwei Chen, Jin Ding, Chen Hu, Die Liu, Wenwu Yang, Yanyi Peng, Daoquan Front Physiol Physiology Background: Ischemic stroke is a significant global health issue, imposing substantial social and economic burdens. Carotid artery plaques (CAP) serve as an important risk factor for stroke, and early screening can effectively reduce stroke incidence. However, China lacks nationwide data on carotid artery plaques. Machine learning (ML) can offer an economically efficient screening method. This study aimed to develop ML models using routine health examinations and blood markers to predict the occurrence of carotid artery plaques. Methods: This study included data from 5,211 participants aged 18–70, encompassing health check-ups and biochemical indicators. Among them, 1,164 participants were diagnosed with carotid artery plaques through carotid ultrasound. We constructed six ML models by employing feature selection with elastic net regression, selecting 13 indicators. Model performance was evaluated using accuracy, sensitivity, specificity, Positive Predictive Value (PPV), Negative Predictive Value (NPV), F1 score, kappa value, and Area Under the Curve (AUC) value. Feature importance was assessed by calculating the root mean square error (RMSE) loss after permutations for each variable in every model. Results: Among all six ML models, LightGBM achieved the highest accuracy at 91.8%. Feature importance analysis revealed that age, Low-Density Lipoprotein Cholesterol (LDL-c), and systolic blood pressure were important predictive factors in the models. Conclusion: LightGBM can effectively predict the occurrence of carotid artery plaques using demographic information, physical examination data and biochemistry data. Frontiers Media S.A. 2023-11-06 /pmc/articles/PMC10657816/ /pubmed/38028761 http://dx.doi.org/10.3389/fphys.2023.1295371 Text en Copyright © 2023 Weng, Chen, Ding, Hu, Liu, Yang and Peng. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Physiology Weng, Shuwei Chen, Jin Ding, Chen Hu, Die Liu, Wenwu Yang, Yanyi Peng, Daoquan Utilizing machine learning algorithms for the prediction of carotid artery plaques in a Chinese population |
title | Utilizing machine learning algorithms for the prediction of carotid artery plaques in a Chinese population |
title_full | Utilizing machine learning algorithms for the prediction of carotid artery plaques in a Chinese population |
title_fullStr | Utilizing machine learning algorithms for the prediction of carotid artery plaques in a Chinese population |
title_full_unstemmed | Utilizing machine learning algorithms for the prediction of carotid artery plaques in a Chinese population |
title_short | Utilizing machine learning algorithms for the prediction of carotid artery plaques in a Chinese population |
title_sort | utilizing machine learning algorithms for the prediction of carotid artery plaques in a chinese population |
topic | Physiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10657816/ https://www.ncbi.nlm.nih.gov/pubmed/38028761 http://dx.doi.org/10.3389/fphys.2023.1295371 |
work_keys_str_mv | AT wengshuwei utilizingmachinelearningalgorithmsforthepredictionofcarotidarteryplaquesinachinesepopulation AT chenjin utilizingmachinelearningalgorithmsforthepredictionofcarotidarteryplaquesinachinesepopulation AT dingchen utilizingmachinelearningalgorithmsforthepredictionofcarotidarteryplaquesinachinesepopulation AT hudie utilizingmachinelearningalgorithmsforthepredictionofcarotidarteryplaquesinachinesepopulation AT liuwenwu utilizingmachinelearningalgorithmsforthepredictionofcarotidarteryplaquesinachinesepopulation AT yangyanyi utilizingmachinelearningalgorithmsforthepredictionofcarotidarteryplaquesinachinesepopulation AT pengdaoquan utilizingmachinelearningalgorithmsforthepredictionofcarotidarteryplaquesinachinesepopulation |