Cargando…
A machine learning approach to personalized predictors of dyslipidemia: a cohort study
INTRODUCTION: Mexico ranks second in the global prevalence of obesity in the adult population, which increases the probability of developing dyslipidemia. Dyslipidemia is closely related to cardiovascular diseases, which are the leading cause of death in the country. Therefore, developing tools that...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10548235/ https://www.ncbi.nlm.nih.gov/pubmed/37799151 http://dx.doi.org/10.3389/fpubh.2023.1213926 |
_version_ | 1785115234652913664 |
---|---|
author | Gutiérrez-Esparza, Guadalupe Pulido, Tomas Martínez-García, Mireya Ramírez-delReal, Tania Groves-Miralrio, Lucero E. Márquez-Murillo, Manlio F. Amezcua-Guerra, Luis M. Vargas-Alarcón, Gilberto Hernández-Lemus, Enrique |
author_facet | Gutiérrez-Esparza, Guadalupe Pulido, Tomas Martínez-García, Mireya Ramírez-delReal, Tania Groves-Miralrio, Lucero E. Márquez-Murillo, Manlio F. Amezcua-Guerra, Luis M. Vargas-Alarcón, Gilberto Hernández-Lemus, Enrique |
author_sort | Gutiérrez-Esparza, Guadalupe |
collection | PubMed |
description | INTRODUCTION: Mexico ranks second in the global prevalence of obesity in the adult population, which increases the probability of developing dyslipidemia. Dyslipidemia is closely related to cardiovascular diseases, which are the leading cause of death in the country. Therefore, developing tools that facilitate the prediction of dyslipidemias is essential for prevention and early treatment. METHODS: In this study, we utilized a dataset from a Mexico City cohort consisting of 2,621 participants, men and women aged between 20 and 50 years, with and without some type of dyslipidemia. Our primary objective was to identify potential factors associated with different types of dyslipidemia in both men and women. Machine learning algorithms were employed to achieve this goal. To facilitate feature selection, we applied the Variable Importance Measures (VIM) of Random Forest (RF), XGBoost, and Gradient Boosting Machine (GBM). Additionally, to address class imbalance, we employed Synthetic Minority Over-sampling Technique (SMOTE) for dataset resampling. The dataset encompassed anthropometric measurements, biochemical tests, dietary intake, family health history, and other health parameters, including smoking habits, alcohol consumption, quality of sleep, and physical activity. RESULTS: Our results revealed that the VIM algorithm of RF yielded the most optimal subset of attributes, closely followed by GBM, achieving a balanced accuracy of up to 80%. The selection of the best subset of attributes was based on the comparative performance of classifiers, evaluated through balanced accuracy, sensitivity, and specificity metrics. DISCUSSION: The top five features contributing to an increased risk of various types of dyslipidemia were identified through the machine learning technique. These features include body mass index, elevated uric acid levels, age, sleep disorders, and anxiety. The findings of this study shed light on significant factors that play a role in dyslipidemia development, aiding in the early identification, prevention, and treatment of this condition. |
format | Online Article Text |
id | pubmed-10548235 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-105482352023-10-05 A machine learning approach to personalized predictors of dyslipidemia: a cohort study Gutiérrez-Esparza, Guadalupe Pulido, Tomas Martínez-García, Mireya Ramírez-delReal, Tania Groves-Miralrio, Lucero E. Márquez-Murillo, Manlio F. Amezcua-Guerra, Luis M. Vargas-Alarcón, Gilberto Hernández-Lemus, Enrique Front Public Health Public Health INTRODUCTION: Mexico ranks second in the global prevalence of obesity in the adult population, which increases the probability of developing dyslipidemia. Dyslipidemia is closely related to cardiovascular diseases, which are the leading cause of death in the country. Therefore, developing tools that facilitate the prediction of dyslipidemias is essential for prevention and early treatment. METHODS: In this study, we utilized a dataset from a Mexico City cohort consisting of 2,621 participants, men and women aged between 20 and 50 years, with and without some type of dyslipidemia. Our primary objective was to identify potential factors associated with different types of dyslipidemia in both men and women. Machine learning algorithms were employed to achieve this goal. To facilitate feature selection, we applied the Variable Importance Measures (VIM) of Random Forest (RF), XGBoost, and Gradient Boosting Machine (GBM). Additionally, to address class imbalance, we employed Synthetic Minority Over-sampling Technique (SMOTE) for dataset resampling. The dataset encompassed anthropometric measurements, biochemical tests, dietary intake, family health history, and other health parameters, including smoking habits, alcohol consumption, quality of sleep, and physical activity. RESULTS: Our results revealed that the VIM algorithm of RF yielded the most optimal subset of attributes, closely followed by GBM, achieving a balanced accuracy of up to 80%. The selection of the best subset of attributes was based on the comparative performance of classifiers, evaluated through balanced accuracy, sensitivity, and specificity metrics. DISCUSSION: The top five features contributing to an increased risk of various types of dyslipidemia were identified through the machine learning technique. These features include body mass index, elevated uric acid levels, age, sleep disorders, and anxiety. The findings of this study shed light on significant factors that play a role in dyslipidemia development, aiding in the early identification, prevention, and treatment of this condition. Frontiers Media S.A. 2023-09-20 /pmc/articles/PMC10548235/ /pubmed/37799151 http://dx.doi.org/10.3389/fpubh.2023.1213926 Text en Copyright © 2023 Gutiérrez-Esparza, Pulido, Martínez-García, Ramírez-delReal, Groves-Miralrio, Márquez-Murillo, Amezcua-Guerra, Vargas-Alarcón and Hernández-Lemus. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Public Health Gutiérrez-Esparza, Guadalupe Pulido, Tomas Martínez-García, Mireya Ramírez-delReal, Tania Groves-Miralrio, Lucero E. Márquez-Murillo, Manlio F. Amezcua-Guerra, Luis M. Vargas-Alarcón, Gilberto Hernández-Lemus, Enrique A machine learning approach to personalized predictors of dyslipidemia: a cohort study |
title | A machine learning approach to personalized predictors of dyslipidemia: a cohort study |
title_full | A machine learning approach to personalized predictors of dyslipidemia: a cohort study |
title_fullStr | A machine learning approach to personalized predictors of dyslipidemia: a cohort study |
title_full_unstemmed | A machine learning approach to personalized predictors of dyslipidemia: a cohort study |
title_short | A machine learning approach to personalized predictors of dyslipidemia: a cohort study |
title_sort | machine learning approach to personalized predictors of dyslipidemia: a cohort study |
topic | Public Health |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10548235/ https://www.ncbi.nlm.nih.gov/pubmed/37799151 http://dx.doi.org/10.3389/fpubh.2023.1213926 |
work_keys_str_mv | AT gutierrezesparzaguadalupe amachinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy AT pulidotomas amachinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy AT martinezgarciamireya amachinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy AT ramirezdelrealtania amachinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy AT grovesmiralrioluceroe amachinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy AT marquezmurillomanliof amachinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy AT amezcuaguerraluism amachinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy AT vargasalarcongilberto amachinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy AT hernandezlemusenrique amachinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy AT gutierrezesparzaguadalupe machinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy AT pulidotomas machinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy AT martinezgarciamireya machinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy AT ramirezdelrealtania machinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy AT grovesmiralrioluceroe machinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy AT marquezmurillomanliof machinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy AT amezcuaguerraluism machinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy AT vargasalarcongilberto machinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy AT hernandezlemusenrique machinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy |