Cargando…

A machine learning approach to personalized predictors of dyslipidemia: a cohort study

INTRODUCTION: Mexico ranks second in the global prevalence of obesity in the adult population, which increases the probability of developing dyslipidemia. Dyslipidemia is closely related to cardiovascular diseases, which are the leading cause of death in the country. Therefore, developing tools that...

Descripción completa

Detalles Bibliográficos
Autores principales: Gutiérrez-Esparza, Guadalupe, Pulido, Tomas, Martínez-García, Mireya, Ramírez-delReal, Tania, Groves-Miralrio, Lucero E., Márquez-Murillo, Manlio F., Amezcua-Guerra, Luis M., Vargas-Alarcón, Gilberto, Hernández-Lemus, Enrique
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10548235/
https://www.ncbi.nlm.nih.gov/pubmed/37799151
http://dx.doi.org/10.3389/fpubh.2023.1213926
_version_ 1785115234652913664
author Gutiérrez-Esparza, Guadalupe
Pulido, Tomas
Martínez-García, Mireya
Ramírez-delReal, Tania
Groves-Miralrio, Lucero E.
Márquez-Murillo, Manlio F.
Amezcua-Guerra, Luis M.
Vargas-Alarcón, Gilberto
Hernández-Lemus, Enrique
author_facet Gutiérrez-Esparza, Guadalupe
Pulido, Tomas
Martínez-García, Mireya
Ramírez-delReal, Tania
Groves-Miralrio, Lucero E.
Márquez-Murillo, Manlio F.
Amezcua-Guerra, Luis M.
Vargas-Alarcón, Gilberto
Hernández-Lemus, Enrique
author_sort Gutiérrez-Esparza, Guadalupe
collection PubMed
description INTRODUCTION: Mexico ranks second in the global prevalence of obesity in the adult population, which increases the probability of developing dyslipidemia. Dyslipidemia is closely related to cardiovascular diseases, which are the leading cause of death in the country. Therefore, developing tools that facilitate the prediction of dyslipidemias is essential for prevention and early treatment. METHODS: In this study, we utilized a dataset from a Mexico City cohort consisting of 2,621 participants, men and women aged between 20 and 50 years, with and without some type of dyslipidemia. Our primary objective was to identify potential factors associated with different types of dyslipidemia in both men and women. Machine learning algorithms were employed to achieve this goal. To facilitate feature selection, we applied the Variable Importance Measures (VIM) of Random Forest (RF), XGBoost, and Gradient Boosting Machine (GBM). Additionally, to address class imbalance, we employed Synthetic Minority Over-sampling Technique (SMOTE) for dataset resampling. The dataset encompassed anthropometric measurements, biochemical tests, dietary intake, family health history, and other health parameters, including smoking habits, alcohol consumption, quality of sleep, and physical activity. RESULTS: Our results revealed that the VIM algorithm of RF yielded the most optimal subset of attributes, closely followed by GBM, achieving a balanced accuracy of up to 80%. The selection of the best subset of attributes was based on the comparative performance of classifiers, evaluated through balanced accuracy, sensitivity, and specificity metrics. DISCUSSION: The top five features contributing to an increased risk of various types of dyslipidemia were identified through the machine learning technique. These features include body mass index, elevated uric acid levels, age, sleep disorders, and anxiety. The findings of this study shed light on significant factors that play a role in dyslipidemia development, aiding in the early identification, prevention, and treatment of this condition.
format Online
Article
Text
id pubmed-10548235
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-105482352023-10-05 A machine learning approach to personalized predictors of dyslipidemia: a cohort study Gutiérrez-Esparza, Guadalupe Pulido, Tomas Martínez-García, Mireya Ramírez-delReal, Tania Groves-Miralrio, Lucero E. Márquez-Murillo, Manlio F. Amezcua-Guerra, Luis M. Vargas-Alarcón, Gilberto Hernández-Lemus, Enrique Front Public Health Public Health INTRODUCTION: Mexico ranks second in the global prevalence of obesity in the adult population, which increases the probability of developing dyslipidemia. Dyslipidemia is closely related to cardiovascular diseases, which are the leading cause of death in the country. Therefore, developing tools that facilitate the prediction of dyslipidemias is essential for prevention and early treatment. METHODS: In this study, we utilized a dataset from a Mexico City cohort consisting of 2,621 participants, men and women aged between 20 and 50 years, with and without some type of dyslipidemia. Our primary objective was to identify potential factors associated with different types of dyslipidemia in both men and women. Machine learning algorithms were employed to achieve this goal. To facilitate feature selection, we applied the Variable Importance Measures (VIM) of Random Forest (RF), XGBoost, and Gradient Boosting Machine (GBM). Additionally, to address class imbalance, we employed Synthetic Minority Over-sampling Technique (SMOTE) for dataset resampling. The dataset encompassed anthropometric measurements, biochemical tests, dietary intake, family health history, and other health parameters, including smoking habits, alcohol consumption, quality of sleep, and physical activity. RESULTS: Our results revealed that the VIM algorithm of RF yielded the most optimal subset of attributes, closely followed by GBM, achieving a balanced accuracy of up to 80%. The selection of the best subset of attributes was based on the comparative performance of classifiers, evaluated through balanced accuracy, sensitivity, and specificity metrics. DISCUSSION: The top five features contributing to an increased risk of various types of dyslipidemia were identified through the machine learning technique. These features include body mass index, elevated uric acid levels, age, sleep disorders, and anxiety. The findings of this study shed light on significant factors that play a role in dyslipidemia development, aiding in the early identification, prevention, and treatment of this condition. Frontiers Media S.A. 2023-09-20 /pmc/articles/PMC10548235/ /pubmed/37799151 http://dx.doi.org/10.3389/fpubh.2023.1213926 Text en Copyright © 2023 Gutiérrez-Esparza, Pulido, Martínez-García, Ramírez-delReal, Groves-Miralrio, Márquez-Murillo, Amezcua-Guerra, Vargas-Alarcón and Hernández-Lemus. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Public Health
Gutiérrez-Esparza, Guadalupe
Pulido, Tomas
Martínez-García, Mireya
Ramírez-delReal, Tania
Groves-Miralrio, Lucero E.
Márquez-Murillo, Manlio F.
Amezcua-Guerra, Luis M.
Vargas-Alarcón, Gilberto
Hernández-Lemus, Enrique
A machine learning approach to personalized predictors of dyslipidemia: a cohort study
title A machine learning approach to personalized predictors of dyslipidemia: a cohort study
title_full A machine learning approach to personalized predictors of dyslipidemia: a cohort study
title_fullStr A machine learning approach to personalized predictors of dyslipidemia: a cohort study
title_full_unstemmed A machine learning approach to personalized predictors of dyslipidemia: a cohort study
title_short A machine learning approach to personalized predictors of dyslipidemia: a cohort study
title_sort machine learning approach to personalized predictors of dyslipidemia: a cohort study
topic Public Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10548235/
https://www.ncbi.nlm.nih.gov/pubmed/37799151
http://dx.doi.org/10.3389/fpubh.2023.1213926
work_keys_str_mv AT gutierrezesparzaguadalupe amachinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy
AT pulidotomas amachinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy
AT martinezgarciamireya amachinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy
AT ramirezdelrealtania amachinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy
AT grovesmiralrioluceroe amachinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy
AT marquezmurillomanliof amachinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy
AT amezcuaguerraluism amachinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy
AT vargasalarcongilberto amachinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy
AT hernandezlemusenrique amachinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy
AT gutierrezesparzaguadalupe machinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy
AT pulidotomas machinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy
AT martinezgarciamireya machinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy
AT ramirezdelrealtania machinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy
AT grovesmiralrioluceroe machinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy
AT marquezmurillomanliof machinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy
AT amezcuaguerraluism machinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy
AT vargasalarcongilberto machinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy
AT hernandezlemusenrique machinelearningapproachtopersonalizedpredictorsofdyslipidemiaacohortstudy