Cargando…

Simple Method to Predict Insulin Resistance in Children Aged 6–12 Years by Using Machine Learning

BACKGROUND: Due to the increasing insulin resistance (IR) in childhood, rates of diabetes and cardiovascular disease may rise in the future and seriously threaten the healthy development of children. Finding an easy way to predict IR in children can help pediatricians to identify these children in t...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Qian, Wan, Nai-jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Dove 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9526431/
https://www.ncbi.nlm.nih.gov/pubmed/36193541
http://dx.doi.org/10.2147/DMSO.S380772
_version_ 1784800875168923648
author Zhang, Qian
Wan, Nai-jun
author_facet Zhang, Qian
Wan, Nai-jun
author_sort Zhang, Qian
collection PubMed
description BACKGROUND: Due to the increasing insulin resistance (IR) in childhood, rates of diabetes and cardiovascular disease may rise in the future and seriously threaten the healthy development of children. Finding an easy way to predict IR in children can help pediatricians to identify these children in time and intervene appropriately, which is particularly important for practitioners in primary health care. PATIENTS AND METHODS: Seventeen features from 503 children 6–12 years old were collected. We defined IR by HOMA-IR greater than 3.0, thus classifying children with IR and those without IR. Data were preprocessed by multivariate imputation and oversampling to resolve missing values and data imbalances; then, recursive feature elimination was applied to further select features of interest, and 5 machine learning methods—namely, logistic regression (LR), support vector machine (SVM), random forest (RF), extreme gradient boosting (XGBoost), and gradient boosting with categorical features support (CatBoost)—were used for model training. We tested the trained models on an external test set containing information from 133 children, from which performance metrics were extracted and the optimal model was selected. RESULTS: After feature selection, the numbers of chosen features for the LR, SVM, RF, XGBoost, and CatBoost models were 6, 9, 10, 14, and 6, respectively. Among them, glucose, waist circumference, and age were chosen as predictors by most of the models. Finally, all 5 models achieved good performance on the external test set. Both XGBoost and CatBoost had the same AUC (0.85), which was highest among those of all models. Their accuracy, sensitivity, precision, and F1 scores were also close, but the specificity of XGBoost reached 0.79, which was significantly higher than that of CatBoost, so XGBoost was chosen as the optimal model. CONCLUSION: The model developed herein has a good predictive ability for IR in children 6–12 years old and can be clinically applied to help pediatricians identify children with IR in a simple and inexpensive way.
format Online
Article
Text
id pubmed-9526431
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Dove
record_format MEDLINE/PubMed
spelling pubmed-95264312022-10-02 Simple Method to Predict Insulin Resistance in Children Aged 6–12 Years by Using Machine Learning Zhang, Qian Wan, Nai-jun Diabetes Metab Syndr Obes Original Research BACKGROUND: Due to the increasing insulin resistance (IR) in childhood, rates of diabetes and cardiovascular disease may rise in the future and seriously threaten the healthy development of children. Finding an easy way to predict IR in children can help pediatricians to identify these children in time and intervene appropriately, which is particularly important for practitioners in primary health care. PATIENTS AND METHODS: Seventeen features from 503 children 6–12 years old were collected. We defined IR by HOMA-IR greater than 3.0, thus classifying children with IR and those without IR. Data were preprocessed by multivariate imputation and oversampling to resolve missing values and data imbalances; then, recursive feature elimination was applied to further select features of interest, and 5 machine learning methods—namely, logistic regression (LR), support vector machine (SVM), random forest (RF), extreme gradient boosting (XGBoost), and gradient boosting with categorical features support (CatBoost)—were used for model training. We tested the trained models on an external test set containing information from 133 children, from which performance metrics were extracted and the optimal model was selected. RESULTS: After feature selection, the numbers of chosen features for the LR, SVM, RF, XGBoost, and CatBoost models were 6, 9, 10, 14, and 6, respectively. Among them, glucose, waist circumference, and age were chosen as predictors by most of the models. Finally, all 5 models achieved good performance on the external test set. Both XGBoost and CatBoost had the same AUC (0.85), which was highest among those of all models. Their accuracy, sensitivity, precision, and F1 scores were also close, but the specificity of XGBoost reached 0.79, which was significantly higher than that of CatBoost, so XGBoost was chosen as the optimal model. CONCLUSION: The model developed herein has a good predictive ability for IR in children 6–12 years old and can be clinically applied to help pediatricians identify children with IR in a simple and inexpensive way. Dove 2022-09-27 /pmc/articles/PMC9526431/ /pubmed/36193541 http://dx.doi.org/10.2147/DMSO.S380772 Text en © 2022 Zhang and Wan. https://creativecommons.org/licenses/by-nc/3.0/This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution – Non Commercial (unported, v3.0) License (http://creativecommons.org/licenses/by-nc/3.0/ (https://creativecommons.org/licenses/by-nc/3.0/) ). By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms (https://www.dovepress.com/terms.php).
spellingShingle Original Research
Zhang, Qian
Wan, Nai-jun
Simple Method to Predict Insulin Resistance in Children Aged 6–12 Years by Using Machine Learning
title Simple Method to Predict Insulin Resistance in Children Aged 6–12 Years by Using Machine Learning
title_full Simple Method to Predict Insulin Resistance in Children Aged 6–12 Years by Using Machine Learning
title_fullStr Simple Method to Predict Insulin Resistance in Children Aged 6–12 Years by Using Machine Learning
title_full_unstemmed Simple Method to Predict Insulin Resistance in Children Aged 6–12 Years by Using Machine Learning
title_short Simple Method to Predict Insulin Resistance in Children Aged 6–12 Years by Using Machine Learning
title_sort simple method to predict insulin resistance in children aged 6–12 years by using machine learning
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9526431/
https://www.ncbi.nlm.nih.gov/pubmed/36193541
http://dx.doi.org/10.2147/DMSO.S380772
work_keys_str_mv AT zhangqian simplemethodtopredictinsulinresistanceinchildrenaged612yearsbyusingmachinelearning
AT wannaijun simplemethodtopredictinsulinresistanceinchildrenaged612yearsbyusingmachinelearning