Cargando…
Simple Method to Predict Insulin Resistance in Children Aged 6–12 Years by Using Machine Learning
BACKGROUND: Due to the increasing insulin resistance (IR) in childhood, rates of diabetes and cardiovascular disease may rise in the future and seriously threaten the healthy development of children. Finding an easy way to predict IR in children can help pediatricians to identify these children in t...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Dove
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9526431/ https://www.ncbi.nlm.nih.gov/pubmed/36193541 http://dx.doi.org/10.2147/DMSO.S380772 |
_version_ | 1784800875168923648 |
---|---|
author | Zhang, Qian Wan, Nai-jun |
author_facet | Zhang, Qian Wan, Nai-jun |
author_sort | Zhang, Qian |
collection | PubMed |
description | BACKGROUND: Due to the increasing insulin resistance (IR) in childhood, rates of diabetes and cardiovascular disease may rise in the future and seriously threaten the healthy development of children. Finding an easy way to predict IR in children can help pediatricians to identify these children in time and intervene appropriately, which is particularly important for practitioners in primary health care. PATIENTS AND METHODS: Seventeen features from 503 children 6–12 years old were collected. We defined IR by HOMA-IR greater than 3.0, thus classifying children with IR and those without IR. Data were preprocessed by multivariate imputation and oversampling to resolve missing values and data imbalances; then, recursive feature elimination was applied to further select features of interest, and 5 machine learning methods—namely, logistic regression (LR), support vector machine (SVM), random forest (RF), extreme gradient boosting (XGBoost), and gradient boosting with categorical features support (CatBoost)—were used for model training. We tested the trained models on an external test set containing information from 133 children, from which performance metrics were extracted and the optimal model was selected. RESULTS: After feature selection, the numbers of chosen features for the LR, SVM, RF, XGBoost, and CatBoost models were 6, 9, 10, 14, and 6, respectively. Among them, glucose, waist circumference, and age were chosen as predictors by most of the models. Finally, all 5 models achieved good performance on the external test set. Both XGBoost and CatBoost had the same AUC (0.85), which was highest among those of all models. Their accuracy, sensitivity, precision, and F1 scores were also close, but the specificity of XGBoost reached 0.79, which was significantly higher than that of CatBoost, so XGBoost was chosen as the optimal model. CONCLUSION: The model developed herein has a good predictive ability for IR in children 6–12 years old and can be clinically applied to help pediatricians identify children with IR in a simple and inexpensive way. |
format | Online Article Text |
id | pubmed-9526431 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Dove |
record_format | MEDLINE/PubMed |
spelling | pubmed-95264312022-10-02 Simple Method to Predict Insulin Resistance in Children Aged 6–12 Years by Using Machine Learning Zhang, Qian Wan, Nai-jun Diabetes Metab Syndr Obes Original Research BACKGROUND: Due to the increasing insulin resistance (IR) in childhood, rates of diabetes and cardiovascular disease may rise in the future and seriously threaten the healthy development of children. Finding an easy way to predict IR in children can help pediatricians to identify these children in time and intervene appropriately, which is particularly important for practitioners in primary health care. PATIENTS AND METHODS: Seventeen features from 503 children 6–12 years old were collected. We defined IR by HOMA-IR greater than 3.0, thus classifying children with IR and those without IR. Data were preprocessed by multivariate imputation and oversampling to resolve missing values and data imbalances; then, recursive feature elimination was applied to further select features of interest, and 5 machine learning methods—namely, logistic regression (LR), support vector machine (SVM), random forest (RF), extreme gradient boosting (XGBoost), and gradient boosting with categorical features support (CatBoost)—were used for model training. We tested the trained models on an external test set containing information from 133 children, from which performance metrics were extracted and the optimal model was selected. RESULTS: After feature selection, the numbers of chosen features for the LR, SVM, RF, XGBoost, and CatBoost models were 6, 9, 10, 14, and 6, respectively. Among them, glucose, waist circumference, and age were chosen as predictors by most of the models. Finally, all 5 models achieved good performance on the external test set. Both XGBoost and CatBoost had the same AUC (0.85), which was highest among those of all models. Their accuracy, sensitivity, precision, and F1 scores were also close, but the specificity of XGBoost reached 0.79, which was significantly higher than that of CatBoost, so XGBoost was chosen as the optimal model. CONCLUSION: The model developed herein has a good predictive ability for IR in children 6–12 years old and can be clinically applied to help pediatricians identify children with IR in a simple and inexpensive way. Dove 2022-09-27 /pmc/articles/PMC9526431/ /pubmed/36193541 http://dx.doi.org/10.2147/DMSO.S380772 Text en © 2022 Zhang and Wan. https://creativecommons.org/licenses/by-nc/3.0/This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution – Non Commercial (unported, v3.0) License (http://creativecommons.org/licenses/by-nc/3.0/ (https://creativecommons.org/licenses/by-nc/3.0/) ). By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms (https://www.dovepress.com/terms.php). |
spellingShingle | Original Research Zhang, Qian Wan, Nai-jun Simple Method to Predict Insulin Resistance in Children Aged 6–12 Years by Using Machine Learning |
title | Simple Method to Predict Insulin Resistance in Children Aged 6–12 Years by Using Machine Learning |
title_full | Simple Method to Predict Insulin Resistance in Children Aged 6–12 Years by Using Machine Learning |
title_fullStr | Simple Method to Predict Insulin Resistance in Children Aged 6–12 Years by Using Machine Learning |
title_full_unstemmed | Simple Method to Predict Insulin Resistance in Children Aged 6–12 Years by Using Machine Learning |
title_short | Simple Method to Predict Insulin Resistance in Children Aged 6–12 Years by Using Machine Learning |
title_sort | simple method to predict insulin resistance in children aged 6–12 years by using machine learning |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9526431/ https://www.ncbi.nlm.nih.gov/pubmed/36193541 http://dx.doi.org/10.2147/DMSO.S380772 |
work_keys_str_mv | AT zhangqian simplemethodtopredictinsulinresistanceinchildrenaged612yearsbyusingmachinelearning AT wannaijun simplemethodtopredictinsulinresistanceinchildrenaged612yearsbyusingmachinelearning |