Cargando…

Predicting youth diabetes risk using NHANES data and machine learning

Prediabetes and diabetes mellitus (preDM/DM) have become alarmingly prevalent among youth in recent years. However, simple questionnaire-based screening tools to reliably assess diabetes risk are only available for adults, not youth. As a first step in developing such a tool, we used a large-scale d...

Descripción completa

Detalles Bibliográficos
Autores principales: Vangeepuram, Nita, Liu, Bian, Chiu, Po-hsiang, Wang, Linhua, Pandey, Gaurav
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8160335/
https://www.ncbi.nlm.nih.gov/pubmed/34045491
http://dx.doi.org/10.1038/s41598-021-90406-0
_version_ 1783700262702022656
author Vangeepuram, Nita
Liu, Bian
Chiu, Po-hsiang
Wang, Linhua
Pandey, Gaurav
author_facet Vangeepuram, Nita
Liu, Bian
Chiu, Po-hsiang
Wang, Linhua
Pandey, Gaurav
author_sort Vangeepuram, Nita
collection PubMed
description Prediabetes and diabetes mellitus (preDM/DM) have become alarmingly prevalent among youth in recent years. However, simple questionnaire-based screening tools to reliably assess diabetes risk are only available for adults, not youth. As a first step in developing such a tool, we used a large-scale dataset from the National Health and Nutritional Examination Survey (NHANES) to examine the performance of a published pediatric clinical screening guideline in identifying youth with preDM/DM based on American Diabetes Association diagnostic biomarkers. We assessed the agreement between the clinical guideline and biomarker criteria using established evaluation measures (sensitivity, specificity, positive/negative predictive value, F-measure for the positive/negative preDM/DM classes, and Kappa). We also compared the performance of the guideline to those of machine learning (ML) based preDM/DM classifiers derived from the NHANES dataset. Approximately 29% of the 2858 youth in our study population had preDM/DM based on biomarker criteria. The clinical guideline had a sensitivity of 43.1% and specificity of 67.6%, positive/negative predictive values of 35.2%/74.5%, positive/negative F-measures of 38.8%/70.9%, and Kappa of 0.1 (95%CI: 0.06–0.14). The performance of the guideline varied across demographic subgroups. Some ML-based classifiers performed comparably to or better than the screening guideline, especially in identifying preDM/DM youth (p = 5.23 × 10(−5)).We demonstrated that a recommended pediatric clinical screening guideline did not perform well in identifying preDM/DM status among youth. Additional work is needed to develop a simple yet accurate screener for youth diabetes risk, potentially by using advanced ML methods and a wider range of clinical and behavioral health data.
format Online
Article
Text
id pubmed-8160335
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-81603352021-06-01 Predicting youth diabetes risk using NHANES data and machine learning Vangeepuram, Nita Liu, Bian Chiu, Po-hsiang Wang, Linhua Pandey, Gaurav Sci Rep Article Prediabetes and diabetes mellitus (preDM/DM) have become alarmingly prevalent among youth in recent years. However, simple questionnaire-based screening tools to reliably assess diabetes risk are only available for adults, not youth. As a first step in developing such a tool, we used a large-scale dataset from the National Health and Nutritional Examination Survey (NHANES) to examine the performance of a published pediatric clinical screening guideline in identifying youth with preDM/DM based on American Diabetes Association diagnostic biomarkers. We assessed the agreement between the clinical guideline and biomarker criteria using established evaluation measures (sensitivity, specificity, positive/negative predictive value, F-measure for the positive/negative preDM/DM classes, and Kappa). We also compared the performance of the guideline to those of machine learning (ML) based preDM/DM classifiers derived from the NHANES dataset. Approximately 29% of the 2858 youth in our study population had preDM/DM based on biomarker criteria. The clinical guideline had a sensitivity of 43.1% and specificity of 67.6%, positive/negative predictive values of 35.2%/74.5%, positive/negative F-measures of 38.8%/70.9%, and Kappa of 0.1 (95%CI: 0.06–0.14). The performance of the guideline varied across demographic subgroups. Some ML-based classifiers performed comparably to or better than the screening guideline, especially in identifying preDM/DM youth (p = 5.23 × 10(−5)).We demonstrated that a recommended pediatric clinical screening guideline did not perform well in identifying preDM/DM status among youth. Additional work is needed to develop a simple yet accurate screener for youth diabetes risk, potentially by using advanced ML methods and a wider range of clinical and behavioral health data. Nature Publishing Group UK 2021-05-27 /pmc/articles/PMC8160335/ /pubmed/34045491 http://dx.doi.org/10.1038/s41598-021-90406-0 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Vangeepuram, Nita
Liu, Bian
Chiu, Po-hsiang
Wang, Linhua
Pandey, Gaurav
Predicting youth diabetes risk using NHANES data and machine learning
title Predicting youth diabetes risk using NHANES data and machine learning
title_full Predicting youth diabetes risk using NHANES data and machine learning
title_fullStr Predicting youth diabetes risk using NHANES data and machine learning
title_full_unstemmed Predicting youth diabetes risk using NHANES data and machine learning
title_short Predicting youth diabetes risk using NHANES data and machine learning
title_sort predicting youth diabetes risk using nhanes data and machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8160335/
https://www.ncbi.nlm.nih.gov/pubmed/34045491
http://dx.doi.org/10.1038/s41598-021-90406-0
work_keys_str_mv AT vangeepuramnita predictingyouthdiabetesriskusingnhanesdataandmachinelearning
AT liubian predictingyouthdiabetesriskusingnhanesdataandmachinelearning
AT chiupohsiang predictingyouthdiabetesriskusingnhanesdataandmachinelearning
AT wanglinhua predictingyouthdiabetesriskusingnhanesdataandmachinelearning
AT pandeygaurav predictingyouthdiabetesriskusingnhanesdataandmachinelearning