Cargando…
Predicting youth diabetes risk using NHANES data and machine learning
Prediabetes and diabetes mellitus (preDM/DM) have become alarmingly prevalent among youth in recent years. However, simple questionnaire-based screening tools to reliably assess diabetes risk are only available for adults, not youth. As a first step in developing such a tool, we used a large-scale d...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8160335/ https://www.ncbi.nlm.nih.gov/pubmed/34045491 http://dx.doi.org/10.1038/s41598-021-90406-0 |
_version_ | 1783700262702022656 |
---|---|
author | Vangeepuram, Nita Liu, Bian Chiu, Po-hsiang Wang, Linhua Pandey, Gaurav |
author_facet | Vangeepuram, Nita Liu, Bian Chiu, Po-hsiang Wang, Linhua Pandey, Gaurav |
author_sort | Vangeepuram, Nita |
collection | PubMed |
description | Prediabetes and diabetes mellitus (preDM/DM) have become alarmingly prevalent among youth in recent years. However, simple questionnaire-based screening tools to reliably assess diabetes risk are only available for adults, not youth. As a first step in developing such a tool, we used a large-scale dataset from the National Health and Nutritional Examination Survey (NHANES) to examine the performance of a published pediatric clinical screening guideline in identifying youth with preDM/DM based on American Diabetes Association diagnostic biomarkers. We assessed the agreement between the clinical guideline and biomarker criteria using established evaluation measures (sensitivity, specificity, positive/negative predictive value, F-measure for the positive/negative preDM/DM classes, and Kappa). We also compared the performance of the guideline to those of machine learning (ML) based preDM/DM classifiers derived from the NHANES dataset. Approximately 29% of the 2858 youth in our study population had preDM/DM based on biomarker criteria. The clinical guideline had a sensitivity of 43.1% and specificity of 67.6%, positive/negative predictive values of 35.2%/74.5%, positive/negative F-measures of 38.8%/70.9%, and Kappa of 0.1 (95%CI: 0.06–0.14). The performance of the guideline varied across demographic subgroups. Some ML-based classifiers performed comparably to or better than the screening guideline, especially in identifying preDM/DM youth (p = 5.23 × 10(−5)).We demonstrated that a recommended pediatric clinical screening guideline did not perform well in identifying preDM/DM status among youth. Additional work is needed to develop a simple yet accurate screener for youth diabetes risk, potentially by using advanced ML methods and a wider range of clinical and behavioral health data. |
format | Online Article Text |
id | pubmed-8160335 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-81603352021-06-01 Predicting youth diabetes risk using NHANES data and machine learning Vangeepuram, Nita Liu, Bian Chiu, Po-hsiang Wang, Linhua Pandey, Gaurav Sci Rep Article Prediabetes and diabetes mellitus (preDM/DM) have become alarmingly prevalent among youth in recent years. However, simple questionnaire-based screening tools to reliably assess diabetes risk are only available for adults, not youth. As a first step in developing such a tool, we used a large-scale dataset from the National Health and Nutritional Examination Survey (NHANES) to examine the performance of a published pediatric clinical screening guideline in identifying youth with preDM/DM based on American Diabetes Association diagnostic biomarkers. We assessed the agreement between the clinical guideline and biomarker criteria using established evaluation measures (sensitivity, specificity, positive/negative predictive value, F-measure for the positive/negative preDM/DM classes, and Kappa). We also compared the performance of the guideline to those of machine learning (ML) based preDM/DM classifiers derived from the NHANES dataset. Approximately 29% of the 2858 youth in our study population had preDM/DM based on biomarker criteria. The clinical guideline had a sensitivity of 43.1% and specificity of 67.6%, positive/negative predictive values of 35.2%/74.5%, positive/negative F-measures of 38.8%/70.9%, and Kappa of 0.1 (95%CI: 0.06–0.14). The performance of the guideline varied across demographic subgroups. Some ML-based classifiers performed comparably to or better than the screening guideline, especially in identifying preDM/DM youth (p = 5.23 × 10(−5)).We demonstrated that a recommended pediatric clinical screening guideline did not perform well in identifying preDM/DM status among youth. Additional work is needed to develop a simple yet accurate screener for youth diabetes risk, potentially by using advanced ML methods and a wider range of clinical and behavioral health data. Nature Publishing Group UK 2021-05-27 /pmc/articles/PMC8160335/ /pubmed/34045491 http://dx.doi.org/10.1038/s41598-021-90406-0 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Vangeepuram, Nita Liu, Bian Chiu, Po-hsiang Wang, Linhua Pandey, Gaurav Predicting youth diabetes risk using NHANES data and machine learning |
title | Predicting youth diabetes risk using NHANES data and machine learning |
title_full | Predicting youth diabetes risk using NHANES data and machine learning |
title_fullStr | Predicting youth diabetes risk using NHANES data and machine learning |
title_full_unstemmed | Predicting youth diabetes risk using NHANES data and machine learning |
title_short | Predicting youth diabetes risk using NHANES data and machine learning |
title_sort | predicting youth diabetes risk using nhanes data and machine learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8160335/ https://www.ncbi.nlm.nih.gov/pubmed/34045491 http://dx.doi.org/10.1038/s41598-021-90406-0 |
work_keys_str_mv | AT vangeepuramnita predictingyouthdiabetesriskusingnhanesdataandmachinelearning AT liubian predictingyouthdiabetesriskusingnhanesdataandmachinelearning AT chiupohsiang predictingyouthdiabetesriskusingnhanesdataandmachinelearning AT wanglinhua predictingyouthdiabetesriskusingnhanesdataandmachinelearning AT pandeygaurav predictingyouthdiabetesriskusingnhanesdataandmachinelearning |