Cargando…

Prediction Models of Early Childhood Caries Based on Machine Learning Algorithms

In this study, we developed machine learning-based prediction models for early childhood caries and compared their performances with the traditional regression model. We analyzed the data of 4195 children aged 1–5 years from the Korea National Health and Nutrition Examination Survey data (2007–2018)...

Descripción completa

Detalles Bibliográficos
Autores principales: Park, You-Hyun, Kim, Sung-Hwa, Choi, Yoon-Young
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8393254/
https://www.ncbi.nlm.nih.gov/pubmed/34444368
http://dx.doi.org/10.3390/ijerph18168613
_version_ 1783743690775199744
author Park, You-Hyun
Kim, Sung-Hwa
Choi, Yoon-Young
author_facet Park, You-Hyun
Kim, Sung-Hwa
Choi, Yoon-Young
author_sort Park, You-Hyun
collection PubMed
description In this study, we developed machine learning-based prediction models for early childhood caries and compared their performances with the traditional regression model. We analyzed the data of 4195 children aged 1–5 years from the Korea National Health and Nutrition Examination Survey data (2007–2018). Moreover, we developed prediction models using the XGBoost (version 1.3.1), random forest, and LightGBM (version 3.1.1) algorithms in addition to logistic regression. Two different methods were applied for variable selection, including a regression-based backward elimination and a random forest-based permutation importance classifier. We compared the area under the receiver operating characteristic (AUROC) values and misclassification rates of the different models and observed that all four prediction models had AUROC values ranging between 0.774 and 0.785. Furthermore, no significant difference was observed between the AUROC values of the four models. Based on the results, we can confirm that both traditional logistic regression and ML-based models can show favorable performance and can be used to predict early childhood caries, identify ECC high-risk groups, and implement active preventive treatments. However, further research is essential to improving the performance of the prediction model using recent methods, such as deep learning.
format Online
Article
Text
id pubmed-8393254
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83932542021-08-28 Prediction Models of Early Childhood Caries Based on Machine Learning Algorithms Park, You-Hyun Kim, Sung-Hwa Choi, Yoon-Young Int J Environ Res Public Health Article In this study, we developed machine learning-based prediction models for early childhood caries and compared their performances with the traditional regression model. We analyzed the data of 4195 children aged 1–5 years from the Korea National Health and Nutrition Examination Survey data (2007–2018). Moreover, we developed prediction models using the XGBoost (version 1.3.1), random forest, and LightGBM (version 3.1.1) algorithms in addition to logistic regression. Two different methods were applied for variable selection, including a regression-based backward elimination and a random forest-based permutation importance classifier. We compared the area under the receiver operating characteristic (AUROC) values and misclassification rates of the different models and observed that all four prediction models had AUROC values ranging between 0.774 and 0.785. Furthermore, no significant difference was observed between the AUROC values of the four models. Based on the results, we can confirm that both traditional logistic regression and ML-based models can show favorable performance and can be used to predict early childhood caries, identify ECC high-risk groups, and implement active preventive treatments. However, further research is essential to improving the performance of the prediction model using recent methods, such as deep learning. MDPI 2021-08-15 /pmc/articles/PMC8393254/ /pubmed/34444368 http://dx.doi.org/10.3390/ijerph18168613 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Park, You-Hyun
Kim, Sung-Hwa
Choi, Yoon-Young
Prediction Models of Early Childhood Caries Based on Machine Learning Algorithms
title Prediction Models of Early Childhood Caries Based on Machine Learning Algorithms
title_full Prediction Models of Early Childhood Caries Based on Machine Learning Algorithms
title_fullStr Prediction Models of Early Childhood Caries Based on Machine Learning Algorithms
title_full_unstemmed Prediction Models of Early Childhood Caries Based on Machine Learning Algorithms
title_short Prediction Models of Early Childhood Caries Based on Machine Learning Algorithms
title_sort prediction models of early childhood caries based on machine learning algorithms
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8393254/
https://www.ncbi.nlm.nih.gov/pubmed/34444368
http://dx.doi.org/10.3390/ijerph18168613
work_keys_str_mv AT parkyouhyun predictionmodelsofearlychildhoodcariesbasedonmachinelearningalgorithms
AT kimsunghwa predictionmodelsofearlychildhoodcariesbasedonmachinelearningalgorithms
AT choiyoonyoung predictionmodelsofearlychildhoodcariesbasedonmachinelearningalgorithms