Cargando…
Prediction Models of Early Childhood Caries Based on Machine Learning Algorithms
In this study, we developed machine learning-based prediction models for early childhood caries and compared their performances with the traditional regression model. We analyzed the data of 4195 children aged 1–5 years from the Korea National Health and Nutrition Examination Survey data (2007–2018)...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8393254/ https://www.ncbi.nlm.nih.gov/pubmed/34444368 http://dx.doi.org/10.3390/ijerph18168613 |
_version_ | 1783743690775199744 |
---|---|
author | Park, You-Hyun Kim, Sung-Hwa Choi, Yoon-Young |
author_facet | Park, You-Hyun Kim, Sung-Hwa Choi, Yoon-Young |
author_sort | Park, You-Hyun |
collection | PubMed |
description | In this study, we developed machine learning-based prediction models for early childhood caries and compared their performances with the traditional regression model. We analyzed the data of 4195 children aged 1–5 years from the Korea National Health and Nutrition Examination Survey data (2007–2018). Moreover, we developed prediction models using the XGBoost (version 1.3.1), random forest, and LightGBM (version 3.1.1) algorithms in addition to logistic regression. Two different methods were applied for variable selection, including a regression-based backward elimination and a random forest-based permutation importance classifier. We compared the area under the receiver operating characteristic (AUROC) values and misclassification rates of the different models and observed that all four prediction models had AUROC values ranging between 0.774 and 0.785. Furthermore, no significant difference was observed between the AUROC values of the four models. Based on the results, we can confirm that both traditional logistic regression and ML-based models can show favorable performance and can be used to predict early childhood caries, identify ECC high-risk groups, and implement active preventive treatments. However, further research is essential to improving the performance of the prediction model using recent methods, such as deep learning. |
format | Online Article Text |
id | pubmed-8393254 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-83932542021-08-28 Prediction Models of Early Childhood Caries Based on Machine Learning Algorithms Park, You-Hyun Kim, Sung-Hwa Choi, Yoon-Young Int J Environ Res Public Health Article In this study, we developed machine learning-based prediction models for early childhood caries and compared their performances with the traditional regression model. We analyzed the data of 4195 children aged 1–5 years from the Korea National Health and Nutrition Examination Survey data (2007–2018). Moreover, we developed prediction models using the XGBoost (version 1.3.1), random forest, and LightGBM (version 3.1.1) algorithms in addition to logistic regression. Two different methods were applied for variable selection, including a regression-based backward elimination and a random forest-based permutation importance classifier. We compared the area under the receiver operating characteristic (AUROC) values and misclassification rates of the different models and observed that all four prediction models had AUROC values ranging between 0.774 and 0.785. Furthermore, no significant difference was observed between the AUROC values of the four models. Based on the results, we can confirm that both traditional logistic regression and ML-based models can show favorable performance and can be used to predict early childhood caries, identify ECC high-risk groups, and implement active preventive treatments. However, further research is essential to improving the performance of the prediction model using recent methods, such as deep learning. MDPI 2021-08-15 /pmc/articles/PMC8393254/ /pubmed/34444368 http://dx.doi.org/10.3390/ijerph18168613 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Park, You-Hyun Kim, Sung-Hwa Choi, Yoon-Young Prediction Models of Early Childhood Caries Based on Machine Learning Algorithms |
title | Prediction Models of Early Childhood Caries Based on Machine Learning Algorithms |
title_full | Prediction Models of Early Childhood Caries Based on Machine Learning Algorithms |
title_fullStr | Prediction Models of Early Childhood Caries Based on Machine Learning Algorithms |
title_full_unstemmed | Prediction Models of Early Childhood Caries Based on Machine Learning Algorithms |
title_short | Prediction Models of Early Childhood Caries Based on Machine Learning Algorithms |
title_sort | prediction models of early childhood caries based on machine learning algorithms |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8393254/ https://www.ncbi.nlm.nih.gov/pubmed/34444368 http://dx.doi.org/10.3390/ijerph18168613 |
work_keys_str_mv | AT parkyouhyun predictionmodelsofearlychildhoodcariesbasedonmachinelearningalgorithms AT kimsunghwa predictionmodelsofearlychildhoodcariesbasedonmachinelearningalgorithms AT choiyoonyoung predictionmodelsofearlychildhoodcariesbasedonmachinelearningalgorithms |