Cargando…

A New Model for Caries Risk Prediction in Teenagers Using a Machine Learning Algorithm Based on Environmental and Genetic Factors

Dental caries is a multifactorial disease that can be caused by interactions between genetic and environmental risk factors. Despite the availability of caries risk assessment tools, caries risk prediction models incorporating new factors, such as human genetic markers, have not yet been reported. T...

Descripción completa

Detalles Bibliográficos
Autores principales: Pang, Liangyue, Wang, Ketian, Tao, Ye, Zhi, Qinghui, Zhang, Jianming, Lin, Huancai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7990890/
https://www.ncbi.nlm.nih.gov/pubmed/33777105
http://dx.doi.org/10.3389/fgene.2021.636867
_version_ 1783669143807983616
author Pang, Liangyue
Wang, Ketian
Tao, Ye
Zhi, Qinghui
Zhang, Jianming
Lin, Huancai
author_facet Pang, Liangyue
Wang, Ketian
Tao, Ye
Zhi, Qinghui
Zhang, Jianming
Lin, Huancai
author_sort Pang, Liangyue
collection PubMed
description Dental caries is a multifactorial disease that can be caused by interactions between genetic and environmental risk factors. Despite the availability of caries risk assessment tools, caries risk prediction models incorporating new factors, such as human genetic markers, have not yet been reported. The aim of this study was to construct a new model for caries risk prediction in teenagers, based on environmental and genetic factors, using a machine learning algorithm. We performed a prospective longitudinal study of 1,055 teenagers (710 teenagers for cohort 1 and 345 teenagers for cohort 2) aged 13 years, of whom 953 (633 teenagers for cohort 1 and 320 teenagers for cohort 2) were followed for 21 months. All participants completed an oral health questionnaire, an oral examination, biological (salivary and cariostate) tests, and single nucleotide polymorphism sequencing analysis. We constructed a caries risk prediction model based on these data using a random forest with an AUC of 0.78 in cohort 1 (training cohort). We further verified the discrimination and calibration abilities of this caries risk prediction model using cohort 2. The AUC of the caries risk prediction model in cohort 2 (testing cohort) was 0.73, indicating high discrimination ability. Risk stratification revealed that our caries risk prediction model could accurately identify individuals at high and very high caries risk but underestimated risks for individuals at low and very low caries risk. Thus, our caries risk prediction model has the potential for use as a powerful community-level tool to identify individuals at high caries risk.
format Online
Article
Text
id pubmed-7990890
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-79908902021-03-26 A New Model for Caries Risk Prediction in Teenagers Using a Machine Learning Algorithm Based on Environmental and Genetic Factors Pang, Liangyue Wang, Ketian Tao, Ye Zhi, Qinghui Zhang, Jianming Lin, Huancai Front Genet Genetics Dental caries is a multifactorial disease that can be caused by interactions between genetic and environmental risk factors. Despite the availability of caries risk assessment tools, caries risk prediction models incorporating new factors, such as human genetic markers, have not yet been reported. The aim of this study was to construct a new model for caries risk prediction in teenagers, based on environmental and genetic factors, using a machine learning algorithm. We performed a prospective longitudinal study of 1,055 teenagers (710 teenagers for cohort 1 and 345 teenagers for cohort 2) aged 13 years, of whom 953 (633 teenagers for cohort 1 and 320 teenagers for cohort 2) were followed for 21 months. All participants completed an oral health questionnaire, an oral examination, biological (salivary and cariostate) tests, and single nucleotide polymorphism sequencing analysis. We constructed a caries risk prediction model based on these data using a random forest with an AUC of 0.78 in cohort 1 (training cohort). We further verified the discrimination and calibration abilities of this caries risk prediction model using cohort 2. The AUC of the caries risk prediction model in cohort 2 (testing cohort) was 0.73, indicating high discrimination ability. Risk stratification revealed that our caries risk prediction model could accurately identify individuals at high and very high caries risk but underestimated risks for individuals at low and very low caries risk. Thus, our caries risk prediction model has the potential for use as a powerful community-level tool to identify individuals at high caries risk. Frontiers Media S.A. 2021-03-11 /pmc/articles/PMC7990890/ /pubmed/33777105 http://dx.doi.org/10.3389/fgene.2021.636867 Text en Copyright © 2021 Pang, Wang, Tao, Zhi, Zhang and Lin. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Pang, Liangyue
Wang, Ketian
Tao, Ye
Zhi, Qinghui
Zhang, Jianming
Lin, Huancai
A New Model for Caries Risk Prediction in Teenagers Using a Machine Learning Algorithm Based on Environmental and Genetic Factors
title A New Model for Caries Risk Prediction in Teenagers Using a Machine Learning Algorithm Based on Environmental and Genetic Factors
title_full A New Model for Caries Risk Prediction in Teenagers Using a Machine Learning Algorithm Based on Environmental and Genetic Factors
title_fullStr A New Model for Caries Risk Prediction in Teenagers Using a Machine Learning Algorithm Based on Environmental and Genetic Factors
title_full_unstemmed A New Model for Caries Risk Prediction in Teenagers Using a Machine Learning Algorithm Based on Environmental and Genetic Factors
title_short A New Model for Caries Risk Prediction in Teenagers Using a Machine Learning Algorithm Based on Environmental and Genetic Factors
title_sort new model for caries risk prediction in teenagers using a machine learning algorithm based on environmental and genetic factors
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7990890/
https://www.ncbi.nlm.nih.gov/pubmed/33777105
http://dx.doi.org/10.3389/fgene.2021.636867
work_keys_str_mv AT pangliangyue anewmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT wangketian anewmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT taoye anewmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT zhiqinghui anewmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT zhangjianming anewmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT linhuancai anewmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT pangliangyue newmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT wangketian newmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT taoye newmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT zhiqinghui newmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT zhangjianming newmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors
AT linhuancai newmodelforcariesriskpredictioninteenagersusingamachinelearningalgorithmbasedonenvironmentalandgeneticfactors