Cargando…

Developing and testing a prediction model for periodontal disease using machine learning and big electronic dental record data

Despite advances in periodontal disease (PD) research and periodontal treatments, 42% of the US population suffer from periodontitis. PD can be prevented if high-risk patients are identified early to provide preventive care. Prediction models can help assess risk for PD before initiation and progres...

Descripción completa

Detalles Bibliográficos
Autores principales: Patel, Jay S., Su, Chang, Tellez, Marisol, Albandar, Jasim M., Rao, Rishi, Iyer, Vishnu, Shi, Evan, Wu, Huanmei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9608121/
https://www.ncbi.nlm.nih.gov/pubmed/36311550
http://dx.doi.org/10.3389/frai.2022.979525
_version_ 1784818701678149632
author Patel, Jay S.
Su, Chang
Tellez, Marisol
Albandar, Jasim M.
Rao, Rishi
Iyer, Vishnu
Shi, Evan
Wu, Huanmei
author_facet Patel, Jay S.
Su, Chang
Tellez, Marisol
Albandar, Jasim M.
Rao, Rishi
Iyer, Vishnu
Shi, Evan
Wu, Huanmei
author_sort Patel, Jay S.
collection PubMed
description Despite advances in periodontal disease (PD) research and periodontal treatments, 42% of the US population suffer from periodontitis. PD can be prevented if high-risk patients are identified early to provide preventive care. Prediction models can help assess risk for PD before initiation and progression; nevertheless, utilization of existing PD prediction models is seldom because of their suboptimal performance. This study aims to develop and test the PD prediction model using machine learning (ML) and electronic dental record (EDR) data that could provide large sample sizes and up-to-date information. A cohort of 27,138 dental patients and grouped PD diagnoses into: healthy control, mild PD, and severe PD was generated. The ML model (XGBoost) was trained (80% training data) and tested (20% testing data) with a total of 74 features extracted from the EDR. We used a five-fold cross-validation strategy to identify the optimal hyperparameters of the model for this one-vs.-all multi-class classification task. Our prediction model differentiated healthy patients vs. mild PD cases and mild PD vs. severe PD cases with an average area under the curve of 0.72. New associations and features compared to existing models were identified that include patient-level factors such as patient anxiety, chewing problems, speaking trouble, teeth grinding, alcohol consumption, injury to teeth, presence of removable partial dentures, self-image, recreational drugs (Heroin and Marijuana), medications affecting periodontium, and medical conditions such as osteoporosis, cancer, neurological conditions, infectious diseases, endocrine conditions, cardiovascular diseases, and gastroenterology conditions. This pilot study demonstrated promising results in predicting the risk of PD using ML and EDR data. The model may provide new information to the clinicians about the PD risks and the factors responsible for the disease progression to take preventive approaches. Further studies are warned to evaluate the prediction model's performance on the external dataset and determine its usability in clinical settings.
format Online
Article
Text
id pubmed-9608121
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-96081212022-10-28 Developing and testing a prediction model for periodontal disease using machine learning and big electronic dental record data Patel, Jay S. Su, Chang Tellez, Marisol Albandar, Jasim M. Rao, Rishi Iyer, Vishnu Shi, Evan Wu, Huanmei Front Artif Intell Artificial Intelligence Despite advances in periodontal disease (PD) research and periodontal treatments, 42% of the US population suffer from periodontitis. PD can be prevented if high-risk patients are identified early to provide preventive care. Prediction models can help assess risk for PD before initiation and progression; nevertheless, utilization of existing PD prediction models is seldom because of their suboptimal performance. This study aims to develop and test the PD prediction model using machine learning (ML) and electronic dental record (EDR) data that could provide large sample sizes and up-to-date information. A cohort of 27,138 dental patients and grouped PD diagnoses into: healthy control, mild PD, and severe PD was generated. The ML model (XGBoost) was trained (80% training data) and tested (20% testing data) with a total of 74 features extracted from the EDR. We used a five-fold cross-validation strategy to identify the optimal hyperparameters of the model for this one-vs.-all multi-class classification task. Our prediction model differentiated healthy patients vs. mild PD cases and mild PD vs. severe PD cases with an average area under the curve of 0.72. New associations and features compared to existing models were identified that include patient-level factors such as patient anxiety, chewing problems, speaking trouble, teeth grinding, alcohol consumption, injury to teeth, presence of removable partial dentures, self-image, recreational drugs (Heroin and Marijuana), medications affecting periodontium, and medical conditions such as osteoporosis, cancer, neurological conditions, infectious diseases, endocrine conditions, cardiovascular diseases, and gastroenterology conditions. This pilot study demonstrated promising results in predicting the risk of PD using ML and EDR data. The model may provide new information to the clinicians about the PD risks and the factors responsible for the disease progression to take preventive approaches. Further studies are warned to evaluate the prediction model's performance on the external dataset and determine its usability in clinical settings. Frontiers Media S.A. 2022-10-13 /pmc/articles/PMC9608121/ /pubmed/36311550 http://dx.doi.org/10.3389/frai.2022.979525 Text en Copyright © 2022 Patel, Su, Tellez, Albandar, Rao, Iyer, Shi and Wu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Artificial Intelligence
Patel, Jay S.
Su, Chang
Tellez, Marisol
Albandar, Jasim M.
Rao, Rishi
Iyer, Vishnu
Shi, Evan
Wu, Huanmei
Developing and testing a prediction model for periodontal disease using machine learning and big electronic dental record data
title Developing and testing a prediction model for periodontal disease using machine learning and big electronic dental record data
title_full Developing and testing a prediction model for periodontal disease using machine learning and big electronic dental record data
title_fullStr Developing and testing a prediction model for periodontal disease using machine learning and big electronic dental record data
title_full_unstemmed Developing and testing a prediction model for periodontal disease using machine learning and big electronic dental record data
title_short Developing and testing a prediction model for periodontal disease using machine learning and big electronic dental record data
title_sort developing and testing a prediction model for periodontal disease using machine learning and big electronic dental record data
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9608121/
https://www.ncbi.nlm.nih.gov/pubmed/36311550
http://dx.doi.org/10.3389/frai.2022.979525
work_keys_str_mv AT pateljays developingandtestingapredictionmodelforperiodontaldiseaseusingmachinelearningandbigelectronicdentalrecorddata
AT suchang developingandtestingapredictionmodelforperiodontaldiseaseusingmachinelearningandbigelectronicdentalrecorddata
AT tellezmarisol developingandtestingapredictionmodelforperiodontaldiseaseusingmachinelearningandbigelectronicdentalrecorddata
AT albandarjasimm developingandtestingapredictionmodelforperiodontaldiseaseusingmachinelearningandbigelectronicdentalrecorddata
AT raorishi developingandtestingapredictionmodelforperiodontaldiseaseusingmachinelearningandbigelectronicdentalrecorddata
AT iyervishnu developingandtestingapredictionmodelforperiodontaldiseaseusingmachinelearningandbigelectronicdentalrecorddata
AT shievan developingandtestingapredictionmodelforperiodontaldiseaseusingmachinelearningandbigelectronicdentalrecorddata
AT wuhuanmei developingandtestingapredictionmodelforperiodontaldiseaseusingmachinelearningandbigelectronicdentalrecorddata