Cargando…

Systematic comparison of machine learning algorithms to develop and validate predictive models for periodontitis

AIM: The aim of this study was to compare the validity of different machine learning algorithms to develop and validate predictive models for periodontitis. MATERIALS AND METHODS: Using national survey data from Taiwan (n = 3453) and the United States (n = 3685), predictors of periodontitis were ext...

Descripción completa

Detalles Bibliográficos
Autores principales: Bashir, Nasir Z., Rahman, Zahid, Chen, Sam Li‐Sheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Blackwell Publishing Ltd 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9796669/
https://www.ncbi.nlm.nih.gov/pubmed/35781722
http://dx.doi.org/10.1111/jcpe.13692
_version_ 1784860539185266688
author Bashir, Nasir Z.
Rahman, Zahid
Chen, Sam Li‐Sheng
author_facet Bashir, Nasir Z.
Rahman, Zahid
Chen, Sam Li‐Sheng
author_sort Bashir, Nasir Z.
collection PubMed
description AIM: The aim of this study was to compare the validity of different machine learning algorithms to develop and validate predictive models for periodontitis. MATERIALS AND METHODS: Using national survey data from Taiwan (n = 3453) and the United States (n = 3685), predictors of periodontitis were extracted from the datasets and pre‐processed, and then 10 machine learning algorithms were trained to develop predictive models. The models were validated both internally (bootstrap sampling) and externally (alternative country's dataset). The algorithms were compared across six performance metrics ([i] area under the curve for the receiver operating characteristic [AUC], [ii] accuracy, [iii] sensitivity, [iv] specificity, [v] positive predictive value, and [vi] negative predictive value) and two methods of data pre‐processing ([i] machine‐learning‐based feature selection and [ii] dimensionality reduction into principal components). RESULTS: Many algorithms showed extremely strong performance during internal validation (AUC > 0.95, accuracy > 95%). However, this was not replicated in external validation, where predictive performance of all algorithms dropped off drastically. Furthermore, predictive performance differed according to data pre‐processing methodology and the cohort on which they were trained. CONCLUSIONS: Larger sample sizes and more complex predictors of periodontitis are required before machine learning can be leveraged to its full potential.
format Online
Article
Text
id pubmed-9796669
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Blackwell Publishing Ltd
record_format MEDLINE/PubMed
spelling pubmed-97966692023-01-04 Systematic comparison of machine learning algorithms to develop and validate predictive models for periodontitis Bashir, Nasir Z. Rahman, Zahid Chen, Sam Li‐Sheng J Clin Periodontol Diagnosis, Epidemiology and Associated Co‐morbidities AIM: The aim of this study was to compare the validity of different machine learning algorithms to develop and validate predictive models for periodontitis. MATERIALS AND METHODS: Using national survey data from Taiwan (n = 3453) and the United States (n = 3685), predictors of periodontitis were extracted from the datasets and pre‐processed, and then 10 machine learning algorithms were trained to develop predictive models. The models were validated both internally (bootstrap sampling) and externally (alternative country's dataset). The algorithms were compared across six performance metrics ([i] area under the curve for the receiver operating characteristic [AUC], [ii] accuracy, [iii] sensitivity, [iv] specificity, [v] positive predictive value, and [vi] negative predictive value) and two methods of data pre‐processing ([i] machine‐learning‐based feature selection and [ii] dimensionality reduction into principal components). RESULTS: Many algorithms showed extremely strong performance during internal validation (AUC > 0.95, accuracy > 95%). However, this was not replicated in external validation, where predictive performance of all algorithms dropped off drastically. Furthermore, predictive performance differed according to data pre‐processing methodology and the cohort on which they were trained. CONCLUSIONS: Larger sample sizes and more complex predictors of periodontitis are required before machine learning can be leveraged to its full potential. Blackwell Publishing Ltd 2022-07-28 2022-10 /pmc/articles/PMC9796669/ /pubmed/35781722 http://dx.doi.org/10.1111/jcpe.13692 Text en © 2022 The Authors. Journal of Clinical Periodontology published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Diagnosis, Epidemiology and Associated Co‐morbidities
Bashir, Nasir Z.
Rahman, Zahid
Chen, Sam Li‐Sheng
Systematic comparison of machine learning algorithms to develop and validate predictive models for periodontitis
title Systematic comparison of machine learning algorithms to develop and validate predictive models for periodontitis
title_full Systematic comparison of machine learning algorithms to develop and validate predictive models for periodontitis
title_fullStr Systematic comparison of machine learning algorithms to develop and validate predictive models for periodontitis
title_full_unstemmed Systematic comparison of machine learning algorithms to develop and validate predictive models for periodontitis
title_short Systematic comparison of machine learning algorithms to develop and validate predictive models for periodontitis
title_sort systematic comparison of machine learning algorithms to develop and validate predictive models for periodontitis
topic Diagnosis, Epidemiology and Associated Co‐morbidities
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9796669/
https://www.ncbi.nlm.nih.gov/pubmed/35781722
http://dx.doi.org/10.1111/jcpe.13692
work_keys_str_mv AT bashirnasirz systematiccomparisonofmachinelearningalgorithmstodevelopandvalidatepredictivemodelsforperiodontitis
AT rahmanzahid systematiccomparisonofmachinelearningalgorithmstodevelopandvalidatepredictivemodelsforperiodontitis
AT chensamlisheng systematiccomparisonofmachinelearningalgorithmstodevelopandvalidatepredictivemodelsforperiodontitis