Cargando…

Predicting preterm birth through vaginal microbiota, cervical length, and WBC using a machine learning model

An association between the vaginal microbiome and preterm birth has been reported. However, in practice, it is difficult to predict premature birth using the microbiome because the vaginal microbial community varies highly among samples depending on the individual, and the prediction rate is very lo...

Descripción completa

Detalles Bibliográficos
Autores principales: Park, Sunwha, Moon, Jeongsup, Kang, Nayeon, Kim, Young-Han, You, Young-Ah, Kwon, Eunjin, Ansari, AbuZar, Hur, Young Min, Park, Taesung, Kim, Young Ju
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9378785/
https://www.ncbi.nlm.nih.gov/pubmed/35983325
http://dx.doi.org/10.3389/fmicb.2022.912853
_version_ 1784768585588015104
author Park, Sunwha
Moon, Jeongsup
Kang, Nayeon
Kim, Young-Han
You, Young-Ah
Kwon, Eunjin
Ansari, AbuZar
Hur, Young Min
Park, Taesung
Kim, Young Ju
author_facet Park, Sunwha
Moon, Jeongsup
Kang, Nayeon
Kim, Young-Han
You, Young-Ah
Kwon, Eunjin
Ansari, AbuZar
Hur, Young Min
Park, Taesung
Kim, Young Ju
author_sort Park, Sunwha
collection PubMed
description An association between the vaginal microbiome and preterm birth has been reported. However, in practice, it is difficult to predict premature birth using the microbiome because the vaginal microbial community varies highly among samples depending on the individual, and the prediction rate is very low. The purpose of this study was to select markers that improve predictive power through machine learning among various vaginal microbiota and develop a prediction algorithm with better predictive power that combines clinical information. As a multicenter case–control study with 150 Korean pregnant women with 54 preterm delivery group and 96 full-term delivery group, cervicovaginal fluid was collected from pregnant women during mid-pregnancy. Their demographic profiles (age, BMI, education level, and PTB history), white blood cell count, and cervical length were recorded, and the microbiome profiles of the cervicovaginal fluid were analyzed. The subjects were randomly divided into a training (n = 101) and a test set (n = 49) in a two-to-one ratio. When training ML models using selected markers, five-fold cross-validation was performed on the training set. A univariate analysis was performed to select markers using seven statistical tests, including the Wilcoxon rank-sum test. Using the selected markers, including Lactobacillus spp., Gardnerella vaginalis, Ureaplasma parvum, Atopobium vaginae, Prevotella timonensis, and Peptoniphilus grossensis, machine learning models (logistic regression, random forest, extreme gradient boosting, support vector machine, and GUIDE) were used to build prediction models. The test area under the curve of the logistic regression model was 0.72 when it was trained with the 17 selected markers. When analyzed by combining white blood cell count and cervical length with the seven vaginal microbiome markers, the random forest model showed the highest test area under the curve of 0.84. The GUIDE, the single tree model, provided a more reasonable biological interpretation, using the 10 selected markers (A. vaginae, G. vaginalis, Lactobacillus crispatus, Lactobacillus fornicalis, Lactobacillus gasseri, Lactobacillus iners, Lactobacillus jensenii, Peptoniphilus grossensis, P. timonensis, and U. parvum), and the covariates produced a tree with a test area under the curve of 0.77. It was confirmed that the association with preterm birth increased when P. timonensis and U. parvum increased (AUC = 0.77), which could also be explained by the fact that as the number of Peptoniphilus lacrimalis increased, the association with preterm birth was high (AUC = 0.77). Our study demonstrates that several candidate bacteria could be used as potential predictors for preterm birth, and that the predictive rate can be increased through a machine learning model employing a combination of cervical length and white blood cell count information.
format Online
Article
Text
id pubmed-9378785
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-93787852022-08-17 Predicting preterm birth through vaginal microbiota, cervical length, and WBC using a machine learning model Park, Sunwha Moon, Jeongsup Kang, Nayeon Kim, Young-Han You, Young-Ah Kwon, Eunjin Ansari, AbuZar Hur, Young Min Park, Taesung Kim, Young Ju Front Microbiol Microbiology An association between the vaginal microbiome and preterm birth has been reported. However, in practice, it is difficult to predict premature birth using the microbiome because the vaginal microbial community varies highly among samples depending on the individual, and the prediction rate is very low. The purpose of this study was to select markers that improve predictive power through machine learning among various vaginal microbiota and develop a prediction algorithm with better predictive power that combines clinical information. As a multicenter case–control study with 150 Korean pregnant women with 54 preterm delivery group and 96 full-term delivery group, cervicovaginal fluid was collected from pregnant women during mid-pregnancy. Their demographic profiles (age, BMI, education level, and PTB history), white blood cell count, and cervical length were recorded, and the microbiome profiles of the cervicovaginal fluid were analyzed. The subjects were randomly divided into a training (n = 101) and a test set (n = 49) in a two-to-one ratio. When training ML models using selected markers, five-fold cross-validation was performed on the training set. A univariate analysis was performed to select markers using seven statistical tests, including the Wilcoxon rank-sum test. Using the selected markers, including Lactobacillus spp., Gardnerella vaginalis, Ureaplasma parvum, Atopobium vaginae, Prevotella timonensis, and Peptoniphilus grossensis, machine learning models (logistic regression, random forest, extreme gradient boosting, support vector machine, and GUIDE) were used to build prediction models. The test area under the curve of the logistic regression model was 0.72 when it was trained with the 17 selected markers. When analyzed by combining white blood cell count and cervical length with the seven vaginal microbiome markers, the random forest model showed the highest test area under the curve of 0.84. The GUIDE, the single tree model, provided a more reasonable biological interpretation, using the 10 selected markers (A. vaginae, G. vaginalis, Lactobacillus crispatus, Lactobacillus fornicalis, Lactobacillus gasseri, Lactobacillus iners, Lactobacillus jensenii, Peptoniphilus grossensis, P. timonensis, and U. parvum), and the covariates produced a tree with a test area under the curve of 0.77. It was confirmed that the association with preterm birth increased when P. timonensis and U. parvum increased (AUC = 0.77), which could also be explained by the fact that as the number of Peptoniphilus lacrimalis increased, the association with preterm birth was high (AUC = 0.77). Our study demonstrates that several candidate bacteria could be used as potential predictors for preterm birth, and that the predictive rate can be increased through a machine learning model employing a combination of cervical length and white blood cell count information. Frontiers Media S.A. 2022-08-02 /pmc/articles/PMC9378785/ /pubmed/35983325 http://dx.doi.org/10.3389/fmicb.2022.912853 Text en Copyright © 2022 Park, Moon, Kang, Kim, You, Kwon, Ansari, Hur, Park and Kim. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Park, Sunwha
Moon, Jeongsup
Kang, Nayeon
Kim, Young-Han
You, Young-Ah
Kwon, Eunjin
Ansari, AbuZar
Hur, Young Min
Park, Taesung
Kim, Young Ju
Predicting preterm birth through vaginal microbiota, cervical length, and WBC using a machine learning model
title Predicting preterm birth through vaginal microbiota, cervical length, and WBC using a machine learning model
title_full Predicting preterm birth through vaginal microbiota, cervical length, and WBC using a machine learning model
title_fullStr Predicting preterm birth through vaginal microbiota, cervical length, and WBC using a machine learning model
title_full_unstemmed Predicting preterm birth through vaginal microbiota, cervical length, and WBC using a machine learning model
title_short Predicting preterm birth through vaginal microbiota, cervical length, and WBC using a machine learning model
title_sort predicting preterm birth through vaginal microbiota, cervical length, and wbc using a machine learning model
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9378785/
https://www.ncbi.nlm.nih.gov/pubmed/35983325
http://dx.doi.org/10.3389/fmicb.2022.912853
work_keys_str_mv AT parksunwha predictingpretermbirththroughvaginalmicrobiotacervicallengthandwbcusingamachinelearningmodel
AT moonjeongsup predictingpretermbirththroughvaginalmicrobiotacervicallengthandwbcusingamachinelearningmodel
AT kangnayeon predictingpretermbirththroughvaginalmicrobiotacervicallengthandwbcusingamachinelearningmodel
AT kimyounghan predictingpretermbirththroughvaginalmicrobiotacervicallengthandwbcusingamachinelearningmodel
AT youyoungah predictingpretermbirththroughvaginalmicrobiotacervicallengthandwbcusingamachinelearningmodel
AT kwoneunjin predictingpretermbirththroughvaginalmicrobiotacervicallengthandwbcusingamachinelearningmodel
AT ansariabuzar predictingpretermbirththroughvaginalmicrobiotacervicallengthandwbcusingamachinelearningmodel
AT huryoungmin predictingpretermbirththroughvaginalmicrobiotacervicallengthandwbcusingamachinelearningmodel
AT parktaesung predictingpretermbirththroughvaginalmicrobiotacervicallengthandwbcusingamachinelearningmodel
AT kimyoungju predictingpretermbirththroughvaginalmicrobiotacervicallengthandwbcusingamachinelearningmodel