Cargando…

Machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis

BACKGROUND: Bacterial vaginosis (BV) is a disease associated with the vagina microbiome. It is highly prevalent and is characterized by symptoms including odor, discharge and irritation. No single microbe has been found to cause BV. In this paper we use random forests and logistic regression classif...

Descripción completa

Detalles Bibliográficos
Autores principales: Beck, Daniel, Foster, James A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4542107/
https://www.ncbi.nlm.nih.gov/pubmed/26294933
http://dx.doi.org/10.1186/s13040-015-0055-3
_version_ 1782386487027826688
author Beck, Daniel
Foster, James A.
author_facet Beck, Daniel
Foster, James A.
author_sort Beck, Daniel
collection PubMed
description BACKGROUND: Bacterial vaginosis (BV) is a disease associated with the vagina microbiome. It is highly prevalent and is characterized by symptoms including odor, discharge and irritation. No single microbe has been found to cause BV. In this paper we use random forests and logistic regression classifiers to model the relationship between the microbial community and BV. We use subsets of the microbial community features in order to determine which features are important to the classification models. RESULTS: We find that models generated using logistic regression and random forests perform nearly identically and identify largely similar important features. Only a few features are necessary to obtain high BV classification accuracy. Additionally, there appears to be substantial redundancy between the microbial community features. CONCLUSIONS: These results are in contrast to a previous study in which the important features identified by the classifiers were dissimilar. This difference appears to be the result of using different feature importance measures. It is not clear whether machine learning classifiers are capturing patterns different from simple correlations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-015-0055-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4542107
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45421072015-08-21 Machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis Beck, Daniel Foster, James A. BioData Min Research BACKGROUND: Bacterial vaginosis (BV) is a disease associated with the vagina microbiome. It is highly prevalent and is characterized by symptoms including odor, discharge and irritation. No single microbe has been found to cause BV. In this paper we use random forests and logistic regression classifiers to model the relationship between the microbial community and BV. We use subsets of the microbial community features in order to determine which features are important to the classification models. RESULTS: We find that models generated using logistic regression and random forests perform nearly identically and identify largely similar important features. Only a few features are necessary to obtain high BV classification accuracy. Additionally, there appears to be substantial redundancy between the microbial community features. CONCLUSIONS: These results are in contrast to a previous study in which the important features identified by the classifiers were dissimilar. This difference appears to be the result of using different feature importance measures. It is not clear whether machine learning classifiers are capturing patterns different from simple correlations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-015-0055-3) contains supplementary material, which is available to authorized users. BioMed Central 2015-08-12 /pmc/articles/PMC4542107/ /pubmed/26294933 http://dx.doi.org/10.1186/s13040-015-0055-3 Text en © Beck and Foster; licensee BioMed Central. 2015 This article is published under license to BioMed Central Ltd. Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/ (http://creativecommons.org/licenses/by/4.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Beck, Daniel
Foster, James A.
Machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis
title Machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis
title_full Machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis
title_fullStr Machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis
title_full_unstemmed Machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis
title_short Machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis
title_sort machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4542107/
https://www.ncbi.nlm.nih.gov/pubmed/26294933
http://dx.doi.org/10.1186/s13040-015-0055-3
work_keys_str_mv AT beckdaniel machinelearningclassifiersprovideinsightintotherelationshipbetweenmicrobialcommunitiesandbacterialvaginosis
AT fosterjamesa machinelearningclassifiersprovideinsightintotherelationshipbetweenmicrobialcommunitiesandbacterialvaginosis