Cargando…
Machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis
BACKGROUND: Bacterial vaginosis (BV) is a disease associated with the vagina microbiome. It is highly prevalent and is characterized by symptoms including odor, discharge and irritation. No single microbe has been found to cause BV. In this paper we use random forests and logistic regression classif...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4542107/ https://www.ncbi.nlm.nih.gov/pubmed/26294933 http://dx.doi.org/10.1186/s13040-015-0055-3 |
_version_ | 1782386487027826688 |
---|---|
author | Beck, Daniel Foster, James A. |
author_facet | Beck, Daniel Foster, James A. |
author_sort | Beck, Daniel |
collection | PubMed |
description | BACKGROUND: Bacterial vaginosis (BV) is a disease associated with the vagina microbiome. It is highly prevalent and is characterized by symptoms including odor, discharge and irritation. No single microbe has been found to cause BV. In this paper we use random forests and logistic regression classifiers to model the relationship between the microbial community and BV. We use subsets of the microbial community features in order to determine which features are important to the classification models. RESULTS: We find that models generated using logistic regression and random forests perform nearly identically and identify largely similar important features. Only a few features are necessary to obtain high BV classification accuracy. Additionally, there appears to be substantial redundancy between the microbial community features. CONCLUSIONS: These results are in contrast to a previous study in which the important features identified by the classifiers were dissimilar. This difference appears to be the result of using different feature importance measures. It is not clear whether machine learning classifiers are capturing patterns different from simple correlations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-015-0055-3) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4542107 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-45421072015-08-21 Machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis Beck, Daniel Foster, James A. BioData Min Research BACKGROUND: Bacterial vaginosis (BV) is a disease associated with the vagina microbiome. It is highly prevalent and is characterized by symptoms including odor, discharge and irritation. No single microbe has been found to cause BV. In this paper we use random forests and logistic regression classifiers to model the relationship between the microbial community and BV. We use subsets of the microbial community features in order to determine which features are important to the classification models. RESULTS: We find that models generated using logistic regression and random forests perform nearly identically and identify largely similar important features. Only a few features are necessary to obtain high BV classification accuracy. Additionally, there appears to be substantial redundancy between the microbial community features. CONCLUSIONS: These results are in contrast to a previous study in which the important features identified by the classifiers were dissimilar. This difference appears to be the result of using different feature importance measures. It is not clear whether machine learning classifiers are capturing patterns different from simple correlations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-015-0055-3) contains supplementary material, which is available to authorized users. BioMed Central 2015-08-12 /pmc/articles/PMC4542107/ /pubmed/26294933 http://dx.doi.org/10.1186/s13040-015-0055-3 Text en © Beck and Foster; licensee BioMed Central. 2015 This article is published under license to BioMed Central Ltd. Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/ (http://creativecommons.org/licenses/by/4.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Beck, Daniel Foster, James A. Machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis |
title | Machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis |
title_full | Machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis |
title_fullStr | Machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis |
title_full_unstemmed | Machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis |
title_short | Machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis |
title_sort | machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4542107/ https://www.ncbi.nlm.nih.gov/pubmed/26294933 http://dx.doi.org/10.1186/s13040-015-0055-3 |
work_keys_str_mv | AT beckdaniel machinelearningclassifiersprovideinsightintotherelationshipbetweenmicrobialcommunitiesandbacterialvaginosis AT fosterjamesa machinelearningclassifiersprovideinsightintotherelationshipbetweenmicrobialcommunitiesandbacterialvaginosis |