Cargando…

The effect of training set on the classification of honey bee gut microbiota using the Naïve Bayesian Classifier

BACKGROUND: Microbial ecologists now routinely utilize next-generation sequencing methods to assess microbial diversity in the environment. One tool heavily utilized by many groups is the Naïve Bayesian Classifier developed by the Ribosomal Database Project (RDP-NBC). However, the consistency and co...

Descripción completa

Detalles Bibliográficos
Autores principales: Newton, Irene LG, Roeselers, Guus
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3520854/
https://www.ncbi.nlm.nih.gov/pubmed/23013113
http://dx.doi.org/10.1186/1471-2180-12-221
_version_ 1782252846535671808
author Newton, Irene LG
Roeselers, Guus
author_facet Newton, Irene LG
Roeselers, Guus
author_sort Newton, Irene LG
collection PubMed
description BACKGROUND: Microbial ecologists now routinely utilize next-generation sequencing methods to assess microbial diversity in the environment. One tool heavily utilized by many groups is the Naïve Bayesian Classifier developed by the Ribosomal Database Project (RDP-NBC). However, the consistency and confidence of classifications provided by the RDP-NBC is dependent on the training set utilized. RESULTS: We explored the stability of classification of honey bee gut microbiota sequences by the RDP-NBC utilizing three publically available ribosomal RNA sequence databases as training sets: ARB-SILVA, Greengenes and RDP. We found that the inclusion of previously published, high-quality, full-length sequences from 16S rRNA clone libraries improved the precision in classification of novel bee-associated sequences. Specifically, by including bee-specific 16S rRNA gene sequences a larger fraction of sequences were classified at a higher confidence by the RDP-NBC (based on bootstrap scores). CONCLUSIONS: Results from the analysis of these bee-associated sequences have ramifications for other environments represented by few sequences in the public databases or few bacterial isolates. We conclude that for the exploration of relatively novel habitats, the inclusion of high-quality, full-length 16S rRNA gene sequences allows for a more confident taxonomic classification.
format Online
Article
Text
id pubmed-3520854
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35208542012-12-13 The effect of training set on the classification of honey bee gut microbiota using the Naïve Bayesian Classifier Newton, Irene LG Roeselers, Guus BMC Microbiol Research Article BACKGROUND: Microbial ecologists now routinely utilize next-generation sequencing methods to assess microbial diversity in the environment. One tool heavily utilized by many groups is the Naïve Bayesian Classifier developed by the Ribosomal Database Project (RDP-NBC). However, the consistency and confidence of classifications provided by the RDP-NBC is dependent on the training set utilized. RESULTS: We explored the stability of classification of honey bee gut microbiota sequences by the RDP-NBC utilizing three publically available ribosomal RNA sequence databases as training sets: ARB-SILVA, Greengenes and RDP. We found that the inclusion of previously published, high-quality, full-length sequences from 16S rRNA clone libraries improved the precision in classification of novel bee-associated sequences. Specifically, by including bee-specific 16S rRNA gene sequences a larger fraction of sequences were classified at a higher confidence by the RDP-NBC (based on bootstrap scores). CONCLUSIONS: Results from the analysis of these bee-associated sequences have ramifications for other environments represented by few sequences in the public databases or few bacterial isolates. We conclude that for the exploration of relatively novel habitats, the inclusion of high-quality, full-length 16S rRNA gene sequences allows for a more confident taxonomic classification. BioMed Central 2012-09-26 /pmc/articles/PMC3520854/ /pubmed/23013113 http://dx.doi.org/10.1186/1471-2180-12-221 Text en Copyright ©2012 Newton and Roeselers; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Newton, Irene LG
Roeselers, Guus
The effect of training set on the classification of honey bee gut microbiota using the Naïve Bayesian Classifier
title The effect of training set on the classification of honey bee gut microbiota using the Naïve Bayesian Classifier
title_full The effect of training set on the classification of honey bee gut microbiota using the Naïve Bayesian Classifier
title_fullStr The effect of training set on the classification of honey bee gut microbiota using the Naïve Bayesian Classifier
title_full_unstemmed The effect of training set on the classification of honey bee gut microbiota using the Naïve Bayesian Classifier
title_short The effect of training set on the classification of honey bee gut microbiota using the Naïve Bayesian Classifier
title_sort effect of training set on the classification of honey bee gut microbiota using the naïve bayesian classifier
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3520854/
https://www.ncbi.nlm.nih.gov/pubmed/23013113
http://dx.doi.org/10.1186/1471-2180-12-221
work_keys_str_mv AT newtonirenelg theeffectoftrainingsetontheclassificationofhoneybeegutmicrobiotausingthenaivebayesianclassifier
AT roeselersguus theeffectoftrainingsetontheclassificationofhoneybeegutmicrobiotausingthenaivebayesianclassifier
AT newtonirenelg effectoftrainingsetontheclassificationofhoneybeegutmicrobiotausingthenaivebayesianclassifier
AT roeselersguus effectoftrainingsetontheclassificationofhoneybeegutmicrobiotausingthenaivebayesianclassifier