Cargando…

An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea

Reference phylogenies are crucial for providing a taxonomic framework for interpretation of marker gene and metagenomic surveys, which continue to reveal novel species at a remarkable rate. Greengenes is a dedicated full-length 16S rRNA gene database that provides users with a curated taxonomy based...

Descripción completa

Detalles Bibliográficos
Autores principales: McDonald, Daniel, Price, Morgan N, Goodrich, Julia, Nawrocki, Eric P, DeSantis, Todd Z, Probst, Alexander, Andersen, Gary L, Knight, Rob, Hugenholtz, Philip
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3280142/
https://www.ncbi.nlm.nih.gov/pubmed/22134646
http://dx.doi.org/10.1038/ismej.2011.139
_version_ 1782223778107883520
author McDonald, Daniel
Price, Morgan N
Goodrich, Julia
Nawrocki, Eric P
DeSantis, Todd Z
Probst, Alexander
Andersen, Gary L
Knight, Rob
Hugenholtz, Philip
author_facet McDonald, Daniel
Price, Morgan N
Goodrich, Julia
Nawrocki, Eric P
DeSantis, Todd Z
Probst, Alexander
Andersen, Gary L
Knight, Rob
Hugenholtz, Philip
author_sort McDonald, Daniel
collection PubMed
description Reference phylogenies are crucial for providing a taxonomic framework for interpretation of marker gene and metagenomic surveys, which continue to reveal novel species at a remarkable rate. Greengenes is a dedicated full-length 16S rRNA gene database that provides users with a curated taxonomy based on de novo tree inference. We developed a ‘taxonomy to tree' approach for transferring group names from an existing taxonomy to a tree topology, and used it to apply the Greengenes, National Center for Biotechnology Information (NCBI) and cyanoDB (Cyanobacteria only) taxonomies to a de novo tree comprising 408 315 sequences. We also incorporated explicit rank information provided by the NCBI taxonomy to group names (by prefixing rank designations) for better user orientation and classification consistency. The resulting merged taxonomy improved the classification of 75% of the sequences by one or more ranks relative to the original NCBI taxonomy with the most pronounced improvements occurring in under-classified environmental sequences. We also assessed candidate phyla (divisions) currently defined by NCBI and present recommendations for consolidation of 34 redundantly named groups. All intermediate results from the pipeline, which includes tree inference, jackknifing and transfer of a donor taxonomy to a recipient tree (tax2tree) are available for download. The improved Greengenes taxonomy should provide important infrastructure for a wide range of megasequencing projects studying ecosystems on scales ranging from our own bodies (the Human Microbiome Project) to the entire planet (the Earth Microbiome Project). The implementation of the software can be obtained from http://sourceforge.net/projects/tax2tree/.
format Online
Article
Text
id pubmed-3280142
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-32801422012-03-01 An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea McDonald, Daniel Price, Morgan N Goodrich, Julia Nawrocki, Eric P DeSantis, Todd Z Probst, Alexander Andersen, Gary L Knight, Rob Hugenholtz, Philip ISME J Original Article Reference phylogenies are crucial for providing a taxonomic framework for interpretation of marker gene and metagenomic surveys, which continue to reveal novel species at a remarkable rate. Greengenes is a dedicated full-length 16S rRNA gene database that provides users with a curated taxonomy based on de novo tree inference. We developed a ‘taxonomy to tree' approach for transferring group names from an existing taxonomy to a tree topology, and used it to apply the Greengenes, National Center for Biotechnology Information (NCBI) and cyanoDB (Cyanobacteria only) taxonomies to a de novo tree comprising 408 315 sequences. We also incorporated explicit rank information provided by the NCBI taxonomy to group names (by prefixing rank designations) for better user orientation and classification consistency. The resulting merged taxonomy improved the classification of 75% of the sequences by one or more ranks relative to the original NCBI taxonomy with the most pronounced improvements occurring in under-classified environmental sequences. We also assessed candidate phyla (divisions) currently defined by NCBI and present recommendations for consolidation of 34 redundantly named groups. All intermediate results from the pipeline, which includes tree inference, jackknifing and transfer of a donor taxonomy to a recipient tree (tax2tree) are available for download. The improved Greengenes taxonomy should provide important infrastructure for a wide range of megasequencing projects studying ecosystems on scales ranging from our own bodies (the Human Microbiome Project) to the entire planet (the Earth Microbiome Project). The implementation of the software can be obtained from http://sourceforge.net/projects/tax2tree/. Nature Publishing Group 2012-03 2011-12-01 /pmc/articles/PMC3280142/ /pubmed/22134646 http://dx.doi.org/10.1038/ismej.2011.139 Text en Copyright © 2012 International Society for Microbial Ecology http://creativecommons.org/licenses/by-nc-sa/3.0/ This work is licensed under the Creative Commons Attribution-NonCommercial-Share Alike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/
spellingShingle Original Article
McDonald, Daniel
Price, Morgan N
Goodrich, Julia
Nawrocki, Eric P
DeSantis, Todd Z
Probst, Alexander
Andersen, Gary L
Knight, Rob
Hugenholtz, Philip
An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea
title An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea
title_full An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea
title_fullStr An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea
title_full_unstemmed An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea
title_short An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea
title_sort improved greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3280142/
https://www.ncbi.nlm.nih.gov/pubmed/22134646
http://dx.doi.org/10.1038/ismej.2011.139
work_keys_str_mv AT mcdonalddaniel animprovedgreengenestaxonomywithexplicitranksforecologicalandevolutionaryanalysesofbacteriaandarchaea
AT pricemorgann animprovedgreengenestaxonomywithexplicitranksforecologicalandevolutionaryanalysesofbacteriaandarchaea
AT goodrichjulia animprovedgreengenestaxonomywithexplicitranksforecologicalandevolutionaryanalysesofbacteriaandarchaea
AT nawrockiericp animprovedgreengenestaxonomywithexplicitranksforecologicalandevolutionaryanalysesofbacteriaandarchaea
AT desantistoddz animprovedgreengenestaxonomywithexplicitranksforecologicalandevolutionaryanalysesofbacteriaandarchaea
AT probstalexander animprovedgreengenestaxonomywithexplicitranksforecologicalandevolutionaryanalysesofbacteriaandarchaea
AT andersengaryl animprovedgreengenestaxonomywithexplicitranksforecologicalandevolutionaryanalysesofbacteriaandarchaea
AT knightrob animprovedgreengenestaxonomywithexplicitranksforecologicalandevolutionaryanalysesofbacteriaandarchaea
AT hugenholtzphilip animprovedgreengenestaxonomywithexplicitranksforecologicalandevolutionaryanalysesofbacteriaandarchaea
AT mcdonalddaniel improvedgreengenestaxonomywithexplicitranksforecologicalandevolutionaryanalysesofbacteriaandarchaea
AT pricemorgann improvedgreengenestaxonomywithexplicitranksforecologicalandevolutionaryanalysesofbacteriaandarchaea
AT goodrichjulia improvedgreengenestaxonomywithexplicitranksforecologicalandevolutionaryanalysesofbacteriaandarchaea
AT nawrockiericp improvedgreengenestaxonomywithexplicitranksforecologicalandevolutionaryanalysesofbacteriaandarchaea
AT desantistoddz improvedgreengenestaxonomywithexplicitranksforecologicalandevolutionaryanalysesofbacteriaandarchaea
AT probstalexander improvedgreengenestaxonomywithexplicitranksforecologicalandevolutionaryanalysesofbacteriaandarchaea
AT andersengaryl improvedgreengenestaxonomywithexplicitranksforecologicalandevolutionaryanalysesofbacteriaandarchaea
AT knightrob improvedgreengenestaxonomywithexplicitranksforecologicalandevolutionaryanalysesofbacteriaandarchaea
AT hugenholtzphilip improvedgreengenestaxonomywithexplicitranksforecologicalandevolutionaryanalysesofbacteriaandarchaea