Cargando…

Accurate taxonomy assignments from 16S rRNA sequences produced by highly parallel pyrosequencers

The recent introduction of massively parallel pyrosequencers allows rapid, inexpensive analysis of microbial community composition using 16S ribosomal RNA (rRNA) sequences. However, a major challenge is to design a workflow so that taxonomic information can be accurately and rapidly assigned to each...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Zongzhi, DeSantis, Todd Z., Andersen, Gary L., Knight, Rob
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2566877/
https://www.ncbi.nlm.nih.gov/pubmed/18723574
http://dx.doi.org/10.1093/nar/gkn491
_version_ 1782159966762696704
author Liu, Zongzhi
DeSantis, Todd Z.
Andersen, Gary L.
Knight, Rob
author_facet Liu, Zongzhi
DeSantis, Todd Z.
Andersen, Gary L.
Knight, Rob
author_sort Liu, Zongzhi
collection PubMed
description The recent introduction of massively parallel pyrosequencers allows rapid, inexpensive analysis of microbial community composition using 16S ribosomal RNA (rRNA) sequences. However, a major challenge is to design a workflow so that taxonomic information can be accurately and rapidly assigned to each read, so that the composition of each community can be linked back to likely ecological roles played by members of each species, genus, family or phylum. Here, we use three large 16S rRNA datasets to test whether taxonomic information based on the full-length sequences can be recaptured by short reads that simulate the pyrosequencer outputs. We find that different taxonomic assignment methods vary radically in their ability to recapture the taxonomic information in full-length 16S rRNA sequences: most methods are sensitive to the region of the 16S rRNA gene that is targeted for sequencing, but many combinations of methods and rRNA regions produce consistent and accurate results. To process large datasets of partial 16S rRNA sequences obtained from surveys of various microbial communities, including those from human body habitats, we recommend the use of Greengenes or RDP classifier with fragments of at least 250 bases, starting from one of the primers R357, R534, R798, F343 or F517.
format Text
id pubmed-2566877
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-25668772009-01-22 Accurate taxonomy assignments from 16S rRNA sequences produced by highly parallel pyrosequencers Liu, Zongzhi DeSantis, Todd Z. Andersen, Gary L. Knight, Rob Nucleic Acids Res Methods Online The recent introduction of massively parallel pyrosequencers allows rapid, inexpensive analysis of microbial community composition using 16S ribosomal RNA (rRNA) sequences. However, a major challenge is to design a workflow so that taxonomic information can be accurately and rapidly assigned to each read, so that the composition of each community can be linked back to likely ecological roles played by members of each species, genus, family or phylum. Here, we use three large 16S rRNA datasets to test whether taxonomic information based on the full-length sequences can be recaptured by short reads that simulate the pyrosequencer outputs. We find that different taxonomic assignment methods vary radically in their ability to recapture the taxonomic information in full-length 16S rRNA sequences: most methods are sensitive to the region of the 16S rRNA gene that is targeted for sequencing, but many combinations of methods and rRNA regions produce consistent and accurate results. To process large datasets of partial 16S rRNA sequences obtained from surveys of various microbial communities, including those from human body habitats, we recommend the use of Greengenes or RDP classifier with fragments of at least 250 bases, starting from one of the primers R357, R534, R798, F343 or F517. Oxford University Press 2008-10 2008-08-22 /pmc/articles/PMC2566877/ /pubmed/18723574 http://dx.doi.org/10.1093/nar/gkn491 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Liu, Zongzhi
DeSantis, Todd Z.
Andersen, Gary L.
Knight, Rob
Accurate taxonomy assignments from 16S rRNA sequences produced by highly parallel pyrosequencers
title Accurate taxonomy assignments from 16S rRNA sequences produced by highly parallel pyrosequencers
title_full Accurate taxonomy assignments from 16S rRNA sequences produced by highly parallel pyrosequencers
title_fullStr Accurate taxonomy assignments from 16S rRNA sequences produced by highly parallel pyrosequencers
title_full_unstemmed Accurate taxonomy assignments from 16S rRNA sequences produced by highly parallel pyrosequencers
title_short Accurate taxonomy assignments from 16S rRNA sequences produced by highly parallel pyrosequencers
title_sort accurate taxonomy assignments from 16s rrna sequences produced by highly parallel pyrosequencers
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2566877/
https://www.ncbi.nlm.nih.gov/pubmed/18723574
http://dx.doi.org/10.1093/nar/gkn491
work_keys_str_mv AT liuzongzhi accuratetaxonomyassignmentsfrom16srrnasequencesproducedbyhighlyparallelpyrosequencers
AT desantistoddz accuratetaxonomyassignmentsfrom16srrnasequencesproducedbyhighlyparallelpyrosequencers
AT andersengaryl accuratetaxonomyassignmentsfrom16srrnasequencesproducedbyhighlyparallelpyrosequencers
AT knightrob accuratetaxonomyassignmentsfrom16srrnasequencesproducedbyhighlyparallelpyrosequencers