Cargando…

Pipeline for amplifying and analyzing amplicons of the V1–V3 region of the 16S rRNA gene

BACKGROUND: Profiling of 16S rRNA gene sequences is an important tool for testing hypotheses in complex microbial communities, and analysis methods must be updated and validated as sequencing technologies advance. In host-associated bacterial communities, the V1–V3 region of the 16S rRNA gene is a v...

Descripción completa

Detalles Bibliográficos
Autores principales: Allen, Heather K., Bayles, Darrell O., Looft, Torey, Trachsel, Julian, Bass, Benjamin E., Alt, David P., Bearson, Shawn M. D., Nicholson, Tracy, Casey, Thomas A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4970291/
https://www.ncbi.nlm.nih.gov/pubmed/27485508
http://dx.doi.org/10.1186/s13104-016-2172-6
_version_ 1782445951737135104
author Allen, Heather K.
Bayles, Darrell O.
Looft, Torey
Trachsel, Julian
Bass, Benjamin E.
Alt, David P.
Bearson, Shawn M. D.
Nicholson, Tracy
Casey, Thomas A.
author_facet Allen, Heather K.
Bayles, Darrell O.
Looft, Torey
Trachsel, Julian
Bass, Benjamin E.
Alt, David P.
Bearson, Shawn M. D.
Nicholson, Tracy
Casey, Thomas A.
author_sort Allen, Heather K.
collection PubMed
description BACKGROUND: Profiling of 16S rRNA gene sequences is an important tool for testing hypotheses in complex microbial communities, and analysis methods must be updated and validated as sequencing technologies advance. In host-associated bacterial communities, the V1–V3 region of the 16S rRNA gene is a valuable region to profile because it provides a useful level of taxonomic resolution; however, use of Illumina MiSeq data for experiments targeting this region needs validation. RESULTS: Using a MiSeq machine and the version 3 (300 × 2) chemistry, we sequenced the V1–V3 region of the 16S rRNA gene within a mock community. Nineteen bacteria and one archaeon comprised the mock community, and 12 replicate amplifications of the community were performed and sequenced. Sequencing the large fragment (490 bp) that encompasses V1–V3 yielded a higher error rate (3.6 %) than has been reported when using smaller fragment sizes. This higher error rate was due to a large number of sequences that occurred only one or two times among all mock community samples. Removing sequences that occurred one time among all samples (singletons) reduced the error rate to 1.4 %. Diversity estimates of the mock community containing all sequences were inflated, whereas estimates following singleton removal more closely reflected the actual mock community membership. A higher percentage of the sequences could be taxonomically assigned after singleton and doubleton sequences were removed, and the assignments reflected the membership of the input DNA. CONCLUSIONS: Sequencing the V1–V3 region of the 16S rRNA gene on the MiSeq platform may require additional sequence curation in silico, and improved error rates and diversity estimates show that removing low-frequency sequences is reasonable. When datasets have a high number of singletons, these singletons can be removed from the analysis without losing statistical power while reducing error and improving microbiota assessment. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13104-016-2172-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4970291
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49702912016-08-03 Pipeline for amplifying and analyzing amplicons of the V1–V3 region of the 16S rRNA gene Allen, Heather K. Bayles, Darrell O. Looft, Torey Trachsel, Julian Bass, Benjamin E. Alt, David P. Bearson, Shawn M. D. Nicholson, Tracy Casey, Thomas A. BMC Res Notes Technical Note BACKGROUND: Profiling of 16S rRNA gene sequences is an important tool for testing hypotheses in complex microbial communities, and analysis methods must be updated and validated as sequencing technologies advance. In host-associated bacterial communities, the V1–V3 region of the 16S rRNA gene is a valuable region to profile because it provides a useful level of taxonomic resolution; however, use of Illumina MiSeq data for experiments targeting this region needs validation. RESULTS: Using a MiSeq machine and the version 3 (300 × 2) chemistry, we sequenced the V1–V3 region of the 16S rRNA gene within a mock community. Nineteen bacteria and one archaeon comprised the mock community, and 12 replicate amplifications of the community were performed and sequenced. Sequencing the large fragment (490 bp) that encompasses V1–V3 yielded a higher error rate (3.6 %) than has been reported when using smaller fragment sizes. This higher error rate was due to a large number of sequences that occurred only one or two times among all mock community samples. Removing sequences that occurred one time among all samples (singletons) reduced the error rate to 1.4 %. Diversity estimates of the mock community containing all sequences were inflated, whereas estimates following singleton removal more closely reflected the actual mock community membership. A higher percentage of the sequences could be taxonomically assigned after singleton and doubleton sequences were removed, and the assignments reflected the membership of the input DNA. CONCLUSIONS: Sequencing the V1–V3 region of the 16S rRNA gene on the MiSeq platform may require additional sequence curation in silico, and improved error rates and diversity estimates show that removing low-frequency sequences is reasonable. When datasets have a high number of singletons, these singletons can be removed from the analysis without losing statistical power while reducing error and improving microbiota assessment. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13104-016-2172-6) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-02 /pmc/articles/PMC4970291/ /pubmed/27485508 http://dx.doi.org/10.1186/s13104-016-2172-6 Text en © The Author(s) 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Technical Note
Allen, Heather K.
Bayles, Darrell O.
Looft, Torey
Trachsel, Julian
Bass, Benjamin E.
Alt, David P.
Bearson, Shawn M. D.
Nicholson, Tracy
Casey, Thomas A.
Pipeline for amplifying and analyzing amplicons of the V1–V3 region of the 16S rRNA gene
title Pipeline for amplifying and analyzing amplicons of the V1–V3 region of the 16S rRNA gene
title_full Pipeline for amplifying and analyzing amplicons of the V1–V3 region of the 16S rRNA gene
title_fullStr Pipeline for amplifying and analyzing amplicons of the V1–V3 region of the 16S rRNA gene
title_full_unstemmed Pipeline for amplifying and analyzing amplicons of the V1–V3 region of the 16S rRNA gene
title_short Pipeline for amplifying and analyzing amplicons of the V1–V3 region of the 16S rRNA gene
title_sort pipeline for amplifying and analyzing amplicons of the v1–v3 region of the 16s rrna gene
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4970291/
https://www.ncbi.nlm.nih.gov/pubmed/27485508
http://dx.doi.org/10.1186/s13104-016-2172-6
work_keys_str_mv AT allenheatherk pipelineforamplifyingandanalyzingampliconsofthev1v3regionofthe16srrnagene
AT baylesdarrello pipelineforamplifyingandanalyzingampliconsofthev1v3regionofthe16srrnagene
AT loofttorey pipelineforamplifyingandanalyzingampliconsofthev1v3regionofthe16srrnagene
AT trachseljulian pipelineforamplifyingandanalyzingampliconsofthev1v3regionofthe16srrnagene
AT bassbenjamine pipelineforamplifyingandanalyzingampliconsofthev1v3regionofthe16srrnagene
AT altdavidp pipelineforamplifyingandanalyzingampliconsofthev1v3regionofthe16srrnagene
AT bearsonshawnmd pipelineforamplifyingandanalyzingampliconsofthev1v3regionofthe16srrnagene
AT nicholsontracy pipelineforamplifyingandanalyzingampliconsofthev1v3regionofthe16srrnagene
AT caseythomasa pipelineforamplifyingandanalyzingampliconsofthev1v3regionofthe16srrnagene