Cargando…

A comprehensive evaluation of the sl1p pipeline for 16S rRNA gene sequencing analysis

BACKGROUND: Advances in next-generation sequencing technologies have allowed for detailed, molecular-based studies of microbial communities such as the human gut, soil, and ocean waters. Sequencing of the 16S rRNA gene, specific to prokaryotes, using universal PCR primers has become a common approac...

Descripción completa

Detalles Bibliográficos
Autores principales: Whelan, Fiona J., Surette, Michael G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5557527/
https://www.ncbi.nlm.nih.gov/pubmed/28807046
http://dx.doi.org/10.1186/s40168-017-0314-2
_version_ 1783257223170883584
author Whelan, Fiona J.
Surette, Michael G.
author_facet Whelan, Fiona J.
Surette, Michael G.
author_sort Whelan, Fiona J.
collection PubMed
description BACKGROUND: Advances in next-generation sequencing technologies have allowed for detailed, molecular-based studies of microbial communities such as the human gut, soil, and ocean waters. Sequencing of the 16S rRNA gene, specific to prokaryotes, using universal PCR primers has become a common approach to studying the composition of these microbiota. However, the bioinformatic processing of the resulting millions of DNA sequences can be challenging, and a standardized protocol would aid in reproducible analyses. METHODS: The short-read library 16S rRNA gene sequencing pipeline (sl1p, pronounced “slip”) was designed with the purpose of mitigating this lack of reproducibility by combining pre-existing tools into a computational pipeline. This pipeline automates the processing of raw 16S rRNA gene sequencing data to create human-readable tables, graphs, and figures to make the collected data more readily accessible. RESULTS: Data generated from mock communities were compared using eight OTU clustering algorithms, two taxon assignment approaches, and three 16S rRNA gene reference databases. While all of these algorithms and options are available to sl1p users, through testing with human-associated mock communities, AbundantOTU+, the RDP Classifier, and the Greengenes 2011 reference database were chosen as sl1p’s defaults based on their ability to best represent the known input communities. CONCLUSIONS: sl1p promotes reproducible research by providing a comprehensive log file, and reduces the computational knowledge needed by the user to process next-generation sequencing data. sl1p is freely available at https://bitbucket.org/fwhelan/sl1p. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s40168-017-0314-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5557527
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-55575272017-08-16 A comprehensive evaluation of the sl1p pipeline for 16S rRNA gene sequencing analysis Whelan, Fiona J. Surette, Michael G. Microbiome Methodology Article BACKGROUND: Advances in next-generation sequencing technologies have allowed for detailed, molecular-based studies of microbial communities such as the human gut, soil, and ocean waters. Sequencing of the 16S rRNA gene, specific to prokaryotes, using universal PCR primers has become a common approach to studying the composition of these microbiota. However, the bioinformatic processing of the resulting millions of DNA sequences can be challenging, and a standardized protocol would aid in reproducible analyses. METHODS: The short-read library 16S rRNA gene sequencing pipeline (sl1p, pronounced “slip”) was designed with the purpose of mitigating this lack of reproducibility by combining pre-existing tools into a computational pipeline. This pipeline automates the processing of raw 16S rRNA gene sequencing data to create human-readable tables, graphs, and figures to make the collected data more readily accessible. RESULTS: Data generated from mock communities were compared using eight OTU clustering algorithms, two taxon assignment approaches, and three 16S rRNA gene reference databases. While all of these algorithms and options are available to sl1p users, through testing with human-associated mock communities, AbundantOTU+, the RDP Classifier, and the Greengenes 2011 reference database were chosen as sl1p’s defaults based on their ability to best represent the known input communities. CONCLUSIONS: sl1p promotes reproducible research by providing a comprehensive log file, and reduces the computational knowledge needed by the user to process next-generation sequencing data. sl1p is freely available at https://bitbucket.org/fwhelan/sl1p. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s40168-017-0314-2) contains supplementary material, which is available to authorized users. BioMed Central 2017-08-14 /pmc/articles/PMC5557527/ /pubmed/28807046 http://dx.doi.org/10.1186/s40168-017-0314-2 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Whelan, Fiona J.
Surette, Michael G.
A comprehensive evaluation of the sl1p pipeline for 16S rRNA gene sequencing analysis
title A comprehensive evaluation of the sl1p pipeline for 16S rRNA gene sequencing analysis
title_full A comprehensive evaluation of the sl1p pipeline for 16S rRNA gene sequencing analysis
title_fullStr A comprehensive evaluation of the sl1p pipeline for 16S rRNA gene sequencing analysis
title_full_unstemmed A comprehensive evaluation of the sl1p pipeline for 16S rRNA gene sequencing analysis
title_short A comprehensive evaluation of the sl1p pipeline for 16S rRNA gene sequencing analysis
title_sort comprehensive evaluation of the sl1p pipeline for 16s rrna gene sequencing analysis
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5557527/
https://www.ncbi.nlm.nih.gov/pubmed/28807046
http://dx.doi.org/10.1186/s40168-017-0314-2
work_keys_str_mv AT whelanfionaj acomprehensiveevaluationofthesl1ppipelinefor16srrnagenesequencinganalysis
AT surettemichaelg acomprehensiveevaluationofthesl1ppipelinefor16srrnagenesequencinganalysis
AT whelanfionaj comprehensiveevaluationofthesl1ppipelinefor16srrnagenesequencinganalysis
AT surettemichaelg comprehensiveevaluationofthesl1ppipelinefor16srrnagenesequencinganalysis