Cargando…

Comparing bioinformatic pipelines for microbial 16S rRNA amplicon sequencing

Microbial amplicon sequencing studies are an important tool in biological and biomedical research. Widespread 16S rRNA gene microbial surveys have shed light on the structure of many ecosystems inhabited by bacteria, including the human body. However, specialized software and algorithms are needed t...

Descripción completa

Detalles Bibliográficos
Autores principales: Prodan, Andrei, Tremaroli, Valentina, Brolin, Harald, Zwinderman, Aeilko H., Nieuwdorp, Max, Levin, Evgeni
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6964864/
https://www.ncbi.nlm.nih.gov/pubmed/31945086
http://dx.doi.org/10.1371/journal.pone.0227434
_version_ 1783488537004343296
author Prodan, Andrei
Tremaroli, Valentina
Brolin, Harald
Zwinderman, Aeilko H.
Nieuwdorp, Max
Levin, Evgeni
author_facet Prodan, Andrei
Tremaroli, Valentina
Brolin, Harald
Zwinderman, Aeilko H.
Nieuwdorp, Max
Levin, Evgeni
author_sort Prodan, Andrei
collection PubMed
description Microbial amplicon sequencing studies are an important tool in biological and biomedical research. Widespread 16S rRNA gene microbial surveys have shed light on the structure of many ecosystems inhabited by bacteria, including the human body. However, specialized software and algorithms are needed to convert raw sequencing data into biologically meaningful information (i.e. tables of bacterial counts). While different bioinformatic pipelines are available in a rapidly changing and improving field, users are often unaware of limitations and biases associated with individual pipelines and there is a lack of agreement regarding best practices. Here, we compared six bioinformatic pipelines for the analysis of amplicon sequence data: three OTU-level flows (QIIME-uclust, MOTHUR, and USEARCH-UPARSE) and three ASV-level (DADA2, Qiime2-Deblur, and USEARCH-UNOISE3). We tested workflows with different quality control options, clustering algorithms, and cutoff parameters on a mock community as well as on a large (N = 2170) recently published fecal sample dataset from the multi-ethnic HELIUS study. We assessed the sensitivity, specificity, and degree of consensus of the different outputs. DADA2 offered the best sensitivity, at the expense of decreased specificity compared to USEARCH-UNOISE3 and Qiime2-Deblur. USEARCH-UNOISE3 showed the best balance between resolution and specificity. OTU-level USEARCH-UPARSE and MOTHUR performed well, but with lower specificity than ASV-level pipelines. QIIME-uclust produced large number of spurious OTUs as well as inflated alpha-diversity measures and should be avoided in future studies. This study provides guidance for researchers using amplicon sequencing to gain biological insights.
format Online
Article
Text
id pubmed-6964864
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-69648642020-01-26 Comparing bioinformatic pipelines for microbial 16S rRNA amplicon sequencing Prodan, Andrei Tremaroli, Valentina Brolin, Harald Zwinderman, Aeilko H. Nieuwdorp, Max Levin, Evgeni PLoS One Research Article Microbial amplicon sequencing studies are an important tool in biological and biomedical research. Widespread 16S rRNA gene microbial surveys have shed light on the structure of many ecosystems inhabited by bacteria, including the human body. However, specialized software and algorithms are needed to convert raw sequencing data into biologically meaningful information (i.e. tables of bacterial counts). While different bioinformatic pipelines are available in a rapidly changing and improving field, users are often unaware of limitations and biases associated with individual pipelines and there is a lack of agreement regarding best practices. Here, we compared six bioinformatic pipelines for the analysis of amplicon sequence data: three OTU-level flows (QIIME-uclust, MOTHUR, and USEARCH-UPARSE) and three ASV-level (DADA2, Qiime2-Deblur, and USEARCH-UNOISE3). We tested workflows with different quality control options, clustering algorithms, and cutoff parameters on a mock community as well as on a large (N = 2170) recently published fecal sample dataset from the multi-ethnic HELIUS study. We assessed the sensitivity, specificity, and degree of consensus of the different outputs. DADA2 offered the best sensitivity, at the expense of decreased specificity compared to USEARCH-UNOISE3 and Qiime2-Deblur. USEARCH-UNOISE3 showed the best balance between resolution and specificity. OTU-level USEARCH-UPARSE and MOTHUR performed well, but with lower specificity than ASV-level pipelines. QIIME-uclust produced large number of spurious OTUs as well as inflated alpha-diversity measures and should be avoided in future studies. This study provides guidance for researchers using amplicon sequencing to gain biological insights. Public Library of Science 2020-01-16 /pmc/articles/PMC6964864/ /pubmed/31945086 http://dx.doi.org/10.1371/journal.pone.0227434 Text en © 2020 Prodan et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Prodan, Andrei
Tremaroli, Valentina
Brolin, Harald
Zwinderman, Aeilko H.
Nieuwdorp, Max
Levin, Evgeni
Comparing bioinformatic pipelines for microbial 16S rRNA amplicon sequencing
title Comparing bioinformatic pipelines for microbial 16S rRNA amplicon sequencing
title_full Comparing bioinformatic pipelines for microbial 16S rRNA amplicon sequencing
title_fullStr Comparing bioinformatic pipelines for microbial 16S rRNA amplicon sequencing
title_full_unstemmed Comparing bioinformatic pipelines for microbial 16S rRNA amplicon sequencing
title_short Comparing bioinformatic pipelines for microbial 16S rRNA amplicon sequencing
title_sort comparing bioinformatic pipelines for microbial 16s rrna amplicon sequencing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6964864/
https://www.ncbi.nlm.nih.gov/pubmed/31945086
http://dx.doi.org/10.1371/journal.pone.0227434
work_keys_str_mv AT prodanandrei comparingbioinformaticpipelinesformicrobial16srrnaampliconsequencing
AT tremarolivalentina comparingbioinformaticpipelinesformicrobial16srrnaampliconsequencing
AT brolinharald comparingbioinformaticpipelinesformicrobial16srrnaampliconsequencing
AT zwindermanaeilkoh comparingbioinformaticpipelinesformicrobial16srrnaampliconsequencing
AT nieuwdorpmax comparingbioinformaticpipelinesformicrobial16srrnaampliconsequencing
AT levinevgeni comparingbioinformaticpipelinesformicrobial16srrnaampliconsequencing