Cargando…
Comparing bioinformatic pipelines for microbial 16S rRNA amplicon sequencing
Microbial amplicon sequencing studies are an important tool in biological and biomedical research. Widespread 16S rRNA gene microbial surveys have shed light on the structure of many ecosystems inhabited by bacteria, including the human body. However, specialized software and algorithms are needed t...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6964864/ https://www.ncbi.nlm.nih.gov/pubmed/31945086 http://dx.doi.org/10.1371/journal.pone.0227434 |
_version_ | 1783488537004343296 |
---|---|
author | Prodan, Andrei Tremaroli, Valentina Brolin, Harald Zwinderman, Aeilko H. Nieuwdorp, Max Levin, Evgeni |
author_facet | Prodan, Andrei Tremaroli, Valentina Brolin, Harald Zwinderman, Aeilko H. Nieuwdorp, Max Levin, Evgeni |
author_sort | Prodan, Andrei |
collection | PubMed |
description | Microbial amplicon sequencing studies are an important tool in biological and biomedical research. Widespread 16S rRNA gene microbial surveys have shed light on the structure of many ecosystems inhabited by bacteria, including the human body. However, specialized software and algorithms are needed to convert raw sequencing data into biologically meaningful information (i.e. tables of bacterial counts). While different bioinformatic pipelines are available in a rapidly changing and improving field, users are often unaware of limitations and biases associated with individual pipelines and there is a lack of agreement regarding best practices. Here, we compared six bioinformatic pipelines for the analysis of amplicon sequence data: three OTU-level flows (QIIME-uclust, MOTHUR, and USEARCH-UPARSE) and three ASV-level (DADA2, Qiime2-Deblur, and USEARCH-UNOISE3). We tested workflows with different quality control options, clustering algorithms, and cutoff parameters on a mock community as well as on a large (N = 2170) recently published fecal sample dataset from the multi-ethnic HELIUS study. We assessed the sensitivity, specificity, and degree of consensus of the different outputs. DADA2 offered the best sensitivity, at the expense of decreased specificity compared to USEARCH-UNOISE3 and Qiime2-Deblur. USEARCH-UNOISE3 showed the best balance between resolution and specificity. OTU-level USEARCH-UPARSE and MOTHUR performed well, but with lower specificity than ASV-level pipelines. QIIME-uclust produced large number of spurious OTUs as well as inflated alpha-diversity measures and should be avoided in future studies. This study provides guidance for researchers using amplicon sequencing to gain biological insights. |
format | Online Article Text |
id | pubmed-6964864 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-69648642020-01-26 Comparing bioinformatic pipelines for microbial 16S rRNA amplicon sequencing Prodan, Andrei Tremaroli, Valentina Brolin, Harald Zwinderman, Aeilko H. Nieuwdorp, Max Levin, Evgeni PLoS One Research Article Microbial amplicon sequencing studies are an important tool in biological and biomedical research. Widespread 16S rRNA gene microbial surveys have shed light on the structure of many ecosystems inhabited by bacteria, including the human body. However, specialized software and algorithms are needed to convert raw sequencing data into biologically meaningful information (i.e. tables of bacterial counts). While different bioinformatic pipelines are available in a rapidly changing and improving field, users are often unaware of limitations and biases associated with individual pipelines and there is a lack of agreement regarding best practices. Here, we compared six bioinformatic pipelines for the analysis of amplicon sequence data: three OTU-level flows (QIIME-uclust, MOTHUR, and USEARCH-UPARSE) and three ASV-level (DADA2, Qiime2-Deblur, and USEARCH-UNOISE3). We tested workflows with different quality control options, clustering algorithms, and cutoff parameters on a mock community as well as on a large (N = 2170) recently published fecal sample dataset from the multi-ethnic HELIUS study. We assessed the sensitivity, specificity, and degree of consensus of the different outputs. DADA2 offered the best sensitivity, at the expense of decreased specificity compared to USEARCH-UNOISE3 and Qiime2-Deblur. USEARCH-UNOISE3 showed the best balance between resolution and specificity. OTU-level USEARCH-UPARSE and MOTHUR performed well, but with lower specificity than ASV-level pipelines. QIIME-uclust produced large number of spurious OTUs as well as inflated alpha-diversity measures and should be avoided in future studies. This study provides guidance for researchers using amplicon sequencing to gain biological insights. Public Library of Science 2020-01-16 /pmc/articles/PMC6964864/ /pubmed/31945086 http://dx.doi.org/10.1371/journal.pone.0227434 Text en © 2020 Prodan et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Prodan, Andrei Tremaroli, Valentina Brolin, Harald Zwinderman, Aeilko H. Nieuwdorp, Max Levin, Evgeni Comparing bioinformatic pipelines for microbial 16S rRNA amplicon sequencing |
title | Comparing bioinformatic pipelines for microbial 16S rRNA amplicon sequencing |
title_full | Comparing bioinformatic pipelines for microbial 16S rRNA amplicon sequencing |
title_fullStr | Comparing bioinformatic pipelines for microbial 16S rRNA amplicon sequencing |
title_full_unstemmed | Comparing bioinformatic pipelines for microbial 16S rRNA amplicon sequencing |
title_short | Comparing bioinformatic pipelines for microbial 16S rRNA amplicon sequencing |
title_sort | comparing bioinformatic pipelines for microbial 16s rrna amplicon sequencing |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6964864/ https://www.ncbi.nlm.nih.gov/pubmed/31945086 http://dx.doi.org/10.1371/journal.pone.0227434 |
work_keys_str_mv | AT prodanandrei comparingbioinformaticpipelinesformicrobial16srrnaampliconsequencing AT tremarolivalentina comparingbioinformaticpipelinesformicrobial16srrnaampliconsequencing AT brolinharald comparingbioinformaticpipelinesformicrobial16srrnaampliconsequencing AT zwindermanaeilkoh comparingbioinformaticpipelinesformicrobial16srrnaampliconsequencing AT nieuwdorpmax comparingbioinformaticpipelinesformicrobial16srrnaampliconsequencing AT levinevgeni comparingbioinformaticpipelinesformicrobial16srrnaampliconsequencing |