Cargando…
Biological observations in microbiota analysis are robust to the choice of 16S rRNA gene sequencing processing algorithm: case study on human milk microbiota
BACKGROUND: In recent years, the microbiome field has undergone a shift from clustering-based methods of operational taxonomic unit (OTU) designation based on sequence similarity to denoising algorithms that identify exact amplicon sequence variants (ASVs), and methods to identify contaminating bact...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7501722/ https://www.ncbi.nlm.nih.gov/pubmed/32948144 http://dx.doi.org/10.1186/s12866-020-01949-7 |
_version_ | 1783584086845030400 |
---|---|
author | Moossavi, Shirin Atakora, Faisal Fehr, Kelsey Khafipour, Ehsan |
author_facet | Moossavi, Shirin Atakora, Faisal Fehr, Kelsey Khafipour, Ehsan |
author_sort | Moossavi, Shirin |
collection | PubMed |
description | BACKGROUND: In recent years, the microbiome field has undergone a shift from clustering-based methods of operational taxonomic unit (OTU) designation based on sequence similarity to denoising algorithms that identify exact amplicon sequence variants (ASVs), and methods to identify contaminating bacterial DNA sequences from low biomass samples have been developed. Although these methods improve accuracy when analyzing mock communities, their impact on real samples and downstream analysis of biological associations is less clear. RESULTS: Here, we re-processed our recently published milk microbiota data using Qiime1 to identify OTUs, and Qiime2 to identify ASVs, with or without contaminant removal using decontam. Qiime2 resolved the mock community more accurately, primarily because Qiime1 failed to detect Lactobacillus. Qiime2 also considerably reduced the average number of ASVs detected in human milk samples (364 ± 145 OTUs vs. 170 ± 73 ASVs, p < 0.001). Compared to the richness, the estimated diversity measures had a similar range using both methods albeit statistically different (inverse Simpson index: 14.3 ± 8.5 vs. 15.6 ± 8.7, p = 0.031) and there was strong consistency and agreement for the relative abundances of the most abundant bacterial taxa, including Staphylococcaceae and Streptococcaceae. One notable exception was Oxalobacteriaceae, which was overrepresented using Qiime1 regardless of contaminant removal. Downstream statistical analyses were not impacted by the choice of algorithm in terms of the direction, strength, and significance of associations of host factors with bacterial diversity and overall community composition. CONCLUSION: Overall, the biological observations and conclusions were robust to the choice of the sequencing processing methods and contaminant removal. |
format | Online Article Text |
id | pubmed-7501722 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-75017222020-09-22 Biological observations in microbiota analysis are robust to the choice of 16S rRNA gene sequencing processing algorithm: case study on human milk microbiota Moossavi, Shirin Atakora, Faisal Fehr, Kelsey Khafipour, Ehsan BMC Microbiol Research Article BACKGROUND: In recent years, the microbiome field has undergone a shift from clustering-based methods of operational taxonomic unit (OTU) designation based on sequence similarity to denoising algorithms that identify exact amplicon sequence variants (ASVs), and methods to identify contaminating bacterial DNA sequences from low biomass samples have been developed. Although these methods improve accuracy when analyzing mock communities, their impact on real samples and downstream analysis of biological associations is less clear. RESULTS: Here, we re-processed our recently published milk microbiota data using Qiime1 to identify OTUs, and Qiime2 to identify ASVs, with or without contaminant removal using decontam. Qiime2 resolved the mock community more accurately, primarily because Qiime1 failed to detect Lactobacillus. Qiime2 also considerably reduced the average number of ASVs detected in human milk samples (364 ± 145 OTUs vs. 170 ± 73 ASVs, p < 0.001). Compared to the richness, the estimated diversity measures had a similar range using both methods albeit statistically different (inverse Simpson index: 14.3 ± 8.5 vs. 15.6 ± 8.7, p = 0.031) and there was strong consistency and agreement for the relative abundances of the most abundant bacterial taxa, including Staphylococcaceae and Streptococcaceae. One notable exception was Oxalobacteriaceae, which was overrepresented using Qiime1 regardless of contaminant removal. Downstream statistical analyses were not impacted by the choice of algorithm in terms of the direction, strength, and significance of associations of host factors with bacterial diversity and overall community composition. CONCLUSION: Overall, the biological observations and conclusions were robust to the choice of the sequencing processing methods and contaminant removal. BioMed Central 2020-09-18 /pmc/articles/PMC7501722/ /pubmed/32948144 http://dx.doi.org/10.1186/s12866-020-01949-7 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Moossavi, Shirin Atakora, Faisal Fehr, Kelsey Khafipour, Ehsan Biological observations in microbiota analysis are robust to the choice of 16S rRNA gene sequencing processing algorithm: case study on human milk microbiota |
title | Biological observations in microbiota analysis are robust to the choice of 16S rRNA gene sequencing processing algorithm: case study on human milk microbiota |
title_full | Biological observations in microbiota analysis are robust to the choice of 16S rRNA gene sequencing processing algorithm: case study on human milk microbiota |
title_fullStr | Biological observations in microbiota analysis are robust to the choice of 16S rRNA gene sequencing processing algorithm: case study on human milk microbiota |
title_full_unstemmed | Biological observations in microbiota analysis are robust to the choice of 16S rRNA gene sequencing processing algorithm: case study on human milk microbiota |
title_short | Biological observations in microbiota analysis are robust to the choice of 16S rRNA gene sequencing processing algorithm: case study on human milk microbiota |
title_sort | biological observations in microbiota analysis are robust to the choice of 16s rrna gene sequencing processing algorithm: case study on human milk microbiota |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7501722/ https://www.ncbi.nlm.nih.gov/pubmed/32948144 http://dx.doi.org/10.1186/s12866-020-01949-7 |
work_keys_str_mv | AT moossavishirin biologicalobservationsinmicrobiotaanalysisarerobusttothechoiceof16srrnagenesequencingprocessingalgorithmcasestudyonhumanmilkmicrobiota AT atakorafaisal biologicalobservationsinmicrobiotaanalysisarerobusttothechoiceof16srrnagenesequencingprocessingalgorithmcasestudyonhumanmilkmicrobiota AT fehrkelsey biologicalobservationsinmicrobiotaanalysisarerobusttothechoiceof16srrnagenesequencingprocessingalgorithmcasestudyonhumanmilkmicrobiota AT khafipourehsan biologicalobservationsinmicrobiotaanalysisarerobusttothechoiceof16srrnagenesequencingprocessingalgorithmcasestudyonhumanmilkmicrobiota |