Cargando…

Biological observations in microbiota analysis are robust to the choice of 16S rRNA gene sequencing processing algorithm: case study on human milk microbiota

BACKGROUND: In recent years, the microbiome field has undergone a shift from clustering-based methods of operational taxonomic unit (OTU) designation based on sequence similarity to denoising algorithms that identify exact amplicon sequence variants (ASVs), and methods to identify contaminating bact...

Descripción completa

Detalles Bibliográficos
Autores principales: Moossavi, Shirin, Atakora, Faisal, Fehr, Kelsey, Khafipour, Ehsan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7501722/
https://www.ncbi.nlm.nih.gov/pubmed/32948144
http://dx.doi.org/10.1186/s12866-020-01949-7
_version_ 1783584086845030400
author Moossavi, Shirin
Atakora, Faisal
Fehr, Kelsey
Khafipour, Ehsan
author_facet Moossavi, Shirin
Atakora, Faisal
Fehr, Kelsey
Khafipour, Ehsan
author_sort Moossavi, Shirin
collection PubMed
description BACKGROUND: In recent years, the microbiome field has undergone a shift from clustering-based methods of operational taxonomic unit (OTU) designation based on sequence similarity to denoising algorithms that identify exact amplicon sequence variants (ASVs), and methods to identify contaminating bacterial DNA sequences from low biomass samples have been developed. Although these methods improve accuracy when analyzing mock communities, their impact on real samples and downstream analysis of biological associations is less clear. RESULTS: Here, we re-processed our recently published milk microbiota data using Qiime1 to identify OTUs, and Qiime2 to identify ASVs, with or without contaminant removal using decontam. Qiime2 resolved the mock community more accurately, primarily because Qiime1 failed to detect Lactobacillus. Qiime2 also considerably reduced the average number of ASVs detected in human milk samples (364 ± 145 OTUs vs. 170 ± 73 ASVs, p < 0.001). Compared to the richness, the estimated diversity measures had a similar range using both methods albeit statistically different (inverse Simpson index: 14.3 ± 8.5 vs. 15.6 ± 8.7, p = 0.031) and there was strong consistency and agreement for the relative abundances of the most abundant bacterial taxa, including Staphylococcaceae and Streptococcaceae. One notable exception was Oxalobacteriaceae, which was overrepresented using Qiime1 regardless of contaminant removal. Downstream statistical analyses were not impacted by the choice of algorithm in terms of the direction, strength, and significance of associations of host factors with bacterial diversity and overall community composition. CONCLUSION: Overall, the biological observations and conclusions were robust to the choice of the sequencing processing methods and contaminant removal.
format Online
Article
Text
id pubmed-7501722
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-75017222020-09-22 Biological observations in microbiota analysis are robust to the choice of 16S rRNA gene sequencing processing algorithm: case study on human milk microbiota Moossavi, Shirin Atakora, Faisal Fehr, Kelsey Khafipour, Ehsan BMC Microbiol Research Article BACKGROUND: In recent years, the microbiome field has undergone a shift from clustering-based methods of operational taxonomic unit (OTU) designation based on sequence similarity to denoising algorithms that identify exact amplicon sequence variants (ASVs), and methods to identify contaminating bacterial DNA sequences from low biomass samples have been developed. Although these methods improve accuracy when analyzing mock communities, their impact on real samples and downstream analysis of biological associations is less clear. RESULTS: Here, we re-processed our recently published milk microbiota data using Qiime1 to identify OTUs, and Qiime2 to identify ASVs, with or without contaminant removal using decontam. Qiime2 resolved the mock community more accurately, primarily because Qiime1 failed to detect Lactobacillus. Qiime2 also considerably reduced the average number of ASVs detected in human milk samples (364 ± 145 OTUs vs. 170 ± 73 ASVs, p < 0.001). Compared to the richness, the estimated diversity measures had a similar range using both methods albeit statistically different (inverse Simpson index: 14.3 ± 8.5 vs. 15.6 ± 8.7, p = 0.031) and there was strong consistency and agreement for the relative abundances of the most abundant bacterial taxa, including Staphylococcaceae and Streptococcaceae. One notable exception was Oxalobacteriaceae, which was overrepresented using Qiime1 regardless of contaminant removal. Downstream statistical analyses were not impacted by the choice of algorithm in terms of the direction, strength, and significance of associations of host factors with bacterial diversity and overall community composition. CONCLUSION: Overall, the biological observations and conclusions were robust to the choice of the sequencing processing methods and contaminant removal. BioMed Central 2020-09-18 /pmc/articles/PMC7501722/ /pubmed/32948144 http://dx.doi.org/10.1186/s12866-020-01949-7 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Moossavi, Shirin
Atakora, Faisal
Fehr, Kelsey
Khafipour, Ehsan
Biological observations in microbiota analysis are robust to the choice of 16S rRNA gene sequencing processing algorithm: case study on human milk microbiota
title Biological observations in microbiota analysis are robust to the choice of 16S rRNA gene sequencing processing algorithm: case study on human milk microbiota
title_full Biological observations in microbiota analysis are robust to the choice of 16S rRNA gene sequencing processing algorithm: case study on human milk microbiota
title_fullStr Biological observations in microbiota analysis are robust to the choice of 16S rRNA gene sequencing processing algorithm: case study on human milk microbiota
title_full_unstemmed Biological observations in microbiota analysis are robust to the choice of 16S rRNA gene sequencing processing algorithm: case study on human milk microbiota
title_short Biological observations in microbiota analysis are robust to the choice of 16S rRNA gene sequencing processing algorithm: case study on human milk microbiota
title_sort biological observations in microbiota analysis are robust to the choice of 16s rrna gene sequencing processing algorithm: case study on human milk microbiota
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7501722/
https://www.ncbi.nlm.nih.gov/pubmed/32948144
http://dx.doi.org/10.1186/s12866-020-01949-7
work_keys_str_mv AT moossavishirin biologicalobservationsinmicrobiotaanalysisarerobusttothechoiceof16srrnagenesequencingprocessingalgorithmcasestudyonhumanmilkmicrobiota
AT atakorafaisal biologicalobservationsinmicrobiotaanalysisarerobusttothechoiceof16srrnagenesequencingprocessingalgorithmcasestudyonhumanmilkmicrobiota
AT fehrkelsey biologicalobservationsinmicrobiotaanalysisarerobusttothechoiceof16srrnagenesequencingprocessingalgorithmcasestudyonhumanmilkmicrobiota
AT khafipourehsan biologicalobservationsinmicrobiotaanalysisarerobusttothechoiceof16srrnagenesequencingprocessingalgorithmcasestudyonhumanmilkmicrobiota