Cargando…
Optimizing 16S rRNA gene profile analysis from low biomass nasopharyngeal and induced sputum specimens
BACKGROUND: Careful consideration of experimental artefacts is required in order to successfully apply high-throughput 16S ribosomal ribonucleic acid (rRNA) gene sequencing technology. Here we introduce experimental design, quality control and “denoising” approaches for sequencing low biomass specim...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7218582/ https://www.ncbi.nlm.nih.gov/pubmed/32397992 http://dx.doi.org/10.1186/s12866-020-01795-7 |
_version_ | 1783532828572516352 |
---|---|
author | Claassen-Weitz, Shantelle Gardner-Lubbe, Sugnet Mwaikono, Kilaza S. du Toit, Elloise Zar, Heather J. Nicol, Mark P. |
author_facet | Claassen-Weitz, Shantelle Gardner-Lubbe, Sugnet Mwaikono, Kilaza S. du Toit, Elloise Zar, Heather J. Nicol, Mark P. |
author_sort | Claassen-Weitz, Shantelle |
collection | PubMed |
description | BACKGROUND: Careful consideration of experimental artefacts is required in order to successfully apply high-throughput 16S ribosomal ribonucleic acid (rRNA) gene sequencing technology. Here we introduce experimental design, quality control and “denoising” approaches for sequencing low biomass specimens. RESULTS: We found that bacterial biomass is a key driver of 16S rRNA gene sequencing profiles generated from bacterial mock communities and that the use of different deoxyribonucleic acid (DNA) extraction methods [DSP Virus/Pathogen Mini Kit® (Kit-QS) and ZymoBIOMICS DNA Miniprep Kit (Kit-ZB)] and storage buffers [PrimeStore® Molecular Transport medium (Primestore) and Skim-milk, Tryptone, Glucose and Glycerol (STGG)] further influence these profiles. Kit-QS better represented hard-to-lyse bacteria from bacterial mock communities compared to Kit-ZB. Primestore storage buffer yielded lower levels of background operational taxonomic units (OTUs) from low biomass bacterial mock community controls compared to STGG. In addition to bacterial mock community controls, we used technical repeats (nasopharyngeal and induced sputum processed in duplicate, triplicate or quadruplicate) to further evaluate the effect of specimen biomass and participant age at specimen collection on resultant sequencing profiles. We observed a positive correlation (r = 0.16) between specimen biomass and participant age at specimen collection: low biomass technical repeats (represented by < 500 16S rRNA gene copies/μl) were primarily collected at < 14 days of age. We found that low biomass technical repeats also produced higher alpha diversities (r = − 0.28); 16S rRNA gene profiles similar to no template controls (Primestore); and reduced sequencing reproducibility. Finally, we show that the use of statistical tools for in silico contaminant identification, as implemented through the decontam package in R, provides better representations of indigenous bacteria following decontamination. CONCLUSIONS: We provide insight into experimental design, quality control steps and “denoising” approaches for 16S rRNA gene high-throughput sequencing of low biomass specimens. We highlight the need for careful assessment of DNA extraction methods and storage buffers; sequence quality and reproducibility; and in silico identification of contaminant profiles in order to avoid spurious results. |
format | Online Article Text |
id | pubmed-7218582 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-72185822020-05-18 Optimizing 16S rRNA gene profile analysis from low biomass nasopharyngeal and induced sputum specimens Claassen-Weitz, Shantelle Gardner-Lubbe, Sugnet Mwaikono, Kilaza S. du Toit, Elloise Zar, Heather J. Nicol, Mark P. BMC Microbiol Methodology Article BACKGROUND: Careful consideration of experimental artefacts is required in order to successfully apply high-throughput 16S ribosomal ribonucleic acid (rRNA) gene sequencing technology. Here we introduce experimental design, quality control and “denoising” approaches for sequencing low biomass specimens. RESULTS: We found that bacterial biomass is a key driver of 16S rRNA gene sequencing profiles generated from bacterial mock communities and that the use of different deoxyribonucleic acid (DNA) extraction methods [DSP Virus/Pathogen Mini Kit® (Kit-QS) and ZymoBIOMICS DNA Miniprep Kit (Kit-ZB)] and storage buffers [PrimeStore® Molecular Transport medium (Primestore) and Skim-milk, Tryptone, Glucose and Glycerol (STGG)] further influence these profiles. Kit-QS better represented hard-to-lyse bacteria from bacterial mock communities compared to Kit-ZB. Primestore storage buffer yielded lower levels of background operational taxonomic units (OTUs) from low biomass bacterial mock community controls compared to STGG. In addition to bacterial mock community controls, we used technical repeats (nasopharyngeal and induced sputum processed in duplicate, triplicate or quadruplicate) to further evaluate the effect of specimen biomass and participant age at specimen collection on resultant sequencing profiles. We observed a positive correlation (r = 0.16) between specimen biomass and participant age at specimen collection: low biomass technical repeats (represented by < 500 16S rRNA gene copies/μl) were primarily collected at < 14 days of age. We found that low biomass technical repeats also produced higher alpha diversities (r = − 0.28); 16S rRNA gene profiles similar to no template controls (Primestore); and reduced sequencing reproducibility. Finally, we show that the use of statistical tools for in silico contaminant identification, as implemented through the decontam package in R, provides better representations of indigenous bacteria following decontamination. CONCLUSIONS: We provide insight into experimental design, quality control steps and “denoising” approaches for 16S rRNA gene high-throughput sequencing of low biomass specimens. We highlight the need for careful assessment of DNA extraction methods and storage buffers; sequence quality and reproducibility; and in silico identification of contaminant profiles in order to avoid spurious results. BioMed Central 2020-05-12 /pmc/articles/PMC7218582/ /pubmed/32397992 http://dx.doi.org/10.1186/s12866-020-01795-7 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Article Claassen-Weitz, Shantelle Gardner-Lubbe, Sugnet Mwaikono, Kilaza S. du Toit, Elloise Zar, Heather J. Nicol, Mark P. Optimizing 16S rRNA gene profile analysis from low biomass nasopharyngeal and induced sputum specimens |
title | Optimizing 16S rRNA gene profile analysis from low biomass nasopharyngeal and induced sputum specimens |
title_full | Optimizing 16S rRNA gene profile analysis from low biomass nasopharyngeal and induced sputum specimens |
title_fullStr | Optimizing 16S rRNA gene profile analysis from low biomass nasopharyngeal and induced sputum specimens |
title_full_unstemmed | Optimizing 16S rRNA gene profile analysis from low biomass nasopharyngeal and induced sputum specimens |
title_short | Optimizing 16S rRNA gene profile analysis from low biomass nasopharyngeal and induced sputum specimens |
title_sort | optimizing 16s rrna gene profile analysis from low biomass nasopharyngeal and induced sputum specimens |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7218582/ https://www.ncbi.nlm.nih.gov/pubmed/32397992 http://dx.doi.org/10.1186/s12866-020-01795-7 |
work_keys_str_mv | AT claassenweitzshantelle optimizing16srrnageneprofileanalysisfromlowbiomassnasopharyngealandinducedsputumspecimens AT gardnerlubbesugnet optimizing16srrnageneprofileanalysisfromlowbiomassnasopharyngealandinducedsputumspecimens AT mwaikonokilazas optimizing16srrnageneprofileanalysisfromlowbiomassnasopharyngealandinducedsputumspecimens AT dutoitelloise optimizing16srrnageneprofileanalysisfromlowbiomassnasopharyngealandinducedsputumspecimens AT zarheatherj optimizing16srrnageneprofileanalysisfromlowbiomassnasopharyngealandinducedsputumspecimens AT nicolmarkp optimizing16srrnageneprofileanalysisfromlowbiomassnasopharyngealandinducedsputumspecimens |