Cargando…

Optimizing 16S rRNA gene profile analysis from low biomass nasopharyngeal and induced sputum specimens

BACKGROUND: Careful consideration of experimental artefacts is required in order to successfully apply high-throughput 16S ribosomal ribonucleic acid (rRNA) gene sequencing technology. Here we introduce experimental design, quality control and “denoising” approaches for sequencing low biomass specim...

Descripción completa

Detalles Bibliográficos
Autores principales: Claassen-Weitz, Shantelle, Gardner-Lubbe, Sugnet, Mwaikono, Kilaza S., du Toit, Elloise, Zar, Heather J., Nicol, Mark P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7218582/
https://www.ncbi.nlm.nih.gov/pubmed/32397992
http://dx.doi.org/10.1186/s12866-020-01795-7
_version_ 1783532828572516352
author Claassen-Weitz, Shantelle
Gardner-Lubbe, Sugnet
Mwaikono, Kilaza S.
du Toit, Elloise
Zar, Heather J.
Nicol, Mark P.
author_facet Claassen-Weitz, Shantelle
Gardner-Lubbe, Sugnet
Mwaikono, Kilaza S.
du Toit, Elloise
Zar, Heather J.
Nicol, Mark P.
author_sort Claassen-Weitz, Shantelle
collection PubMed
description BACKGROUND: Careful consideration of experimental artefacts is required in order to successfully apply high-throughput 16S ribosomal ribonucleic acid (rRNA) gene sequencing technology. Here we introduce experimental design, quality control and “denoising” approaches for sequencing low biomass specimens. RESULTS: We found that bacterial biomass is a key driver of 16S rRNA gene sequencing profiles generated from bacterial mock communities and that the use of different deoxyribonucleic acid (DNA) extraction methods [DSP Virus/Pathogen Mini Kit® (Kit-QS) and ZymoBIOMICS DNA Miniprep Kit (Kit-ZB)] and storage buffers [PrimeStore® Molecular Transport medium (Primestore) and Skim-milk, Tryptone, Glucose and Glycerol (STGG)] further influence these profiles. Kit-QS better represented hard-to-lyse bacteria from bacterial mock communities compared to Kit-ZB. Primestore storage buffer yielded lower levels of background operational taxonomic units (OTUs) from low biomass bacterial mock community controls compared to STGG. In addition to bacterial mock community controls, we used technical repeats (nasopharyngeal and induced sputum processed in duplicate, triplicate or quadruplicate) to further evaluate the effect of specimen biomass and participant age at specimen collection on resultant sequencing profiles. We observed a positive correlation (r = 0.16) between specimen biomass and participant age at specimen collection: low biomass technical repeats (represented by < 500 16S rRNA gene copies/μl) were primarily collected at < 14 days of age. We found that low biomass technical repeats also produced higher alpha diversities (r = − 0.28); 16S rRNA gene profiles similar to no template controls (Primestore); and reduced sequencing reproducibility. Finally, we show that the use of statistical tools for in silico contaminant identification, as implemented through the decontam package in R, provides better representations of indigenous bacteria following decontamination. CONCLUSIONS: We provide insight into experimental design, quality control steps and “denoising” approaches for 16S rRNA gene high-throughput sequencing of low biomass specimens. We highlight the need for careful assessment of DNA extraction methods and storage buffers; sequence quality and reproducibility; and in silico identification of contaminant profiles in order to avoid spurious results.
format Online
Article
Text
id pubmed-7218582
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-72185822020-05-18 Optimizing 16S rRNA gene profile analysis from low biomass nasopharyngeal and induced sputum specimens Claassen-Weitz, Shantelle Gardner-Lubbe, Sugnet Mwaikono, Kilaza S. du Toit, Elloise Zar, Heather J. Nicol, Mark P. BMC Microbiol Methodology Article BACKGROUND: Careful consideration of experimental artefacts is required in order to successfully apply high-throughput 16S ribosomal ribonucleic acid (rRNA) gene sequencing technology. Here we introduce experimental design, quality control and “denoising” approaches for sequencing low biomass specimens. RESULTS: We found that bacterial biomass is a key driver of 16S rRNA gene sequencing profiles generated from bacterial mock communities and that the use of different deoxyribonucleic acid (DNA) extraction methods [DSP Virus/Pathogen Mini Kit® (Kit-QS) and ZymoBIOMICS DNA Miniprep Kit (Kit-ZB)] and storage buffers [PrimeStore® Molecular Transport medium (Primestore) and Skim-milk, Tryptone, Glucose and Glycerol (STGG)] further influence these profiles. Kit-QS better represented hard-to-lyse bacteria from bacterial mock communities compared to Kit-ZB. Primestore storage buffer yielded lower levels of background operational taxonomic units (OTUs) from low biomass bacterial mock community controls compared to STGG. In addition to bacterial mock community controls, we used technical repeats (nasopharyngeal and induced sputum processed in duplicate, triplicate or quadruplicate) to further evaluate the effect of specimen biomass and participant age at specimen collection on resultant sequencing profiles. We observed a positive correlation (r = 0.16) between specimen biomass and participant age at specimen collection: low biomass technical repeats (represented by < 500 16S rRNA gene copies/μl) were primarily collected at < 14 days of age. We found that low biomass technical repeats also produced higher alpha diversities (r = − 0.28); 16S rRNA gene profiles similar to no template controls (Primestore); and reduced sequencing reproducibility. Finally, we show that the use of statistical tools for in silico contaminant identification, as implemented through the decontam package in R, provides better representations of indigenous bacteria following decontamination. CONCLUSIONS: We provide insight into experimental design, quality control steps and “denoising” approaches for 16S rRNA gene high-throughput sequencing of low biomass specimens. We highlight the need for careful assessment of DNA extraction methods and storage buffers; sequence quality and reproducibility; and in silico identification of contaminant profiles in order to avoid spurious results. BioMed Central 2020-05-12 /pmc/articles/PMC7218582/ /pubmed/32397992 http://dx.doi.org/10.1186/s12866-020-01795-7 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Claassen-Weitz, Shantelle
Gardner-Lubbe, Sugnet
Mwaikono, Kilaza S.
du Toit, Elloise
Zar, Heather J.
Nicol, Mark P.
Optimizing 16S rRNA gene profile analysis from low biomass nasopharyngeal and induced sputum specimens
title Optimizing 16S rRNA gene profile analysis from low biomass nasopharyngeal and induced sputum specimens
title_full Optimizing 16S rRNA gene profile analysis from low biomass nasopharyngeal and induced sputum specimens
title_fullStr Optimizing 16S rRNA gene profile analysis from low biomass nasopharyngeal and induced sputum specimens
title_full_unstemmed Optimizing 16S rRNA gene profile analysis from low biomass nasopharyngeal and induced sputum specimens
title_short Optimizing 16S rRNA gene profile analysis from low biomass nasopharyngeal and induced sputum specimens
title_sort optimizing 16s rrna gene profile analysis from low biomass nasopharyngeal and induced sputum specimens
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7218582/
https://www.ncbi.nlm.nih.gov/pubmed/32397992
http://dx.doi.org/10.1186/s12866-020-01795-7
work_keys_str_mv AT claassenweitzshantelle optimizing16srrnageneprofileanalysisfromlowbiomassnasopharyngealandinducedsputumspecimens
AT gardnerlubbesugnet optimizing16srrnageneprofileanalysisfromlowbiomassnasopharyngealandinducedsputumspecimens
AT mwaikonokilazas optimizing16srrnageneprofileanalysisfromlowbiomassnasopharyngealandinducedsputumspecimens
AT dutoitelloise optimizing16srrnageneprofileanalysisfromlowbiomassnasopharyngealandinducedsputumspecimens
AT zarheatherj optimizing16srrnageneprofileanalysisfromlowbiomassnasopharyngealandinducedsputumspecimens
AT nicolmarkp optimizing16srrnageneprofileanalysisfromlowbiomassnasopharyngealandinducedsputumspecimens