Cargando…

Managing Contamination and Diverse Bacterial Loads in 16S rRNA Deep Sequencing of Clinical Samples: Implications of the Law of Small Numbers

In this article, we investigate patterns of microbial DNA contamination in targeted 16S rRNA amplicon sequencing (16S deep sequencing) and demonstrate how this can be used to filter background bacterial DNA in diagnostic microbiology. We also investigate the importance of sequencing depth. We first...

Descripción completa

Detalles Bibliográficos
Autores principales: Dyrhovden, Ruben, Rippin, Martin, Øvrebø, Kjell Kåre, Nygaard, Randi M., Ulvestad, Elling, Kommedal, Øyvind
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8262989/
https://www.ncbi.nlm.nih.gov/pubmed/34101489
http://dx.doi.org/10.1128/mBio.00598-21
_version_ 1783719291302969344
author Dyrhovden, Ruben
Rippin, Martin
Øvrebø, Kjell Kåre
Nygaard, Randi M.
Ulvestad, Elling
Kommedal, Øyvind
author_facet Dyrhovden, Ruben
Rippin, Martin
Øvrebø, Kjell Kåre
Nygaard, Randi M.
Ulvestad, Elling
Kommedal, Øyvind
author_sort Dyrhovden, Ruben
collection PubMed
description In this article, we investigate patterns of microbial DNA contamination in targeted 16S rRNA amplicon sequencing (16S deep sequencing) and demonstrate how this can be used to filter background bacterial DNA in diagnostic microbiology. We also investigate the importance of sequencing depth. We first determined the patterns of contamination by performing repeat 16S deep sequencing of negative and positive extraction controls. This process identified a few bacterial species dominating across all replicates but also a high intersample variability among low abundance contaminant species in replicates split before PCR amplification. Replicates split after PCR amplification yielded almost identical sequencing results. On the basis of these observations, we suggest using the abundance of the most dominant contaminant species to define a threshold in each clinical sample from where identifications with lower abundances possibly represent contamination. We evaluated this approach by sequencing of a diluted, staggered mock community and of bile samples from 41 patients with acute cholangitis and noninfectious bile duct stenosis. All clinical samples were sequenced twice using different sequencing depths. We were able to demonstrate the following: (i) The high intersample variability between sequencing replicates is caused by events occurring before or during the PCR amplification step. (ii) Knowledge about the most dominant contaminant species can be used to establish sample-specific cutoffs for reliable identifications. (iii) Below the level of the most abundant contaminant, it rapidly becomes very demanding to reliably discriminate between background and true findings. (iv) Adequate sequencing depth can be claimed only when the analysis also picks up background contamination.
format Online
Article
Text
id pubmed-8262989
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-82629892021-07-23 Managing Contamination and Diverse Bacterial Loads in 16S rRNA Deep Sequencing of Clinical Samples: Implications of the Law of Small Numbers Dyrhovden, Ruben Rippin, Martin Øvrebø, Kjell Kåre Nygaard, Randi M. Ulvestad, Elling Kommedal, Øyvind mBio Research Article In this article, we investigate patterns of microbial DNA contamination in targeted 16S rRNA amplicon sequencing (16S deep sequencing) and demonstrate how this can be used to filter background bacterial DNA in diagnostic microbiology. We also investigate the importance of sequencing depth. We first determined the patterns of contamination by performing repeat 16S deep sequencing of negative and positive extraction controls. This process identified a few bacterial species dominating across all replicates but also a high intersample variability among low abundance contaminant species in replicates split before PCR amplification. Replicates split after PCR amplification yielded almost identical sequencing results. On the basis of these observations, we suggest using the abundance of the most dominant contaminant species to define a threshold in each clinical sample from where identifications with lower abundances possibly represent contamination. We evaluated this approach by sequencing of a diluted, staggered mock community and of bile samples from 41 patients with acute cholangitis and noninfectious bile duct stenosis. All clinical samples were sequenced twice using different sequencing depths. We were able to demonstrate the following: (i) The high intersample variability between sequencing replicates is caused by events occurring before or during the PCR amplification step. (ii) Knowledge about the most dominant contaminant species can be used to establish sample-specific cutoffs for reliable identifications. (iii) Below the level of the most abundant contaminant, it rapidly becomes very demanding to reliably discriminate between background and true findings. (iv) Adequate sequencing depth can be claimed only when the analysis also picks up background contamination. American Society for Microbiology 2021-06-08 /pmc/articles/PMC8262989/ /pubmed/34101489 http://dx.doi.org/10.1128/mBio.00598-21 Text en Copyright © 2021 Dyrhovden et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Dyrhovden, Ruben
Rippin, Martin
Øvrebø, Kjell Kåre
Nygaard, Randi M.
Ulvestad, Elling
Kommedal, Øyvind
Managing Contamination and Diverse Bacterial Loads in 16S rRNA Deep Sequencing of Clinical Samples: Implications of the Law of Small Numbers
title Managing Contamination and Diverse Bacterial Loads in 16S rRNA Deep Sequencing of Clinical Samples: Implications of the Law of Small Numbers
title_full Managing Contamination and Diverse Bacterial Loads in 16S rRNA Deep Sequencing of Clinical Samples: Implications of the Law of Small Numbers
title_fullStr Managing Contamination and Diverse Bacterial Loads in 16S rRNA Deep Sequencing of Clinical Samples: Implications of the Law of Small Numbers
title_full_unstemmed Managing Contamination and Diverse Bacterial Loads in 16S rRNA Deep Sequencing of Clinical Samples: Implications of the Law of Small Numbers
title_short Managing Contamination and Diverse Bacterial Loads in 16S rRNA Deep Sequencing of Clinical Samples: Implications of the Law of Small Numbers
title_sort managing contamination and diverse bacterial loads in 16s rrna deep sequencing of clinical samples: implications of the law of small numbers
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8262989/
https://www.ncbi.nlm.nih.gov/pubmed/34101489
http://dx.doi.org/10.1128/mBio.00598-21
work_keys_str_mv AT dyrhovdenruben managingcontaminationanddiversebacterialloadsin16srrnadeepsequencingofclinicalsamplesimplicationsofthelawofsmallnumbers
AT rippinmartin managingcontaminationanddiversebacterialloadsin16srrnadeepsequencingofclinicalsamplesimplicationsofthelawofsmallnumbers
AT øvrebøkjellkare managingcontaminationanddiversebacterialloadsin16srrnadeepsequencingofclinicalsamplesimplicationsofthelawofsmallnumbers
AT nygaardrandim managingcontaminationanddiversebacterialloadsin16srrnadeepsequencingofclinicalsamplesimplicationsofthelawofsmallnumbers
AT ulvestadelling managingcontaminationanddiversebacterialloadsin16srrnadeepsequencingofclinicalsamplesimplicationsofthelawofsmallnumbers
AT kommedaløyvind managingcontaminationanddiversebacterialloadsin16srrnadeepsequencingofclinicalsamplesimplicationsofthelawofsmallnumbers