Cargando…

The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies

BACKGROUND: Characterizing microbial communities via next-generation sequencing is subject to a number of pitfalls involving sample processing. The observed community composition can be a severe distortion of the quantities of bacteria actually present in the microbiome, hampering analysis and threa...

Descripción completa

Detalles Bibliográficos
Autores principales: Brooks, J Paul, Edwards, David J, Harwich, Michael D, Rivera, Maria C, Fettweis, Jennifer M, Serrano, Myrna G, Reris, Robert A, Sheth, Nihar U, Huang, Bernice, Girerd, Philippe, Strauss, Jerome F, Jefferson, Kimberly K, Buck, Gregory A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4433096/
https://www.ncbi.nlm.nih.gov/pubmed/25880246
http://dx.doi.org/10.1186/s12866-015-0351-6
_version_ 1782371582076780544
author Brooks, J Paul
Edwards, David J
Harwich, Michael D
Rivera, Maria C
Fettweis, Jennifer M
Serrano, Myrna G
Reris, Robert A
Sheth, Nihar U
Huang, Bernice
Girerd, Philippe
Strauss, Jerome F
Jefferson, Kimberly K
Buck, Gregory A
author_facet Brooks, J Paul
Edwards, David J
Harwich, Michael D
Rivera, Maria C
Fettweis, Jennifer M
Serrano, Myrna G
Reris, Robert A
Sheth, Nihar U
Huang, Bernice
Girerd, Philippe
Strauss, Jerome F
Jefferson, Kimberly K
Buck, Gregory A
author_sort Brooks, J Paul
collection PubMed
description BACKGROUND: Characterizing microbial communities via next-generation sequencing is subject to a number of pitfalls involving sample processing. The observed community composition can be a severe distortion of the quantities of bacteria actually present in the microbiome, hampering analysis and threatening the validity of conclusions from metagenomic studies. We introduce an experimental protocol using mock communities for quantifying and characterizing bias introduced in the sample processing pipeline. We used 80 bacterial mock communities comprised of prescribed proportions of cells from seven vaginally-relevant bacterial strains to assess the bias introduced in the sample processing pipeline. We created two additional sets of 80 mock communities by mixing prescribed quantities of DNA and PCR product to quantify the relative contribution to bias of (1) DNA extraction, (2) PCR amplification, and (3) sequencing and taxonomic classification for particular choices of protocols for each step. We developed models to predict the “true” composition of environmental samples based on the observed proportions, and applied them to a set of clinical vaginal samples from a single subject during four visits. RESULTS: We observed that using different DNA extraction kits can produce dramatically different results but bias is introduced regardless of the choice of kit. We observed error rates from bias of over 85% in some samples, while technical variation was very low at less than 5% for most bacteria. The effects of DNA extraction and PCR amplification for our protocols were much larger than those due to sequencing and classification. The processing steps affected different bacteria in different ways, resulting in amplified and suppressed observed proportions of a community. When predictive models were applied to clinical samples from a subject, the predicted microbiome profiles were better reflections of the physiology and diagnosis of the subject at the visits than the observed community compositions. CONCLUSIONS: Bias in 16S studies due to DNA extraction and PCR amplification will continue to require attention despite further advances in sequencing technology. Analysis of mock communities can help assess bias and facilitate the interpretation of results from environmental samples. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12866-015-0351-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4433096
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44330962015-05-16 The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies Brooks, J Paul Edwards, David J Harwich, Michael D Rivera, Maria C Fettweis, Jennifer M Serrano, Myrna G Reris, Robert A Sheth, Nihar U Huang, Bernice Girerd, Philippe Strauss, Jerome F Jefferson, Kimberly K Buck, Gregory A BMC Microbiol Methodology Article BACKGROUND: Characterizing microbial communities via next-generation sequencing is subject to a number of pitfalls involving sample processing. The observed community composition can be a severe distortion of the quantities of bacteria actually present in the microbiome, hampering analysis and threatening the validity of conclusions from metagenomic studies. We introduce an experimental protocol using mock communities for quantifying and characterizing bias introduced in the sample processing pipeline. We used 80 bacterial mock communities comprised of prescribed proportions of cells from seven vaginally-relevant bacterial strains to assess the bias introduced in the sample processing pipeline. We created two additional sets of 80 mock communities by mixing prescribed quantities of DNA and PCR product to quantify the relative contribution to bias of (1) DNA extraction, (2) PCR amplification, and (3) sequencing and taxonomic classification for particular choices of protocols for each step. We developed models to predict the “true” composition of environmental samples based on the observed proportions, and applied them to a set of clinical vaginal samples from a single subject during four visits. RESULTS: We observed that using different DNA extraction kits can produce dramatically different results but bias is introduced regardless of the choice of kit. We observed error rates from bias of over 85% in some samples, while technical variation was very low at less than 5% for most bacteria. The effects of DNA extraction and PCR amplification for our protocols were much larger than those due to sequencing and classification. The processing steps affected different bacteria in different ways, resulting in amplified and suppressed observed proportions of a community. When predictive models were applied to clinical samples from a subject, the predicted microbiome profiles were better reflections of the physiology and diagnosis of the subject at the visits than the observed community compositions. CONCLUSIONS: Bias in 16S studies due to DNA extraction and PCR amplification will continue to require attention despite further advances in sequencing technology. Analysis of mock communities can help assess bias and facilitate the interpretation of results from environmental samples. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12866-015-0351-6) contains supplementary material, which is available to authorized users. BioMed Central 2015-03-21 /pmc/articles/PMC4433096/ /pubmed/25880246 http://dx.doi.org/10.1186/s12866-015-0351-6 Text en © Brooks et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Brooks, J Paul
Edwards, David J
Harwich, Michael D
Rivera, Maria C
Fettweis, Jennifer M
Serrano, Myrna G
Reris, Robert A
Sheth, Nihar U
Huang, Bernice
Girerd, Philippe
Strauss, Jerome F
Jefferson, Kimberly K
Buck, Gregory A
The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies
title The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies
title_full The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies
title_fullStr The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies
title_full_unstemmed The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies
title_short The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies
title_sort truth about metagenomics: quantifying and counteracting bias in 16s rrna studies
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4433096/
https://www.ncbi.nlm.nih.gov/pubmed/25880246
http://dx.doi.org/10.1186/s12866-015-0351-6
work_keys_str_mv AT brooksjpaul thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT edwardsdavidj thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT harwichmichaeld thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT riveramariac thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT fettweisjenniferm thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT serranomyrnag thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT rerisroberta thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT shethniharu thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT huangbernice thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT girerdphilippe thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT straussjeromef thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT jeffersonkimberlyk thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT buckgregorya thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT brooksjpaul truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT edwardsdavidj truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT harwichmichaeld truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT riveramariac truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT fettweisjenniferm truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT serranomyrnag truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT rerisroberta truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT shethniharu truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT huangbernice truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT girerdphilippe truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT straussjeromef truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT jeffersonkimberlyk truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies
AT buckgregorya truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies