Cargando…
The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies
BACKGROUND: Characterizing microbial communities via next-generation sequencing is subject to a number of pitfalls involving sample processing. The observed community composition can be a severe distortion of the quantities of bacteria actually present in the microbiome, hampering analysis and threa...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4433096/ https://www.ncbi.nlm.nih.gov/pubmed/25880246 http://dx.doi.org/10.1186/s12866-015-0351-6 |
_version_ | 1782371582076780544 |
---|---|
author | Brooks, J Paul Edwards, David J Harwich, Michael D Rivera, Maria C Fettweis, Jennifer M Serrano, Myrna G Reris, Robert A Sheth, Nihar U Huang, Bernice Girerd, Philippe Strauss, Jerome F Jefferson, Kimberly K Buck, Gregory A |
author_facet | Brooks, J Paul Edwards, David J Harwich, Michael D Rivera, Maria C Fettweis, Jennifer M Serrano, Myrna G Reris, Robert A Sheth, Nihar U Huang, Bernice Girerd, Philippe Strauss, Jerome F Jefferson, Kimberly K Buck, Gregory A |
author_sort | Brooks, J Paul |
collection | PubMed |
description | BACKGROUND: Characterizing microbial communities via next-generation sequencing is subject to a number of pitfalls involving sample processing. The observed community composition can be a severe distortion of the quantities of bacteria actually present in the microbiome, hampering analysis and threatening the validity of conclusions from metagenomic studies. We introduce an experimental protocol using mock communities for quantifying and characterizing bias introduced in the sample processing pipeline. We used 80 bacterial mock communities comprised of prescribed proportions of cells from seven vaginally-relevant bacterial strains to assess the bias introduced in the sample processing pipeline. We created two additional sets of 80 mock communities by mixing prescribed quantities of DNA and PCR product to quantify the relative contribution to bias of (1) DNA extraction, (2) PCR amplification, and (3) sequencing and taxonomic classification for particular choices of protocols for each step. We developed models to predict the “true” composition of environmental samples based on the observed proportions, and applied them to a set of clinical vaginal samples from a single subject during four visits. RESULTS: We observed that using different DNA extraction kits can produce dramatically different results but bias is introduced regardless of the choice of kit. We observed error rates from bias of over 85% in some samples, while technical variation was very low at less than 5% for most bacteria. The effects of DNA extraction and PCR amplification for our protocols were much larger than those due to sequencing and classification. The processing steps affected different bacteria in different ways, resulting in amplified and suppressed observed proportions of a community. When predictive models were applied to clinical samples from a subject, the predicted microbiome profiles were better reflections of the physiology and diagnosis of the subject at the visits than the observed community compositions. CONCLUSIONS: Bias in 16S studies due to DNA extraction and PCR amplification will continue to require attention despite further advances in sequencing technology. Analysis of mock communities can help assess bias and facilitate the interpretation of results from environmental samples. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12866-015-0351-6) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4433096 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-44330962015-05-16 The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies Brooks, J Paul Edwards, David J Harwich, Michael D Rivera, Maria C Fettweis, Jennifer M Serrano, Myrna G Reris, Robert A Sheth, Nihar U Huang, Bernice Girerd, Philippe Strauss, Jerome F Jefferson, Kimberly K Buck, Gregory A BMC Microbiol Methodology Article BACKGROUND: Characterizing microbial communities via next-generation sequencing is subject to a number of pitfalls involving sample processing. The observed community composition can be a severe distortion of the quantities of bacteria actually present in the microbiome, hampering analysis and threatening the validity of conclusions from metagenomic studies. We introduce an experimental protocol using mock communities for quantifying and characterizing bias introduced in the sample processing pipeline. We used 80 bacterial mock communities comprised of prescribed proportions of cells from seven vaginally-relevant bacterial strains to assess the bias introduced in the sample processing pipeline. We created two additional sets of 80 mock communities by mixing prescribed quantities of DNA and PCR product to quantify the relative contribution to bias of (1) DNA extraction, (2) PCR amplification, and (3) sequencing and taxonomic classification for particular choices of protocols for each step. We developed models to predict the “true” composition of environmental samples based on the observed proportions, and applied them to a set of clinical vaginal samples from a single subject during four visits. RESULTS: We observed that using different DNA extraction kits can produce dramatically different results but bias is introduced regardless of the choice of kit. We observed error rates from bias of over 85% in some samples, while technical variation was very low at less than 5% for most bacteria. The effects of DNA extraction and PCR amplification for our protocols were much larger than those due to sequencing and classification. The processing steps affected different bacteria in different ways, resulting in amplified and suppressed observed proportions of a community. When predictive models were applied to clinical samples from a subject, the predicted microbiome profiles were better reflections of the physiology and diagnosis of the subject at the visits than the observed community compositions. CONCLUSIONS: Bias in 16S studies due to DNA extraction and PCR amplification will continue to require attention despite further advances in sequencing technology. Analysis of mock communities can help assess bias and facilitate the interpretation of results from environmental samples. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12866-015-0351-6) contains supplementary material, which is available to authorized users. BioMed Central 2015-03-21 /pmc/articles/PMC4433096/ /pubmed/25880246 http://dx.doi.org/10.1186/s12866-015-0351-6 Text en © Brooks et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Brooks, J Paul Edwards, David J Harwich, Michael D Rivera, Maria C Fettweis, Jennifer M Serrano, Myrna G Reris, Robert A Sheth, Nihar U Huang, Bernice Girerd, Philippe Strauss, Jerome F Jefferson, Kimberly K Buck, Gregory A The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies |
title | The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies |
title_full | The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies |
title_fullStr | The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies |
title_full_unstemmed | The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies |
title_short | The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies |
title_sort | truth about metagenomics: quantifying and counteracting bias in 16s rrna studies |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4433096/ https://www.ncbi.nlm.nih.gov/pubmed/25880246 http://dx.doi.org/10.1186/s12866-015-0351-6 |
work_keys_str_mv | AT brooksjpaul thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT edwardsdavidj thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT harwichmichaeld thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT riveramariac thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT fettweisjenniferm thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT serranomyrnag thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT rerisroberta thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT shethniharu thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT huangbernice thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT girerdphilippe thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT straussjeromef thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT jeffersonkimberlyk thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT buckgregorya thetruthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT brooksjpaul truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT edwardsdavidj truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT harwichmichaeld truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT riveramariac truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT fettweisjenniferm truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT serranomyrnag truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT rerisroberta truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT shethniharu truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT huangbernice truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT girerdphilippe truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT straussjeromef truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT jeffersonkimberlyk truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies AT buckgregorya truthaboutmetagenomicsquantifyingandcounteractingbiasin16srrnastudies |