Cargando…
CopyRighter: a rapid tool for improving the accuracy of microbial community profiles through lineage-specific gene copy number correction
BACKGROUND: Culture-independent molecular surveys targeting conserved marker genes, most notably 16S rRNA, to assess microbial diversity remain semi-quantitative due to variations in the number of gene copies between species. RESULTS: Based on 2,900 sequenced reference genomes, we show that 16S rRNA...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4021573/ https://www.ncbi.nlm.nih.gov/pubmed/24708850 http://dx.doi.org/10.1186/2049-2618-2-11 |
_version_ | 1782316263388741632 |
---|---|
author | Angly, Florent E Dennis, Paul G Skarshewski, Adam Vanwonterghem, Inka Hugenholtz, Philip Tyson, Gene W |
author_facet | Angly, Florent E Dennis, Paul G Skarshewski, Adam Vanwonterghem, Inka Hugenholtz, Philip Tyson, Gene W |
author_sort | Angly, Florent E |
collection | PubMed |
description | BACKGROUND: Culture-independent molecular surveys targeting conserved marker genes, most notably 16S rRNA, to assess microbial diversity remain semi-quantitative due to variations in the number of gene copies between species. RESULTS: Based on 2,900 sequenced reference genomes, we show that 16S rRNA gene copy number (GCN) is strongly linked to microbial phylogenetic taxonomy, potentially under-representing Archaea in amplicon microbial profiles. Using this relationship, we inferred the GCN of all bacterial and archaeal lineages in the Greengenes database within a phylogenetic framework. We created CopyRighter, new software which uses these estimates to correct 16S rRNA amplicon microbial profiles and associated quantitative (q)PCR total abundance. CopyRighter parses microbial profiles and, because GCN estimates are pre-computed for all taxa in the reference taxonomy, rapidly corrects GCN bias. Software validation with in silico and in vitro mock communities indicated that GCN correction results in more accurate estimates of microbial relative abundance and improves the agreement between metagenomic and amplicon profiles. Analyses of human-associated and anaerobic digester microbiomes illustrate that correction makes tangible changes to estimates of qPCR total abundance, α and β diversity, and can significantly change biological interpretation. For example, human gut microbiomes from twins were reclassified into three rather than two enterotypes after GCN correction. CONCLUSIONS: The CopyRighter bioinformatic tools permits rapid correction of GCN in microbial surveys, resulting in improved estimates of microbial abundance, α and β diversity. |
format | Online Article Text |
id | pubmed-4021573 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40215732014-05-28 CopyRighter: a rapid tool for improving the accuracy of microbial community profiles through lineage-specific gene copy number correction Angly, Florent E Dennis, Paul G Skarshewski, Adam Vanwonterghem, Inka Hugenholtz, Philip Tyson, Gene W Microbiome Methodology BACKGROUND: Culture-independent molecular surveys targeting conserved marker genes, most notably 16S rRNA, to assess microbial diversity remain semi-quantitative due to variations in the number of gene copies between species. RESULTS: Based on 2,900 sequenced reference genomes, we show that 16S rRNA gene copy number (GCN) is strongly linked to microbial phylogenetic taxonomy, potentially under-representing Archaea in amplicon microbial profiles. Using this relationship, we inferred the GCN of all bacterial and archaeal lineages in the Greengenes database within a phylogenetic framework. We created CopyRighter, new software which uses these estimates to correct 16S rRNA amplicon microbial profiles and associated quantitative (q)PCR total abundance. CopyRighter parses microbial profiles and, because GCN estimates are pre-computed for all taxa in the reference taxonomy, rapidly corrects GCN bias. Software validation with in silico and in vitro mock communities indicated that GCN correction results in more accurate estimates of microbial relative abundance and improves the agreement between metagenomic and amplicon profiles. Analyses of human-associated and anaerobic digester microbiomes illustrate that correction makes tangible changes to estimates of qPCR total abundance, α and β diversity, and can significantly change biological interpretation. For example, human gut microbiomes from twins were reclassified into three rather than two enterotypes after GCN correction. CONCLUSIONS: The CopyRighter bioinformatic tools permits rapid correction of GCN in microbial surveys, resulting in improved estimates of microbial abundance, α and β diversity. BioMed Central 2014-04-07 /pmc/articles/PMC4021573/ /pubmed/24708850 http://dx.doi.org/10.1186/2049-2618-2-11 Text en Copyright © 2014 Angly et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Angly, Florent E Dennis, Paul G Skarshewski, Adam Vanwonterghem, Inka Hugenholtz, Philip Tyson, Gene W CopyRighter: a rapid tool for improving the accuracy of microbial community profiles through lineage-specific gene copy number correction |
title | CopyRighter: a rapid tool for improving the accuracy of microbial community profiles through lineage-specific gene copy number correction |
title_full | CopyRighter: a rapid tool for improving the accuracy of microbial community profiles through lineage-specific gene copy number correction |
title_fullStr | CopyRighter: a rapid tool for improving the accuracy of microbial community profiles through lineage-specific gene copy number correction |
title_full_unstemmed | CopyRighter: a rapid tool for improving the accuracy of microbial community profiles through lineage-specific gene copy number correction |
title_short | CopyRighter: a rapid tool for improving the accuracy of microbial community profiles through lineage-specific gene copy number correction |
title_sort | copyrighter: a rapid tool for improving the accuracy of microbial community profiles through lineage-specific gene copy number correction |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4021573/ https://www.ncbi.nlm.nih.gov/pubmed/24708850 http://dx.doi.org/10.1186/2049-2618-2-11 |
work_keys_str_mv | AT anglyflorente copyrighterarapidtoolforimprovingtheaccuracyofmicrobialcommunityprofilesthroughlineagespecificgenecopynumbercorrection AT dennispaulg copyrighterarapidtoolforimprovingtheaccuracyofmicrobialcommunityprofilesthroughlineagespecificgenecopynumbercorrection AT skarshewskiadam copyrighterarapidtoolforimprovingtheaccuracyofmicrobialcommunityprofilesthroughlineagespecificgenecopynumbercorrection AT vanwontergheminka copyrighterarapidtoolforimprovingtheaccuracyofmicrobialcommunityprofilesthroughlineagespecificgenecopynumbercorrection AT hugenholtzphilip copyrighterarapidtoolforimprovingtheaccuracyofmicrobialcommunityprofilesthroughlineagespecificgenecopynumbercorrection AT tysongenew copyrighterarapidtoolforimprovingtheaccuracyofmicrobialcommunityprofilesthroughlineagespecificgenecopynumbercorrection |