
CopyRighter: a rapid tool for improving the accuracy of microbial community profiles through lineage-specific gene copy number correction

BACKGROUND: Culture-independent molecular surveys targeting conserved marker genes, most notably 16S rRNA, to assess microbial diversity remain semi-quantitative due to variations in the number of gene copies between species. RESULTS: Based on 2,900 sequenced reference genomes, we show that 16S rRNA...


Bibliographic Details
Main authors: Angly, Florent E, Dennis, Paul G, Skarshewski, Adam, Vanwonterghem, Inka, Hugenholtz, Philip, Tyson, Gene W
Format: Online Article Text
Language: English
Published: BioMed Central 2014
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4021573/
https://www.ncbi.nlm.nih.gov/pubmed/24708850
http://dx.doi.org/10.1186/2049-2618-2-11
_version_ 1782316263388741632
author Angly, Florent E
Dennis, Paul G
Skarshewski, Adam
Vanwonterghem, Inka
Hugenholtz, Philip
Tyson, Gene W
collection PubMed
description BACKGROUND: Culture-independent molecular surveys targeting conserved marker genes, most notably 16S rRNA, to assess microbial diversity remain semi-quantitative due to variations in the number of gene copies between species. RESULTS: Based on 2,900 sequenced reference genomes, we show that 16S rRNA gene copy number (GCN) is strongly linked to microbial phylogenetic taxonomy, potentially under-representing Archaea in amplicon microbial profiles. Using this relationship, we inferred the GCN of all bacterial and archaeal lineages in the Greengenes database within a phylogenetic framework. We created CopyRighter, new software which uses these estimates to correct 16S rRNA amplicon microbial profiles and associated quantitative (q)PCR total abundance. CopyRighter parses microbial profiles and, because GCN estimates are pre-computed for all taxa in the reference taxonomy, rapidly corrects GCN bias. Software validation with in silico and in vitro mock communities indicated that GCN correction results in more accurate estimates of microbial relative abundance and improves the agreement between metagenomic and amplicon profiles. Analyses of human-associated and anaerobic digester microbiomes illustrate that correction makes tangible changes to estimates of qPCR total abundance, α and β diversity, and can significantly change biological interpretation. For example, human gut microbiomes from twins were reclassified into three rather than two enterotypes after GCN correction. CONCLUSIONS: The CopyRighter bioinformatic tool permits rapid correction of GCN in microbial surveys, resulting in improved estimates of microbial abundance, α and β diversity.
format Online
Article
Text
id pubmed-4021573
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4021573 2014-05-28 CopyRighter: a rapid tool for improving the accuracy of microbial community profiles through lineage-specific gene copy number correction Angly, Florent E Dennis, Paul G Skarshewski, Adam Vanwonterghem, Inka Hugenholtz, Philip Tyson, Gene W Microbiome Methodology BACKGROUND: Culture-independent molecular surveys targeting conserved marker genes, most notably 16S rRNA, to assess microbial diversity remain semi-quantitative due to variations in the number of gene copies between species. RESULTS: Based on 2,900 sequenced reference genomes, we show that 16S rRNA gene copy number (GCN) is strongly linked to microbial phylogenetic taxonomy, potentially under-representing Archaea in amplicon microbial profiles. Using this relationship, we inferred the GCN of all bacterial and archaeal lineages in the Greengenes database within a phylogenetic framework. We created CopyRighter, new software which uses these estimates to correct 16S rRNA amplicon microbial profiles and associated quantitative (q)PCR total abundance. CopyRighter parses microbial profiles and, because GCN estimates are pre-computed for all taxa in the reference taxonomy, rapidly corrects GCN bias. Software validation with in silico and in vitro mock communities indicated that GCN correction results in more accurate estimates of microbial relative abundance and improves the agreement between metagenomic and amplicon profiles. Analyses of human-associated and anaerobic digester microbiomes illustrate that correction makes tangible changes to estimates of qPCR total abundance, α and β diversity, and can significantly change biological interpretation. For example, human gut microbiomes from twins were reclassified into three rather than two enterotypes after GCN correction. CONCLUSIONS: The CopyRighter bioinformatic tool permits rapid correction of GCN in microbial surveys, resulting in improved estimates of microbial abundance, α and β diversity.
BioMed Central 2014-04-07 /pmc/articles/PMC4021573/ /pubmed/24708850 http://dx.doi.org/10.1186/2049-2618-2-11 Text en Copyright © 2014 Angly et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title CopyRighter: a rapid tool for improving the accuracy of microbial community profiles through lineage-specific gene copy number correction
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4021573/
https://www.ncbi.nlm.nih.gov/pubmed/24708850
http://dx.doi.org/10.1186/2049-2618-2-11
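
The abstract in this record describes correcting 16S rRNA amplicon profiles, and the associated qPCR total abundance, for gene copy number (GCN). The arithmetic behind that idea can be sketched as follows. This is a minimal illustration under stated assumptions, not CopyRighter's actual code or interface: the function names, taxon names, and GCN values are hypothetical, and per-taxon GCN estimates are assumed to be already known.

```python
from typing import Dict


def correct_relative_abundance(
    observed: Dict[str, float], gcn: Dict[str, float]
) -> Dict[str, float]:
    """Divide each taxon's amplicon share by its 16S copy number, then renormalize.

    Assumption: amplicon share is proportional to (cell count x copy number),
    so dividing by copy number recovers a quantity proportional to cell count.
    """
    weighted = {taxon: observed[taxon] / gcn[taxon] for taxon in observed}
    total = sum(weighted.values())
    return {taxon: value / total for taxon, value in weighted.items()}


def correct_total_abundance(
    total_16s_copies: float, corrected: Dict[str, float], gcn: Dict[str, float]
) -> float:
    """Convert a qPCR 16S copy count into an estimated cell count.

    The community mean copy number is the GCN of each taxon weighted by its
    corrected (cell-based) relative abundance.
    """
    mean_gcn = sum(corrected[taxon] * gcn[taxon] for taxon in corrected)
    return total_16s_copies / mean_gcn


# Hypothetical two-taxon community: equal cell numbers, unequal copy numbers.
example_gcn = {"taxon_a": 1.0, "taxon_b": 4.0}
example_observed = {"taxon_a": 0.2, "taxon_b": 0.8}  # amplicon shares

example_corrected = correct_relative_abundance(example_observed, example_gcn)
example_cells = correct_total_abundance(1000.0, example_corrected, example_gcn)
```

In this made-up example the taxon with four copies per genome accounts for 80% of amplicons but only half of the cells, so the corrected shares come out equal and the 1000 measured gene copies correspond to roughly 400 cells.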