Cargando…

Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences

BACKGROUND: A major goal of metagenomics is to characterize the microbial composition of an environment. The most popular approach relies on 16S rRNA sequencing, however this approach can generate biased estimates due to differences in the copy number of the gene between even closely related organis...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Bo, Gibbons, Theodore, Ghodsi, Mohammad, Treangen, Todd, Pop, Mihai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3194235/
https://www.ncbi.nlm.nih.gov/pubmed/21989143
http://dx.doi.org/10.1186/1471-2164-12-S2-S4
_version_ 1782213934635286528
author Liu, Bo
Gibbons, Theodore
Ghodsi, Mohammad
Treangen, Todd
Pop, Mihai
author_facet Liu, Bo
Gibbons, Theodore
Ghodsi, Mohammad
Treangen, Todd
Pop, Mihai
author_sort Liu, Bo
collection PubMed
description BACKGROUND: A major goal of metagenomics is to characterize the microbial composition of an environment. The most popular approach relies on 16S rRNA sequencing, however this approach can generate biased estimates due to differences in the copy number of the gene between even closely related organisms, and due to PCR artifacts. The taxonomic composition can also be determined from metagenomic shotgun sequencing data by matching individual reads against a database of reference sequences. One major limitation of prior computational methods used for this purpose is the use of a universal classification threshold for all genes at all taxonomic levels. RESULTS: We propose that better classification results can be obtained by tuning the taxonomic classifier to each matching length, reference gene, and taxonomic level. We present a novel taxonomic classifier MetaPhyler (http://metaphyler.cbcb.umd.edu), which uses phylogenetic marker genes as a taxonomic reference. Results on simulated datasets demonstrate that MetaPhyler outperforms other tools commonly used in this context (CARMA, Megan and PhymmBL). We also present interesting results by analyzing a real metagenomic dataset. CONCLUSIONS: We have introduced a novel taxonomic classification method for analyzing the microbial diversity from whole-metagenome shotgun sequences. Compared with previous approaches, MetaPhyler is much more accurate in estimating the phylogenetic composition. In addition, we have shown that MetaPhyler can be used to guide the discovery of novel organisms from metagenomic samples.
format Online
Article
Text
id pubmed-3194235
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31942352011-10-17 Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences Liu, Bo Gibbons, Theodore Ghodsi, Mohammad Treangen, Todd Pop, Mihai BMC Genomics Proceedings BACKGROUND: A major goal of metagenomics is to characterize the microbial composition of an environment. The most popular approach relies on 16S rRNA sequencing, however this approach can generate biased estimates due to differences in the copy number of the gene between even closely related organisms, and due to PCR artifacts. The taxonomic composition can also be determined from metagenomic shotgun sequencing data by matching individual reads against a database of reference sequences. One major limitation of prior computational methods used for this purpose is the use of a universal classification threshold for all genes at all taxonomic levels. RESULTS: We propose that better classification results can be obtained by tuning the taxonomic classifier to each matching length, reference gene, and taxonomic level. We present a novel taxonomic classifier MetaPhyler (http://metaphyler.cbcb.umd.edu), which uses phylogenetic marker genes as a taxonomic reference. Results on simulated datasets demonstrate that MetaPhyler outperforms other tools commonly used in this context (CARMA, Megan and PhymmBL). We also present interesting results by analyzing a real metagenomic dataset. CONCLUSIONS: We have introduced a novel taxonomic classification method for analyzing the microbial diversity from whole-metagenome shotgun sequences. Compared with previous approaches, MetaPhyler is much more accurate in estimating the phylogenetic composition. In addition, we have shown that MetaPhyler can be used to guide the discovery of novel organisms from metagenomic samples. BioMed Central 2011-07-27 /pmc/articles/PMC3194235/ /pubmed/21989143 http://dx.doi.org/10.1186/1471-2164-12-S2-S4 Text en Copyright ©2011 Liu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Liu, Bo
Gibbons, Theodore
Ghodsi, Mohammad
Treangen, Todd
Pop, Mihai
Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences
title Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences
title_full Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences
title_fullStr Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences
title_full_unstemmed Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences
title_short Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences
title_sort accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3194235/
https://www.ncbi.nlm.nih.gov/pubmed/21989143
http://dx.doi.org/10.1186/1471-2164-12-S2-S4
work_keys_str_mv AT liubo accurateandfastestimationoftaxonomicprofilesfrommetagenomicshotgunsequences
AT gibbonstheodore accurateandfastestimationoftaxonomicprofilesfrommetagenomicshotgunsequences
AT ghodsimohammad accurateandfastestimationoftaxonomicprofilesfrommetagenomicshotgunsequences
AT treangentodd accurateandfastestimationoftaxonomicprofilesfrommetagenomicshotgunsequences
AT popmihai accurateandfastestimationoftaxonomicprofilesfrommetagenomicshotgunsequences