Cargando…

Functional phylogenomics analysis of bacteria and archaea using consistent genome annotation with UniFam

BACKGROUND: Phylogenetic studies have provided detailed knowledge on the evolutionary mechanisms of genes and species in Bacteria and Archaea. However, the evolution of cellular functions, represented by metabolic pathways and biological processes, has not been systematically characterized. Many cla...

Descripción completa

Detalles Bibliográficos
Autores principales: Chai, Juanjuan, Kora, Guruprasad, Ahn, Tae-Hyuk, Hyatt, Doug, Pan, Chongle
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4194380/
https://www.ncbi.nlm.nih.gov/pubmed/25293379
http://dx.doi.org/10.1186/s12862-014-0207-y
_version_ 1782339102824202240
author Chai, Juanjuan
Kora, Guruprasad
Ahn, Tae-Hyuk
Hyatt, Doug
Pan, Chongle
author_facet Chai, Juanjuan
Kora, Guruprasad
Ahn, Tae-Hyuk
Hyatt, Doug
Pan, Chongle
author_sort Chai, Juanjuan
collection PubMed
description BACKGROUND: Phylogenetic studies have provided detailed knowledge on the evolutionary mechanisms of genes and species in Bacteria and Archaea. However, the evolution of cellular functions, represented by metabolic pathways and biological processes, has not been systematically characterized. Many clades in the prokaryotic tree of life have now been covered by sequenced genomes in GenBank. This enables a large-scale functional phylogenomics study of many computationally inferred cellular functions across all sequenced prokaryotes. RESULTS: A total of 14,727 GenBank prokaryotic genomes were re-annotated using a new protein family database, UniFam, to obtain consistent functional annotations for accurate comparison. The functional profile of a genome was represented by the biological process Gene Ontology (GO) terms in its annotation. The GO term enrichment analysis differentiated the functional profiles between selected archaeal taxa. 706 prokaryotic metabolic pathways were inferred from these genomes using Pathway Tools and MetaCyc. The consistency between the distribution of metabolic pathways in the genomes and the phylogenetic tree of the genomes was measured using parsimony scores and retention indices. The ancestral functional profiles at the internal nodes of the phylogenetic tree were reconstructed to track the gains and losses of metabolic pathways in evolutionary history. CONCLUSIONS: Our functional phylogenomics analysis shows divergent functional profiles of taxa and clades. Such function-phylogeny correlation stems from a set of clade-specific cellular functions with low parsimony scores. On the other hand, many cellular functions are sparsely dispersed across many clades with high parsimony scores. These different types of cellular functions have distinct evolutionary patterns reconstructed from the prokaryotic tree. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12862-014-0207-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4194380
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41943802014-10-14 Functional phylogenomics analysis of bacteria and archaea using consistent genome annotation with UniFam Chai, Juanjuan Kora, Guruprasad Ahn, Tae-Hyuk Hyatt, Doug Pan, Chongle BMC Evol Biol Research Article BACKGROUND: Phylogenetic studies have provided detailed knowledge on the evolutionary mechanisms of genes and species in Bacteria and Archaea. However, the evolution of cellular functions, represented by metabolic pathways and biological processes, has not been systematically characterized. Many clades in the prokaryotic tree of life have now been covered by sequenced genomes in GenBank. This enables a large-scale functional phylogenomics study of many computationally inferred cellular functions across all sequenced prokaryotes. RESULTS: A total of 14,727 GenBank prokaryotic genomes were re-annotated using a new protein family database, UniFam, to obtain consistent functional annotations for accurate comparison. The functional profile of a genome was represented by the biological process Gene Ontology (GO) terms in its annotation. The GO term enrichment analysis differentiated the functional profiles between selected archaeal taxa. 706 prokaryotic metabolic pathways were inferred from these genomes using Pathway Tools and MetaCyc. The consistency between the distribution of metabolic pathways in the genomes and the phylogenetic tree of the genomes was measured using parsimony scores and retention indices. The ancestral functional profiles at the internal nodes of the phylogenetic tree were reconstructed to track the gains and losses of metabolic pathways in evolutionary history. CONCLUSIONS: Our functional phylogenomics analysis shows divergent functional profiles of taxa and clades. Such function-phylogeny correlation stems from a set of clade-specific cellular functions with low parsimony scores. On the other hand, many cellular functions are sparsely dispersed across many clades with high parsimony scores. These different types of cellular functions have distinct evolutionary patterns reconstructed from the prokaryotic tree. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12862-014-0207-y) contains supplementary material, which is available to authorized users. BioMed Central 2014-10-09 /pmc/articles/PMC4194380/ /pubmed/25293379 http://dx.doi.org/10.1186/s12862-014-0207-y Text en © Chai et al.; licensee BioMed Central Ltd. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Chai, Juanjuan
Kora, Guruprasad
Ahn, Tae-Hyuk
Hyatt, Doug
Pan, Chongle
Functional phylogenomics analysis of bacteria and archaea using consistent genome annotation with UniFam
title Functional phylogenomics analysis of bacteria and archaea using consistent genome annotation with UniFam
title_full Functional phylogenomics analysis of bacteria and archaea using consistent genome annotation with UniFam
title_fullStr Functional phylogenomics analysis of bacteria and archaea using consistent genome annotation with UniFam
title_full_unstemmed Functional phylogenomics analysis of bacteria and archaea using consistent genome annotation with UniFam
title_short Functional phylogenomics analysis of bacteria and archaea using consistent genome annotation with UniFam
title_sort functional phylogenomics analysis of bacteria and archaea using consistent genome annotation with unifam
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4194380/
https://www.ncbi.nlm.nih.gov/pubmed/25293379
http://dx.doi.org/10.1186/s12862-014-0207-y
work_keys_str_mv AT chaijuanjuan functionalphylogenomicsanalysisofbacteriaandarchaeausingconsistentgenomeannotationwithunifam
AT koraguruprasad functionalphylogenomicsanalysisofbacteriaandarchaeausingconsistentgenomeannotationwithunifam
AT ahntaehyuk functionalphylogenomicsanalysisofbacteriaandarchaeausingconsistentgenomeannotationwithunifam
AT hyattdoug functionalphylogenomicsanalysisofbacteriaandarchaeausingconsistentgenomeannotationwithunifam
AT panchongle functionalphylogenomicsanalysisofbacteriaandarchaeausingconsistentgenomeannotationwithunifam