Cargando…

PHOG: a database of supergenomes built from proteome complements

BACKGROUND: Orthologs and paralogs are widely used terms in modern comparative genomics. Existing procedures for resolving orthologous/paralogous relationships are often based on manual revision of clusters of orthologous groups and/or lack any rigorous evolutionary base. DESCRIPTION: We developed a...

Descripción completa

Detalles Bibliográficos
Autores principales: Merkeev, Igor V, Novichkov, Pavel S, Mironov, Andrey A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1523204/
https://www.ncbi.nlm.nih.gov/pubmed/16792803
http://dx.doi.org/10.1186/1471-2148-6-52
_version_ 1782128799282888704
author Merkeev, Igor V
Novichkov, Pavel S
Mironov, Andrey A
author_facet Merkeev, Igor V
Novichkov, Pavel S
Mironov, Andrey A
author_sort Merkeev, Igor V
collection PubMed
description BACKGROUND: Orthologs and paralogs are widely used terms in modern comparative genomics. Existing procedures for resolving orthologous/paralogous relationships are often based on manual revision of clusters of orthologous groups and/or lack any rigorous evolutionary base. DESCRIPTION: We developed a completely automated procedure that creates clusters of orthologous groups at each node of the taxonomy tree (PHOGs – Phylogenetic Orthologous Groups). As a result of this procedure, a tree of orthologous groups was obtained. Each cluster is a "supergene" and it is represented by an "ancestral" sequence obtained from the multiple alignment of orthologous and paralogous genes. The procedure has been applied to the taxonomy tree of organisms from all three domains of life. Protein complements from 50 bacterial, archaeal and eukaryotic species were used to create PHOGs at all tree nodes. 51367 PHOGs were obtained at the root node. CONCLUSION: The PHOG database demonstrates that it is possible to automatically process any number of sequenced genomes and to reconstruct orthologous and paralogous relationships between genomes using a rigorous evolutionary approach. This database can become a very useful tool in various areas of comparative genomics.
format Text
id pubmed-1523204
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15232042006-07-27 PHOG: a database of supergenomes built from proteome complements Merkeev, Igor V Novichkov, Pavel S Mironov, Andrey A BMC Evol Biol Database BACKGROUND: Orthologs and paralogs are widely used terms in modern comparative genomics. Existing procedures for resolving orthologous/paralogous relationships are often based on manual revision of clusters of orthologous groups and/or lack any rigorous evolutionary base. DESCRIPTION: We developed a completely automated procedure that creates clusters of orthologous groups at each node of the taxonomy tree (PHOGs – Phylogenetic Orthologous Groups). As a result of this procedure, a tree of orthologous groups was obtained. Each cluster is a "supergene" and it is represented by an "ancestral" sequence obtained from the multiple alignment of orthologous and paralogous genes. The procedure has been applied to the taxonomy tree of organisms from all three domains of life. Protein complements from 50 bacterial, archaeal and eukaryotic species were used to create PHOGs at all tree nodes. 51367 PHOGs were obtained at the root node. CONCLUSION: The PHOG database demonstrates that it is possible to automatically process any number of sequenced genomes and to reconstruct orthologous and paralogous relationships between genomes using a rigorous evolutionary approach. This database can become a very useful tool in various areas of comparative genomics. BioMed Central 2006-06-22 /pmc/articles/PMC1523204/ /pubmed/16792803 http://dx.doi.org/10.1186/1471-2148-6-52 Text en Copyright © 2006 Merkeev et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database
Merkeev, Igor V
Novichkov, Pavel S
Mironov, Andrey A
PHOG: a database of supergenomes built from proteome complements
title PHOG: a database of supergenomes built from proteome complements
title_full PHOG: a database of supergenomes built from proteome complements
title_fullStr PHOG: a database of supergenomes built from proteome complements
title_full_unstemmed PHOG: a database of supergenomes built from proteome complements
title_short PHOG: a database of supergenomes built from proteome complements
title_sort phog: a database of supergenomes built from proteome complements
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1523204/
https://www.ncbi.nlm.nih.gov/pubmed/16792803
http://dx.doi.org/10.1186/1471-2148-6-52
work_keys_str_mv AT merkeevigorv phogadatabaseofsupergenomesbuiltfromproteomecomplements
AT novichkovpavels phogadatabaseofsupergenomesbuiltfromproteomecomplements
AT mironovandreya phogadatabaseofsupergenomesbuiltfromproteomecomplements