Cargando…
PHOG: a database of supergenomes built from proteome complements
BACKGROUND: Orthologs and paralogs are widely used terms in modern comparative genomics. Existing procedures for resolving orthologous/paralogous relationships are often based on manual revision of clusters of orthologous groups and/or lack any rigorous evolutionary base. DESCRIPTION: We developed a...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1523204/ https://www.ncbi.nlm.nih.gov/pubmed/16792803 http://dx.doi.org/10.1186/1471-2148-6-52 |
_version_ | 1782128799282888704 |
---|---|
author | Merkeev, Igor V Novichkov, Pavel S Mironov, Andrey A |
author_facet | Merkeev, Igor V Novichkov, Pavel S Mironov, Andrey A |
author_sort | Merkeev, Igor V |
collection | PubMed |
description | BACKGROUND: Orthologs and paralogs are widely used terms in modern comparative genomics. Existing procedures for resolving orthologous/paralogous relationships are often based on manual revision of clusters of orthologous groups and/or lack any rigorous evolutionary base. DESCRIPTION: We developed a completely automated procedure that creates clusters of orthologous groups at each node of the taxonomy tree (PHOGs – Phylogenetic Orthologous Groups). As a result of this procedure, a tree of orthologous groups was obtained. Each cluster is a "supergene" and it is represented by an "ancestral" sequence obtained from the multiple alignment of orthologous and paralogous genes. The procedure has been applied to the taxonomy tree of organisms from all three domains of life. Protein complements from 50 bacterial, archaeal and eukaryotic species were used to create PHOGs at all tree nodes. 51367 PHOGs were obtained at the root node. CONCLUSION: The PHOG database demonstrates that it is possible to automatically process any number of sequenced genomes and to reconstruct orthologous and paralogous relationships between genomes using a rigorous evolutionary approach. This database can become a very useful tool in various areas of comparative genomics. |
format | Text |
id | pubmed-1523204 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-15232042006-07-27 PHOG: a database of supergenomes built from proteome complements Merkeev, Igor V Novichkov, Pavel S Mironov, Andrey A BMC Evol Biol Database BACKGROUND: Orthologs and paralogs are widely used terms in modern comparative genomics. Existing procedures for resolving orthologous/paralogous relationships are often based on manual revision of clusters of orthologous groups and/or lack any rigorous evolutionary base. DESCRIPTION: We developed a completely automated procedure that creates clusters of orthologous groups at each node of the taxonomy tree (PHOGs – Phylogenetic Orthologous Groups). As a result of this procedure, a tree of orthologous groups was obtained. Each cluster is a "supergene" and it is represented by an "ancestral" sequence obtained from the multiple alignment of orthologous and paralogous genes. The procedure has been applied to the taxonomy tree of organisms from all three domains of life. Protein complements from 50 bacterial, archaeal and eukaryotic species were used to create PHOGs at all tree nodes. 51367 PHOGs were obtained at the root node. CONCLUSION: The PHOG database demonstrates that it is possible to automatically process any number of sequenced genomes and to reconstruct orthologous and paralogous relationships between genomes using a rigorous evolutionary approach. This database can become a very useful tool in various areas of comparative genomics. BioMed Central 2006-06-22 /pmc/articles/PMC1523204/ /pubmed/16792803 http://dx.doi.org/10.1186/1471-2148-6-52 Text en Copyright © 2006 Merkeev et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Database Merkeev, Igor V Novichkov, Pavel S Mironov, Andrey A PHOG: a database of supergenomes built from proteome complements |
title | PHOG: a database of supergenomes built from proteome complements |
title_full | PHOG: a database of supergenomes built from proteome complements |
title_fullStr | PHOG: a database of supergenomes built from proteome complements |
title_full_unstemmed | PHOG: a database of supergenomes built from proteome complements |
title_short | PHOG: a database of supergenomes built from proteome complements |
title_sort | phog: a database of supergenomes built from proteome complements |
topic | Database |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1523204/ https://www.ncbi.nlm.nih.gov/pubmed/16792803 http://dx.doi.org/10.1186/1471-2148-6-52 |
work_keys_str_mv | AT merkeevigorv phogadatabaseofsupergenomesbuiltfromproteomecomplements AT novichkovpavels phogadatabaseofsupergenomesbuiltfromproteomecomplements AT mironovandreya phogadatabaseofsupergenomesbuiltfromproteomecomplements |