Cargando…
OrthoParaMap: Distinguishing orthologs from paralogs by integrating comparative genome data and gene phylogenies
BACKGROUND: In eukaryotic genomes, most genes are members of gene families. When comparing genes from two species, therefore, most genes in one species will be homologous to multiple genes in the second. This often makes it difficult to distinguish orthologs (separated through speciation) from paral...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2003
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC200972/ https://www.ncbi.nlm.nih.gov/pubmed/12952558 http://dx.doi.org/10.1186/1471-2105-4-35 |
_version_ | 1782120941981007872 |
---|---|
author | Cannon, Steven B Young, Nevin D |
author_facet | Cannon, Steven B Young, Nevin D |
author_sort | Cannon, Steven B |
collection | PubMed |
description | BACKGROUND: In eukaryotic genomes, most genes are members of gene families. When comparing genes from two species, therefore, most genes in one species will be homologous to multiple genes in the second. This often makes it difficult to distinguish orthologs (separated through speciation) from paralogs (separated by other types of gene duplication). Combining phylogenetic relationships and genomic position in both genomes helps to distinguish between these scenarios. This kind of comparison can also help to describe how gene families have evolved within a single genome that has undergone polyploidy or other large-scale duplications, as in the case of Arabidopsis thaliana – and probably most plant genomes. RESULTS: We describe a suite of programs called OrthoParaMap (OPM) that makes genomic comparisons, identifies syntenic regions, determines whether sets of genes in a gene family are related through speciation or internal chromosomal duplications, maps this information onto phylogenetic trees, and infers internal nodes within the phylogenetic tree that may represent local – as opposed to speciation or segmental – duplication. We describe the application of the software using three examples: the melanoma-associated antigen (MAGE) gene family on the X chromosomes of mouse and human; the 20S proteasome subunit gene family in Arabidopsis, and the major latex protein gene family in Arabidopsis. CONCLUSION: OPM combines comparative genomic positional information and phylogenetic reconstructions to identify which gene duplications are likely to have arisen through internal genomic duplications (such as polyploidy), through speciation, or through local duplications (such as unequal crossing-over). The software is freely available at . |
format | Text |
id | pubmed-200972 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2003 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-2009722003-09-30 OrthoParaMap: Distinguishing orthologs from paralogs by integrating comparative genome data and gene phylogenies Cannon, Steven B Young, Nevin D BMC Bioinformatics Software BACKGROUND: In eukaryotic genomes, most genes are members of gene families. When comparing genes from two species, therefore, most genes in one species will be homologous to multiple genes in the second. This often makes it difficult to distinguish orthologs (separated through speciation) from paralogs (separated by other types of gene duplication). Combining phylogenetic relationships and genomic position in both genomes helps to distinguish between these scenarios. This kind of comparison can also help to describe how gene families have evolved within a single genome that has undergone polyploidy or other large-scale duplications, as in the case of Arabidopsis thaliana – and probably most plant genomes. RESULTS: We describe a suite of programs called OrthoParaMap (OPM) that makes genomic comparisons, identifies syntenic regions, determines whether sets of genes in a gene family are related through speciation or internal chromosomal duplications, maps this information onto phylogenetic trees, and infers internal nodes within the phylogenetic tree that may represent local – as opposed to speciation or segmental – duplication. We describe the application of the software using three examples: the melanoma-associated antigen (MAGE) gene family on the X chromosomes of mouse and human; the 20S proteasome subunit gene family in Arabidopsis, and the major latex protein gene family in Arabidopsis. CONCLUSION: OPM combines comparative genomic positional information and phylogenetic reconstructions to identify which gene duplications are likely to have arisen through internal genomic duplications (such as polyploidy), through speciation, or through local duplications (such as unequal crossing-over). The software is freely available at . BioMed Central 2003-09-02 /pmc/articles/PMC200972/ /pubmed/12952558 http://dx.doi.org/10.1186/1471-2105-4-35 Text en Copyright © 2003 Cannon and Young; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL. |
spellingShingle | Software Cannon, Steven B Young, Nevin D OrthoParaMap: Distinguishing orthologs from paralogs by integrating comparative genome data and gene phylogenies |
title | OrthoParaMap: Distinguishing orthologs from paralogs by integrating comparative genome data and gene phylogenies |
title_full | OrthoParaMap: Distinguishing orthologs from paralogs by integrating comparative genome data and gene phylogenies |
title_fullStr | OrthoParaMap: Distinguishing orthologs from paralogs by integrating comparative genome data and gene phylogenies |
title_full_unstemmed | OrthoParaMap: Distinguishing orthologs from paralogs by integrating comparative genome data and gene phylogenies |
title_short | OrthoParaMap: Distinguishing orthologs from paralogs by integrating comparative genome data and gene phylogenies |
title_sort | orthoparamap: distinguishing orthologs from paralogs by integrating comparative genome data and gene phylogenies |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC200972/ https://www.ncbi.nlm.nih.gov/pubmed/12952558 http://dx.doi.org/10.1186/1471-2105-4-35 |
work_keys_str_mv | AT cannonstevenb orthoparamapdistinguishingorthologsfromparalogsbyintegratingcomparativegenomedataandgenephylogenies AT youngnevind orthoparamapdistinguishingorthologsfromparalogsbyintegratingcomparativegenomedataandgenephylogenies |