Cargando…

Revealing Mammalian Evolutionary Relationships by Comparative Analysis of Gene Clusters

Many software tools for comparative analysis of genomic sequence data have been released in recent decades. Despite this, it remains challenging to determine evolutionary relationships in gene clusters due to their complex histories involving duplications, deletions, inversions, and conversions. One...

Descripción completa

Detalles Bibliográficos
Autores principales: Song, Giltae, Riemer, Cathy, Dickins, Benjamin, Kim, Hie Lim, Zhang, Louxin, Zhang, Yu, Hsu, Chih-Hao, Hardison, Ross C., NISC Comparative Sequencing Program, Green, Eric D., Miller, Webb
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3342878/
https://www.ncbi.nlm.nih.gov/pubmed/22454131
http://dx.doi.org/10.1093/gbe/evs032
_version_ 1782231741477421056
author Song, Giltae
Riemer, Cathy
Dickins, Benjamin
Kim, Hie Lim
Zhang, Louxin
Zhang, Yu
Hsu, Chih-Hao
Hardison, Ross C.
NISC Comparative Sequencing Program,
Green, Eric D.
Miller, Webb
author_facet Song, Giltae
Riemer, Cathy
Dickins, Benjamin
Kim, Hie Lim
Zhang, Louxin
Zhang, Yu
Hsu, Chih-Hao
Hardison, Ross C.
NISC Comparative Sequencing Program,
Green, Eric D.
Miller, Webb
author_sort Song, Giltae
collection PubMed
description Many software tools for comparative analysis of genomic sequence data have been released in recent decades. Despite this, it remains challenging to determine evolutionary relationships in gene clusters due to their complex histories involving duplications, deletions, inversions, and conversions. One concept describing these relationships is orthology. Orthologs derive from a common ancestor by speciation, in contrast to paralogs, which derive from duplication. Discriminating orthologs from paralogs is a necessary step in most multispecies sequence analyses, but doing so accurately is impeded by the occurrence of gene conversion events. We propose a refined method of orthology assignment based on two paradigms for interpreting its definition: by genomic context or by sequence content. X-orthology (based on context) traces orthology resulting from speciation and duplication only, while N-orthology (based on content) includes the influence of conversion events. We developed a computational method for automatically mapping both types of orthology on a per-nucleotide basis in gene cluster regions studied by comparative sequencing, and we make this mapping accessible by visualizing the output. All of these steps are incorporated into our newly extended CHAP 2 package. We evaluate our method using both simulated data and real gene clusters (including the well-characterized α-globin and β-globin clusters). We also illustrate use of CHAP 2 by analyzing four more loci: CCL (chemokine ligand), IFN (interferon), CYP2abf (part of cytochrome P450 family 2), and KIR (killer cell immunoglobulin-like receptors). These new methods facilitate and extend our understanding of evolution at these and other loci by adding automated accurate evolutionary inference to the biologist's toolkit. The CHAP 2 package is freely available from http://www.bx.psu.edu/miller_lab.
format Online
Article
Text
id pubmed-3342878
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-33428782012-05-04 Revealing Mammalian Evolutionary Relationships by Comparative Analysis of Gene Clusters Song, Giltae Riemer, Cathy Dickins, Benjamin Kim, Hie Lim Zhang, Louxin Zhang, Yu Hsu, Chih-Hao Hardison, Ross C. NISC Comparative Sequencing Program, Green, Eric D. Miller, Webb Genome Biol Evol Research Articles Many software tools for comparative analysis of genomic sequence data have been released in recent decades. Despite this, it remains challenging to determine evolutionary relationships in gene clusters due to their complex histories involving duplications, deletions, inversions, and conversions. One concept describing these relationships is orthology. Orthologs derive from a common ancestor by speciation, in contrast to paralogs, which derive from duplication. Discriminating orthologs from paralogs is a necessary step in most multispecies sequence analyses, but doing so accurately is impeded by the occurrence of gene conversion events. We propose a refined method of orthology assignment based on two paradigms for interpreting its definition: by genomic context or by sequence content. X-orthology (based on context) traces orthology resulting from speciation and duplication only, while N-orthology (based on content) includes the influence of conversion events. We developed a computational method for automatically mapping both types of orthology on a per-nucleotide basis in gene cluster regions studied by comparative sequencing, and we make this mapping accessible by visualizing the output. All of these steps are incorporated into our newly extended CHAP 2 package. We evaluate our method using both simulated data and real gene clusters (including the well-characterized α-globin and β-globin clusters). We also illustrate use of CHAP 2 by analyzing four more loci: CCL (chemokine ligand), IFN (interferon), CYP2abf (part of cytochrome P450 family 2), and KIR (killer cell immunoglobulin-like receptors). These new methods facilitate and extend our understanding of evolution at these and other loci by adding automated accurate evolutionary inference to the biologist's toolkit. The CHAP 2 package is freely available from http://www.bx.psu.edu/miller_lab. Oxford University Press 2012 2012-03-27 /pmc/articles/PMC3342878/ /pubmed/22454131 http://dx.doi.org/10.1093/gbe/evs032 Text en © The Author(s) 2012. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Song, Giltae
Riemer, Cathy
Dickins, Benjamin
Kim, Hie Lim
Zhang, Louxin
Zhang, Yu
Hsu, Chih-Hao
Hardison, Ross C.
NISC Comparative Sequencing Program,
Green, Eric D.
Miller, Webb
Revealing Mammalian Evolutionary Relationships by Comparative Analysis of Gene Clusters
title Revealing Mammalian Evolutionary Relationships by Comparative Analysis of Gene Clusters
title_full Revealing Mammalian Evolutionary Relationships by Comparative Analysis of Gene Clusters
title_fullStr Revealing Mammalian Evolutionary Relationships by Comparative Analysis of Gene Clusters
title_full_unstemmed Revealing Mammalian Evolutionary Relationships by Comparative Analysis of Gene Clusters
title_short Revealing Mammalian Evolutionary Relationships by Comparative Analysis of Gene Clusters
title_sort revealing mammalian evolutionary relationships by comparative analysis of gene clusters
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3342878/
https://www.ncbi.nlm.nih.gov/pubmed/22454131
http://dx.doi.org/10.1093/gbe/evs032
work_keys_str_mv AT songgiltae revealingmammalianevolutionaryrelationshipsbycomparativeanalysisofgeneclusters
AT riemercathy revealingmammalianevolutionaryrelationshipsbycomparativeanalysisofgeneclusters
AT dickinsbenjamin revealingmammalianevolutionaryrelationshipsbycomparativeanalysisofgeneclusters
AT kimhielim revealingmammalianevolutionaryrelationshipsbycomparativeanalysisofgeneclusters
AT zhanglouxin revealingmammalianevolutionaryrelationshipsbycomparativeanalysisofgeneclusters
AT zhangyu revealingmammalianevolutionaryrelationshipsbycomparativeanalysisofgeneclusters
AT hsuchihhao revealingmammalianevolutionaryrelationshipsbycomparativeanalysisofgeneclusters
AT hardisonrossc revealingmammalianevolutionaryrelationshipsbycomparativeanalysisofgeneclusters
AT nisccomparativesequencingprogram revealingmammalianevolutionaryrelationshipsbycomparativeanalysisofgeneclusters
AT greenericd revealingmammalianevolutionaryrelationshipsbycomparativeanalysisofgeneclusters
AT millerwebb revealingmammalianevolutionaryrelationshipsbycomparativeanalysisofgeneclusters