Cargando…

Estimating true evolutionary distances under the DCJ model

Motivation: Modern techniques can yield the ordering and strandedness of genes on each chromosome of a genome; such data already exists for hundreds of organisms. The evolutionary mechanisms through which the set of the genes of an organism is altered and reordered are of great interest to systemati...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Yu, Moret, Bernard M.E.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2718626/
https://www.ncbi.nlm.nih.gov/pubmed/18586703
http://dx.doi.org/10.1093/bioinformatics/btn148
_version_ 1782170002406768640
author Lin, Yu
Moret, Bernard M.E.
author_facet Lin, Yu
Moret, Bernard M.E.
author_sort Lin, Yu
collection PubMed
description Motivation: Modern techniques can yield the ordering and strandedness of genes on each chromosome of a genome; such data already exists for hundreds of organisms. The evolutionary mechanisms through which the set of the genes of an organism is altered and reordered are of great interest to systematists, evolutionary biologists, comparative genomicists and biomedical researchers. Perhaps the most basic concept in this area is that of evolutionary distance between two genomes: under a given model of genomic evolution, how many events most likely took place to account for the difference between the two genomes? Results: We present a method to estimate the true evolutionary distance between two genomes under the ‘double-cut-and-join’ (DCJ) model of genome rearrangement, a model under which a single multichromosomal operation accounts for all genomic rearrangement events: inversion, transposition, translocation, block interchange and chromosomal fusion and fission. Our method relies on a simple structural characterization of a genome pair and is both analytically and computationally tractable. We provide analytical results to describe the asymptotic behavior of genomes under the DCJ model, as well as experimental results on a wide variety of genome structures to exemplify the very high accuracy (and low variance) of our estimator. Our results provide a tool for accurate phylogenetic reconstruction from multichromosomal gene rearrangement data as well as a theoretical basis for refinements of the DCJ model to account for biological constraints. Availability: All of our software is available in source form under GPL at http://lcbb.epfl.ch Contact: bernard.moret@epfl.ch
format Text
id pubmed-2718626
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-27186262009-07-31 Estimating true evolutionary distances under the DCJ model Lin, Yu Moret, Bernard M.E. Bioinformatics Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto Motivation: Modern techniques can yield the ordering and strandedness of genes on each chromosome of a genome; such data already exists for hundreds of organisms. The evolutionary mechanisms through which the set of the genes of an organism is altered and reordered are of great interest to systematists, evolutionary biologists, comparative genomicists and biomedical researchers. Perhaps the most basic concept in this area is that of evolutionary distance between two genomes: under a given model of genomic evolution, how many events most likely took place to account for the difference between the two genomes? Results: We present a method to estimate the true evolutionary distance between two genomes under the ‘double-cut-and-join’ (DCJ) model of genome rearrangement, a model under which a single multichromosomal operation accounts for all genomic rearrangement events: inversion, transposition, translocation, block interchange and chromosomal fusion and fission. Our method relies on a simple structural characterization of a genome pair and is both analytically and computationally tractable. We provide analytical results to describe the asymptotic behavior of genomes under the DCJ model, as well as experimental results on a wide variety of genome structures to exemplify the very high accuracy (and low variance) of our estimator. Our results provide a tool for accurate phylogenetic reconstruction from multichromosomal gene rearrangement data as well as a theoretical basis for refinements of the DCJ model to account for biological constraints. Availability: All of our software is available in source form under GPL at http://lcbb.epfl.ch Contact: bernard.moret@epfl.ch Oxford University Press 2008-07-01 /pmc/articles/PMC2718626/ /pubmed/18586703 http://dx.doi.org/10.1093/bioinformatics/btn148 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto
Lin, Yu
Moret, Bernard M.E.
Estimating true evolutionary distances under the DCJ model
title Estimating true evolutionary distances under the DCJ model
title_full Estimating true evolutionary distances under the DCJ model
title_fullStr Estimating true evolutionary distances under the DCJ model
title_full_unstemmed Estimating true evolutionary distances under the DCJ model
title_short Estimating true evolutionary distances under the DCJ model
title_sort estimating true evolutionary distances under the dcj model
topic Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2718626/
https://www.ncbi.nlm.nih.gov/pubmed/18586703
http://dx.doi.org/10.1093/bioinformatics/btn148
work_keys_str_mv AT linyu estimatingtrueevolutionarydistancesunderthedcjmodel
AT moretbernardme estimatingtrueevolutionarydistancesunderthedcjmodel