Cargando…

DCJ-indel and DCJ-substitution distances with distinct operation costs

BACKGROUND: Classical approaches to compute the genomic distance are usually limited to genomes with the same content and take into consideration only rearrangements that change the organization of the genome (i.e. positions and orientation of pieces of DNA, number and type of chromosomes, etc.), su...

Descripción completa

Detalles Bibliográficos
Autores principales: da Silva, Poly H, Machado, Raphael, Dantas, Simone, Braga, Marília DV
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3765133/
https://www.ncbi.nlm.nih.gov/pubmed/23879938
http://dx.doi.org/10.1186/1748-7188-8-21
_version_ 1782283231027003392
author da Silva, Poly H
Machado, Raphael
Dantas, Simone
Braga, Marília DV
author_facet da Silva, Poly H
Machado, Raphael
Dantas, Simone
Braga, Marília DV
author_sort da Silva, Poly H
collection PubMed
description BACKGROUND: Classical approaches to compute the genomic distance are usually limited to genomes with the same content and take into consideration only rearrangements that change the organization of the genome (i.e. positions and orientation of pieces of DNA, number and type of chromosomes, etc.), such as inversions, translocations, fusions and fissions. These operations are generically represented by the double-cut and join (DCJ) operation. The distance between two genomes, in terms of number of DCJ operations, can be computed in linear time. In order to handle genomes with distinct contents, also insertions and deletions of fragments of DNA – named indels – must be allowed. More powerful than an indel is a substitution of a fragment of DNA by another fragment of DNA. Indels and substitutions are called content-modifying operations. It has been shown that both the DCJ-indel and the DCJ-substitution distances can also be computed in linear time, assuming that the same cost is assigned to any DCJ or content-modifying operation. RESULTS: In the present study we extend the DCJ-indel and the DCJ-substitution models, considering that the content-modifying cost is distinct from and upper bounded by the DCJ cost, and show that the distance in both models can still be computed in linear time. Although the triangular inequality can be disrupted in both models, we also show how to efficiently fix this problem a posteriori.
format Online
Article
Text
id pubmed-3765133
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-37651332013-09-10 DCJ-indel and DCJ-substitution distances with distinct operation costs da Silva, Poly H Machado, Raphael Dantas, Simone Braga, Marília DV Algorithms Mol Biol Research BACKGROUND: Classical approaches to compute the genomic distance are usually limited to genomes with the same content and take into consideration only rearrangements that change the organization of the genome (i.e. positions and orientation of pieces of DNA, number and type of chromosomes, etc.), such as inversions, translocations, fusions and fissions. These operations are generically represented by the double-cut and join (DCJ) operation. The distance between two genomes, in terms of number of DCJ operations, can be computed in linear time. In order to handle genomes with distinct contents, also insertions and deletions of fragments of DNA – named indels – must be allowed. More powerful than an indel is a substitution of a fragment of DNA by another fragment of DNA. Indels and substitutions are called content-modifying operations. It has been shown that both the DCJ-indel and the DCJ-substitution distances can also be computed in linear time, assuming that the same cost is assigned to any DCJ or content-modifying operation. RESULTS: In the present study we extend the DCJ-indel and the DCJ-substitution models, considering that the content-modifying cost is distinct from and upper bounded by the DCJ cost, and show that the distance in both models can still be computed in linear time. Although the triangular inequality can be disrupted in both models, we also show how to efficiently fix this problem a posteriori. BioMed Central 2013-07-23 /pmc/articles/PMC3765133/ /pubmed/23879938 http://dx.doi.org/10.1186/1748-7188-8-21 Text en Copyright © 2013 da Silva et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
da Silva, Poly H
Machado, Raphael
Dantas, Simone
Braga, Marília DV
DCJ-indel and DCJ-substitution distances with distinct operation costs
title DCJ-indel and DCJ-substitution distances with distinct operation costs
title_full DCJ-indel and DCJ-substitution distances with distinct operation costs
title_fullStr DCJ-indel and DCJ-substitution distances with distinct operation costs
title_full_unstemmed DCJ-indel and DCJ-substitution distances with distinct operation costs
title_short DCJ-indel and DCJ-substitution distances with distinct operation costs
title_sort dcj-indel and dcj-substitution distances with distinct operation costs
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3765133/
https://www.ncbi.nlm.nih.gov/pubmed/23879938
http://dx.doi.org/10.1186/1748-7188-8-21
work_keys_str_mv AT dasilvapolyh dcjindelanddcjsubstitutiondistanceswithdistinctoperationcosts
AT machadoraphael dcjindelanddcjsubstitutiondistanceswithdistinctoperationcosts
AT dantassimone dcjindelanddcjsubstitutiondistanceswithdistinctoperationcosts
AT bragamariliadv dcjindelanddcjsubstitutiondistanceswithdistinctoperationcosts