Cargando…

On the family-free DCJ distance and similarity

Structural variation in genomes can be revealed by many (dis)similarity measures. Rearrangement operations, such as the so called double-cut-and-join (DCJ), are large-scale mutations that can create complex changes and produce such variations in genomes. A basic task in comparative genomics is to fi...

Descripción completa

Detalles Bibliográficos
Autores principales: Martinez, Fábio V, Feijão, Pedro, Braga, Marília DV, Stoye, Jens
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4391664/
https://www.ncbi.nlm.nih.gov/pubmed/25859276
http://dx.doi.org/10.1186/s13015-015-0041-9
_version_ 1782365854709579776
author Martinez, Fábio V
Feijão, Pedro
Braga, Marília DV
Stoye, Jens
author_facet Martinez, Fábio V
Feijão, Pedro
Braga, Marília DV
Stoye, Jens
author_sort Martinez, Fábio V
collection PubMed
description Structural variation in genomes can be revealed by many (dis)similarity measures. Rearrangement operations, such as the so called double-cut-and-join (DCJ), are large-scale mutations that can create complex changes and produce such variations in genomes. A basic task in comparative genomics is to find the rearrangement distance between two given genomes, i.e., the minimum number of rearragement operations that transform one given genome into another one. In a family-based setting, genes are grouped into gene families and efficient algorithms have already been presented to compute the DCJ distance between two given genomes. In this work we propose the problem of computing the DCJ distance of two given genomes without prior gene family assignment, directly using the pairwise similarities between genes. We prove that this new family-free DCJ distance problem is APX-hard and provide an integer linear program to its solution. We also study a family-free DCJ similarity and prove that its computation is NP-hard.
format Online
Article
Text
id pubmed-4391664
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43916642015-04-10 On the family-free DCJ distance and similarity Martinez, Fábio V Feijão, Pedro Braga, Marília DV Stoye, Jens Algorithms Mol Biol Research Structural variation in genomes can be revealed by many (dis)similarity measures. Rearrangement operations, such as the so called double-cut-and-join (DCJ), are large-scale mutations that can create complex changes and produce such variations in genomes. A basic task in comparative genomics is to find the rearrangement distance between two given genomes, i.e., the minimum number of rearragement operations that transform one given genome into another one. In a family-based setting, genes are grouped into gene families and efficient algorithms have already been presented to compute the DCJ distance between two given genomes. In this work we propose the problem of computing the DCJ distance of two given genomes without prior gene family assignment, directly using the pairwise similarities between genes. We prove that this new family-free DCJ distance problem is APX-hard and provide an integer linear program to its solution. We also study a family-free DCJ similarity and prove that its computation is NP-hard. BioMed Central 2015-04-01 /pmc/articles/PMC4391664/ /pubmed/25859276 http://dx.doi.org/10.1186/s13015-015-0041-9 Text en © Martinez et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Martinez, Fábio V
Feijão, Pedro
Braga, Marília DV
Stoye, Jens
On the family-free DCJ distance and similarity
title On the family-free DCJ distance and similarity
title_full On the family-free DCJ distance and similarity
title_fullStr On the family-free DCJ distance and similarity
title_full_unstemmed On the family-free DCJ distance and similarity
title_short On the family-free DCJ distance and similarity
title_sort on the family-free dcj distance and similarity
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4391664/
https://www.ncbi.nlm.nih.gov/pubmed/25859276
http://dx.doi.org/10.1186/s13015-015-0041-9
work_keys_str_mv AT martinezfabiov onthefamilyfreedcjdistanceandsimilarity
AT feijaopedro onthefamilyfreedcjdistanceandsimilarity
AT bragamariliadv onthefamilyfreedcjdistanceandsimilarity
AT stoyejens onthefamilyfreedcjdistanceandsimilarity