Cargando…
Computing the family-free DCJ similarity
BACKGROUND: The genomic similarity is a large-scale measure for comparing two given genomes. In this work we study the (NP-hard) problem of computing the genomic similarity under the DCJ model in a setting that does not assume that the genes of the compared genomes are grouped into gene families. Th...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5998916/ https://www.ncbi.nlm.nih.gov/pubmed/29745861 http://dx.doi.org/10.1186/s12859-018-2130-5 |
_version_ | 1783331330342256640 |
---|---|
author | Rubert, Diego P. Hoshino, Edna A. Braga, Marília D. V. Stoye, Jens Martinez, Fábio V. |
author_facet | Rubert, Diego P. Hoshino, Edna A. Braga, Marília D. V. Stoye, Jens Martinez, Fábio V. |
author_sort | Rubert, Diego P. |
collection | PubMed |
description | BACKGROUND: The genomic similarity is a large-scale measure for comparing two given genomes. In this work we study the (NP-hard) problem of computing the genomic similarity under the DCJ model in a setting that does not assume that the genes of the compared genomes are grouped into gene families. This problem is called family-free DCJ similarity. RESULTS: We propose an exact ILP algorithm to solve the family-free DCJ similarity problem, then we show its APX-hardness and present four combinatorial heuristics with computational experiments comparing their results to the ILP. CONCLUSIONS: We show that the family-free DCJ similarity can be computed in reasonable time, although for larger genomes it is necessary to resort to heuristics. This provides a basis for further studies on the applicability and model refinement of family-free whole genome similarity measures. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2130-5) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5998916 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-59989162018-06-25 Computing the family-free DCJ similarity Rubert, Diego P. Hoshino, Edna A. Braga, Marília D. V. Stoye, Jens Martinez, Fábio V. BMC Bioinformatics Research BACKGROUND: The genomic similarity is a large-scale measure for comparing two given genomes. In this work we study the (NP-hard) problem of computing the genomic similarity under the DCJ model in a setting that does not assume that the genes of the compared genomes are grouped into gene families. This problem is called family-free DCJ similarity. RESULTS: We propose an exact ILP algorithm to solve the family-free DCJ similarity problem, then we show its APX-hardness and present four combinatorial heuristics with computational experiments comparing their results to the ILP. CONCLUSIONS: We show that the family-free DCJ similarity can be computed in reasonable time, although for larger genomes it is necessary to resort to heuristics. This provides a basis for further studies on the applicability and model refinement of family-free whole genome similarity measures. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2130-5) contains supplementary material, which is available to authorized users. BioMed Central 2018-05-08 /pmc/articles/PMC5998916/ /pubmed/29745861 http://dx.doi.org/10.1186/s12859-018-2130-5 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Rubert, Diego P. Hoshino, Edna A. Braga, Marília D. V. Stoye, Jens Martinez, Fábio V. Computing the family-free DCJ similarity |
title | Computing the family-free DCJ similarity |
title_full | Computing the family-free DCJ similarity |
title_fullStr | Computing the family-free DCJ similarity |
title_full_unstemmed | Computing the family-free DCJ similarity |
title_short | Computing the family-free DCJ similarity |
title_sort | computing the family-free dcj similarity |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5998916/ https://www.ncbi.nlm.nih.gov/pubmed/29745861 http://dx.doi.org/10.1186/s12859-018-2130-5 |
work_keys_str_mv | AT rubertdiegop computingthefamilyfreedcjsimilarity AT hoshinoednaa computingthefamilyfreedcjsimilarity AT bragamariliadv computingthefamilyfreedcjsimilarity AT stoyejens computingthefamilyfreedcjsimilarity AT martinezfabiov computingthefamilyfreedcjsimilarity |