Cargando…

A new 1.375-approximation algorithm for sorting by transpositions

BACKGROUND: sorting by transpositions (SBT) is a classical problem in genome rearrangements. In 2012, SBT was proven to be [Formula: see text] -hard and the best approximation algorithm with a 1.375 ratio was proposed in 2006 by Elias and Hartman (EH algorithm). Their algorithm employs simplificatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Silva, Luiz Augusto G., Kowada, Luis Antonio B., Rocco, Noraí Romeu, Walter, Maria Emília M. T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8760837/
https://www.ncbi.nlm.nih.gov/pubmed/35033127
http://dx.doi.org/10.1186/s13015-022-00205-z
_version_ 1784633406848499712
author Silva, Luiz Augusto G.
Kowada, Luis Antonio B.
Rocco, Noraí Romeu
Walter, Maria Emília M. T.
author_facet Silva, Luiz Augusto G.
Kowada, Luis Antonio B.
Rocco, Noraí Romeu
Walter, Maria Emília M. T.
author_sort Silva, Luiz Augusto G.
collection PubMed
description BACKGROUND: sorting by transpositions (SBT) is a classical problem in genome rearrangements. In 2012, SBT was proven to be [Formula: see text] -hard and the best approximation algorithm with a 1.375 ratio was proposed in 2006 by Elias and Hartman (EH algorithm). Their algorithm employs simplification, a technique used to transform an input permutation [Formula: see text] into a simple permutation [Formula: see text] , presumably easier to handle with. The permutation [Formula: see text] is obtained by inserting new symbols into [Formula: see text] in a way that the lower bound of the transposition distance of [Formula: see text] is kept on [Formula: see text] . The simplification is guaranteed to keep the lower bound, not the transposition distance. A sequence of operations sorting [Formula: see text] can be mimicked to sort [Formula: see text] . RESULTS AND CONCLUSIONS: First, using an algebraic approach, we propose a new upper bound for the transposition distance, which holds for all [Formula: see text] . Next, motivated by a problem identified in the EH algorithm, which causes it, in scenarios involving how the input permutation is simplified, to require one extra transposition above the 1.375-approximation ratio, we propose a new approximation algorithm to solve SBT ensuring the 1.375-approximation ratio for all [Formula: see text] . We implemented our algorithm and EH’s. Regarding the implementation of the EH algorithm, two other issues were identified and needed to be fixed. We tested both algorithms against all permutations of size n, [Formula: see text] . The results show that the EH algorithm exceeds the approximation ratio of 1.375 for permutations with a size greater than 7. The percentage of computed distances that are equal to transposition distance, computed by the implemented algorithms are also compared with others available in the literature. Finally, we investigate the performance of both implementations on longer permutations of maximum length 500. From the experiments, we conclude that maximum and the average distances computed by our algorithm are a little better than the ones computed by the EH algorithm and the running times of both algorithms are similar, despite the time complexity of our algorithm being higher.
format Online
Article
Text
id pubmed-8760837
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-87608372022-01-18 A new 1.375-approximation algorithm for sorting by transpositions Silva, Luiz Augusto G. Kowada, Luis Antonio B. Rocco, Noraí Romeu Walter, Maria Emília M. T. Algorithms Mol Biol Research BACKGROUND: sorting by transpositions (SBT) is a classical problem in genome rearrangements. In 2012, SBT was proven to be [Formula: see text] -hard and the best approximation algorithm with a 1.375 ratio was proposed in 2006 by Elias and Hartman (EH algorithm). Their algorithm employs simplification, a technique used to transform an input permutation [Formula: see text] into a simple permutation [Formula: see text] , presumably easier to handle with. The permutation [Formula: see text] is obtained by inserting new symbols into [Formula: see text] in a way that the lower bound of the transposition distance of [Formula: see text] is kept on [Formula: see text] . The simplification is guaranteed to keep the lower bound, not the transposition distance. A sequence of operations sorting [Formula: see text] can be mimicked to sort [Formula: see text] . RESULTS AND CONCLUSIONS: First, using an algebraic approach, we propose a new upper bound for the transposition distance, which holds for all [Formula: see text] . Next, motivated by a problem identified in the EH algorithm, which causes it, in scenarios involving how the input permutation is simplified, to require one extra transposition above the 1.375-approximation ratio, we propose a new approximation algorithm to solve SBT ensuring the 1.375-approximation ratio for all [Formula: see text] . We implemented our algorithm and EH’s. Regarding the implementation of the EH algorithm, two other issues were identified and needed to be fixed. We tested both algorithms against all permutations of size n, [Formula: see text] . The results show that the EH algorithm exceeds the approximation ratio of 1.375 for permutations with a size greater than 7. The percentage of computed distances that are equal to transposition distance, computed by the implemented algorithms are also compared with others available in the literature. Finally, we investigate the performance of both implementations on longer permutations of maximum length 500. From the experiments, we conclude that maximum and the average distances computed by our algorithm are a little better than the ones computed by the EH algorithm and the running times of both algorithms are similar, despite the time complexity of our algorithm being higher. BioMed Central 2022-01-15 /pmc/articles/PMC8760837/ /pubmed/35033127 http://dx.doi.org/10.1186/s13015-022-00205-z Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Silva, Luiz Augusto G.
Kowada, Luis Antonio B.
Rocco, Noraí Romeu
Walter, Maria Emília M. T.
A new 1.375-approximation algorithm for sorting by transpositions
title A new 1.375-approximation algorithm for sorting by transpositions
title_full A new 1.375-approximation algorithm for sorting by transpositions
title_fullStr A new 1.375-approximation algorithm for sorting by transpositions
title_full_unstemmed A new 1.375-approximation algorithm for sorting by transpositions
title_short A new 1.375-approximation algorithm for sorting by transpositions
title_sort new 1.375-approximation algorithm for sorting by transpositions
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8760837/
https://www.ncbi.nlm.nih.gov/pubmed/35033127
http://dx.doi.org/10.1186/s13015-022-00205-z
work_keys_str_mv AT silvaluizaugustog anew1375approximationalgorithmforsortingbytranspositions
AT kowadaluisantoniob anew1375approximationalgorithmforsortingbytranspositions
AT rocconorairomeu anew1375approximationalgorithmforsortingbytranspositions
AT waltermariaemiliamt anew1375approximationalgorithmforsortingbytranspositions
AT silvaluizaugustog new1375approximationalgorithmforsortingbytranspositions
AT kowadaluisantoniob new1375approximationalgorithmforsortingbytranspositions
AT rocconorairomeu new1375approximationalgorithmforsortingbytranspositions
AT waltermariaemiliamt new1375approximationalgorithmforsortingbytranspositions