Cargando…
An enhanced beam search algorithm for the Shortest Common Supersequence Problem
The Shortest Common Supersequence Problem asks to obtain a shortest string that is a supersequence of every member of a given set of strings. It has applications, among others, in data compression and oligonucleotide microarray production. The problem is NP-hard, and the existing exact solutions are...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier Ltd.
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7127599/ https://www.ncbi.nlm.nih.gov/pubmed/32288320 http://dx.doi.org/10.1016/j.engappai.2011.08.006 |
_version_ | 1783516394690707456 |
---|---|
author | Mousavi, Sayyed Rasoul Bahri, Fateme Tabataba, Farzaneh Sadat |
author_facet | Mousavi, Sayyed Rasoul Bahri, Fateme Tabataba, Farzaneh Sadat |
author_sort | Mousavi, Sayyed Rasoul |
collection | PubMed |
description | The Shortest Common Supersequence Problem asks to obtain a shortest string that is a supersequence of every member of a given set of strings. It has applications, among others, in data compression and oligonucleotide microarray production. The problem is NP-hard, and the existing exact solutions are impractical for large instances. In this paper, a new beam search algorithm is proposed for the problem, which employs a probabilistic heuristic and uses the dominance property to further prune the search space. The proposed algorithm is compared with three recent algorithms proposed for the problem on both random and biological sequences, outperforming them all by quickly providing solutions of higher average quality in all the experimental cases. The Java source and binary files of the proposed IBS_SCS algorithm and our implementation of the DR algorithm and all the random and real datasets used in this paper are freely available upon request. |
format | Online Article Text |
id | pubmed-7127599 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Elsevier Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-71275992020-04-08 An enhanced beam search algorithm for the Shortest Common Supersequence Problem Mousavi, Sayyed Rasoul Bahri, Fateme Tabataba, Farzaneh Sadat Eng Appl Artif Intell Article The Shortest Common Supersequence Problem asks to obtain a shortest string that is a supersequence of every member of a given set of strings. It has applications, among others, in data compression and oligonucleotide microarray production. The problem is NP-hard, and the existing exact solutions are impractical for large instances. In this paper, a new beam search algorithm is proposed for the problem, which employs a probabilistic heuristic and uses the dominance property to further prune the search space. The proposed algorithm is compared with three recent algorithms proposed for the problem on both random and biological sequences, outperforming them all by quickly providing solutions of higher average quality in all the experimental cases. The Java source and binary files of the proposed IBS_SCS algorithm and our implementation of the DR algorithm and all the random and real datasets used in this paper are freely available upon request. Elsevier Ltd. 2012-04 2011-09-20 /pmc/articles/PMC7127599/ /pubmed/32288320 http://dx.doi.org/10.1016/j.engappai.2011.08.006 Text en Copyright © 2011 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Mousavi, Sayyed Rasoul Bahri, Fateme Tabataba, Farzaneh Sadat An enhanced beam search algorithm for the Shortest Common Supersequence Problem |
title | An enhanced beam search algorithm for the Shortest Common Supersequence Problem |
title_full | An enhanced beam search algorithm for the Shortest Common Supersequence Problem |
title_fullStr | An enhanced beam search algorithm for the Shortest Common Supersequence Problem |
title_full_unstemmed | An enhanced beam search algorithm for the Shortest Common Supersequence Problem |
title_short | An enhanced beam search algorithm for the Shortest Common Supersequence Problem |
title_sort | enhanced beam search algorithm for the shortest common supersequence problem |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7127599/ https://www.ncbi.nlm.nih.gov/pubmed/32288320 http://dx.doi.org/10.1016/j.engappai.2011.08.006 |
work_keys_str_mv | AT mousavisayyedrasoul anenhancedbeamsearchalgorithmfortheshortestcommonsupersequenceproblem AT bahrifateme anenhancedbeamsearchalgorithmfortheshortestcommonsupersequenceproblem AT tabatabafarzanehsadat anenhancedbeamsearchalgorithmfortheshortestcommonsupersequenceproblem AT mousavisayyedrasoul enhancedbeamsearchalgorithmfortheshortestcommonsupersequenceproblem AT bahrifateme enhancedbeamsearchalgorithmfortheshortestcommonsupersequenceproblem AT tabatabafarzanehsadat enhancedbeamsearchalgorithmfortheshortestcommonsupersequenceproblem |