Cargando…

An enhanced beam search algorithm for the Shortest Common Supersequence Problem

The Shortest Common Supersequence Problem asks to obtain a shortest string that is a supersequence of every member of a given set of strings. It has applications, among others, in data compression and oligonucleotide microarray production. The problem is NP-hard, and the existing exact solutions are...

Descripción completa

Detalles Bibliográficos
Autores principales: Mousavi, Sayyed Rasoul, Bahri, Fateme, Tabataba, Farzaneh Sadat
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Ltd. 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7127599/
https://www.ncbi.nlm.nih.gov/pubmed/32288320
http://dx.doi.org/10.1016/j.engappai.2011.08.006
_version_ 1783516394690707456
author Mousavi, Sayyed Rasoul
Bahri, Fateme
Tabataba, Farzaneh Sadat
author_facet Mousavi, Sayyed Rasoul
Bahri, Fateme
Tabataba, Farzaneh Sadat
author_sort Mousavi, Sayyed Rasoul
collection PubMed
description The Shortest Common Supersequence Problem asks to obtain a shortest string that is a supersequence of every member of a given set of strings. It has applications, among others, in data compression and oligonucleotide microarray production. The problem is NP-hard, and the existing exact solutions are impractical for large instances. In this paper, a new beam search algorithm is proposed for the problem, which employs a probabilistic heuristic and uses the dominance property to further prune the search space. The proposed algorithm is compared with three recent algorithms proposed for the problem on both random and biological sequences, outperforming them all by quickly providing solutions of higher average quality in all the experimental cases. The Java source and binary files of the proposed IBS_SCS algorithm and our implementation of the DR algorithm and all the random and real datasets used in this paper are freely available upon request.
format Online
Article
Text
id pubmed-7127599
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-71275992020-04-08 An enhanced beam search algorithm for the Shortest Common Supersequence Problem Mousavi, Sayyed Rasoul Bahri, Fateme Tabataba, Farzaneh Sadat Eng Appl Artif Intell Article The Shortest Common Supersequence Problem asks to obtain a shortest string that is a supersequence of every member of a given set of strings. It has applications, among others, in data compression and oligonucleotide microarray production. The problem is NP-hard, and the existing exact solutions are impractical for large instances. In this paper, a new beam search algorithm is proposed for the problem, which employs a probabilistic heuristic and uses the dominance property to further prune the search space. The proposed algorithm is compared with three recent algorithms proposed for the problem on both random and biological sequences, outperforming them all by quickly providing solutions of higher average quality in all the experimental cases. The Java source and binary files of the proposed IBS_SCS algorithm and our implementation of the DR algorithm and all the random and real datasets used in this paper are freely available upon request. Elsevier Ltd. 2012-04 2011-09-20 /pmc/articles/PMC7127599/ /pubmed/32288320 http://dx.doi.org/10.1016/j.engappai.2011.08.006 Text en Copyright © 2011 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Mousavi, Sayyed Rasoul
Bahri, Fateme
Tabataba, Farzaneh Sadat
An enhanced beam search algorithm for the Shortest Common Supersequence Problem
title An enhanced beam search algorithm for the Shortest Common Supersequence Problem
title_full An enhanced beam search algorithm for the Shortest Common Supersequence Problem
title_fullStr An enhanced beam search algorithm for the Shortest Common Supersequence Problem
title_full_unstemmed An enhanced beam search algorithm for the Shortest Common Supersequence Problem
title_short An enhanced beam search algorithm for the Shortest Common Supersequence Problem
title_sort enhanced beam search algorithm for the shortest common supersequence problem
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7127599/
https://www.ncbi.nlm.nih.gov/pubmed/32288320
http://dx.doi.org/10.1016/j.engappai.2011.08.006
work_keys_str_mv AT mousavisayyedrasoul anenhancedbeamsearchalgorithmfortheshortestcommonsupersequenceproblem
AT bahrifateme anenhancedbeamsearchalgorithmfortheshortestcommonsupersequenceproblem
AT tabatabafarzanehsadat anenhancedbeamsearchalgorithmfortheshortestcommonsupersequenceproblem
AT mousavisayyedrasoul enhancedbeamsearchalgorithmfortheshortestcommonsupersequenceproblem
AT bahrifateme enhancedbeamsearchalgorithmfortheshortestcommonsupersequenceproblem
AT tabatabafarzanehsadat enhancedbeamsearchalgorithmfortheshortestcommonsupersequenceproblem