Cargando…

PMS5: an efficient exact algorithm for the (ℓ, d)-motif finding problem

BACKGROUND: Motifs are patterns found in biological sequences that are vital for understanding gene function, human disease, drug design, etc. They are helpful in finding transcriptional regulatory elements, transcription factor binding sites, and so on. As a result, the problem of identifying motif...

Descripción completa

Detalles Bibliográficos
Autores principales: Dinh, Hieu, Rajasekaran, Sanguthevar, Kundeti, Vamsi K
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3269969/
https://www.ncbi.nlm.nih.gov/pubmed/22024209
http://dx.doi.org/10.1186/1471-2105-12-410
_version_ 1782222524928491520
author Dinh, Hieu
Rajasekaran, Sanguthevar
Kundeti, Vamsi K
author_facet Dinh, Hieu
Rajasekaran, Sanguthevar
Kundeti, Vamsi K
author_sort Dinh, Hieu
collection PubMed
description BACKGROUND: Motifs are patterns found in biological sequences that are vital for understanding gene function, human disease, drug design, etc. They are helpful in finding transcriptional regulatory elements, transcription factor binding sites, and so on. As a result, the problem of identifying motifs is very crucial in biology. RESULTS: Many facets of the motif search problem have been identified in the literature. One of them is (ℓ, d)-motif search (or Planted Motif Search (PMS)). The PMS problem has been well investigated and shown to be NP-hard. Any algorithm for PMS that always finds all the (ℓ, d)-motifs on a given input set is called an exact algorithm. In this paper we focus on exact algorithms only. All the known exact algorithms for PMS take exponential time in some of the underlying parameters in the worst case scenario. But it does not mean that we cannot design exact algorithms for solving practical instances within a reasonable amount of time. In this paper, we propose a fast algorithm that can solve the well-known challenging instances of PMS: (21, 8) and (23, 9). No prior exact algorithm could solve these instances. In particular, our proposed algorithm takes about 10 hours on the challenging instance (21, 8) and about 54 hours on the challenging instance (23, 9). The algorithm has been run on a single 2.4GHz PC with 3GB RAM. The implementation of PMS5 is freely available on the web at http://www.pms.engr.uconn.edu/downloads/PMS5.zip. CONCLUSIONS: We present an efficient algorithm PMS5 that uses some novel ideas and combines them with well-known algorithm PMS1 and PMSPrune. PMS5 can tackle the large challenging instances (21, 8) and (23, 9). Therefore, we hope that PMS5 will help biologists discover longer motifs in the futures.
format Online
Article
Text
id pubmed-3269969
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32699692012-02-13 PMS5: an efficient exact algorithm for the (ℓ, d)-motif finding problem Dinh, Hieu Rajasekaran, Sanguthevar Kundeti, Vamsi K BMC Bioinformatics Research Article BACKGROUND: Motifs are patterns found in biological sequences that are vital for understanding gene function, human disease, drug design, etc. They are helpful in finding transcriptional regulatory elements, transcription factor binding sites, and so on. As a result, the problem of identifying motifs is very crucial in biology. RESULTS: Many facets of the motif search problem have been identified in the literature. One of them is (ℓ, d)-motif search (or Planted Motif Search (PMS)). The PMS problem has been well investigated and shown to be NP-hard. Any algorithm for PMS that always finds all the (ℓ, d)-motifs on a given input set is called an exact algorithm. In this paper we focus on exact algorithms only. All the known exact algorithms for PMS take exponential time in some of the underlying parameters in the worst case scenario. But it does not mean that we cannot design exact algorithms for solving practical instances within a reasonable amount of time. In this paper, we propose a fast algorithm that can solve the well-known challenging instances of PMS: (21, 8) and (23, 9). No prior exact algorithm could solve these instances. In particular, our proposed algorithm takes about 10 hours on the challenging instance (21, 8) and about 54 hours on the challenging instance (23, 9). The algorithm has been run on a single 2.4GHz PC with 3GB RAM. The implementation of PMS5 is freely available on the web at http://www.pms.engr.uconn.edu/downloads/PMS5.zip. CONCLUSIONS: We present an efficient algorithm PMS5 that uses some novel ideas and combines them with well-known algorithm PMS1 and PMSPrune. PMS5 can tackle the large challenging instances (21, 8) and (23, 9). Therefore, we hope that PMS5 will help biologists discover longer motifs in the futures. BioMed Central 2011-10-24 /pmc/articles/PMC3269969/ /pubmed/22024209 http://dx.doi.org/10.1186/1471-2105-12-410 Text en Copyright ©2011 Dinh et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Dinh, Hieu
Rajasekaran, Sanguthevar
Kundeti, Vamsi K
PMS5: an efficient exact algorithm for the (ℓ, d)-motif finding problem
title PMS5: an efficient exact algorithm for the (ℓ, d)-motif finding problem
title_full PMS5: an efficient exact algorithm for the (ℓ, d)-motif finding problem
title_fullStr PMS5: an efficient exact algorithm for the (ℓ, d)-motif finding problem
title_full_unstemmed PMS5: an efficient exact algorithm for the (ℓ, d)-motif finding problem
title_short PMS5: an efficient exact algorithm for the (ℓ, d)-motif finding problem
title_sort pms5: an efficient exact algorithm for the (ℓ, d)-motif finding problem
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3269969/
https://www.ncbi.nlm.nih.gov/pubmed/22024209
http://dx.doi.org/10.1186/1471-2105-12-410
work_keys_str_mv AT dinhhieu pms5anefficientexactalgorithmfortheldmotiffindingproblem
AT rajasekaransanguthevar pms5anefficientexactalgorithmfortheldmotiffindingproblem
AT kundetivamsik pms5anefficientexactalgorithmfortheldmotiffindingproblem