Cargando…
PMS5: an efficient exact algorithm for the (ℓ, d)-motif finding problem
BACKGROUND: Motifs are patterns found in biological sequences that are vital for understanding gene function, human disease, drug design, etc. They are helpful in finding transcriptional regulatory elements, transcription factor binding sites, and so on. As a result, the problem of identifying motif...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3269969/ https://www.ncbi.nlm.nih.gov/pubmed/22024209 http://dx.doi.org/10.1186/1471-2105-12-410 |
_version_ | 1782222524928491520 |
---|---|
author | Dinh, Hieu Rajasekaran, Sanguthevar Kundeti, Vamsi K |
author_facet | Dinh, Hieu Rajasekaran, Sanguthevar Kundeti, Vamsi K |
author_sort | Dinh, Hieu |
collection | PubMed |
description | BACKGROUND: Motifs are patterns found in biological sequences that are vital for understanding gene function, human disease, drug design, etc. They are helpful in finding transcriptional regulatory elements, transcription factor binding sites, and so on. As a result, the problem of identifying motifs is very crucial in biology. RESULTS: Many facets of the motif search problem have been identified in the literature. One of them is (ℓ, d)-motif search (or Planted Motif Search (PMS)). The PMS problem has been well investigated and shown to be NP-hard. Any algorithm for PMS that always finds all the (ℓ, d)-motifs on a given input set is called an exact algorithm. In this paper we focus on exact algorithms only. All the known exact algorithms for PMS take exponential time in some of the underlying parameters in the worst case scenario. But it does not mean that we cannot design exact algorithms for solving practical instances within a reasonable amount of time. In this paper, we propose a fast algorithm that can solve the well-known challenging instances of PMS: (21, 8) and (23, 9). No prior exact algorithm could solve these instances. In particular, our proposed algorithm takes about 10 hours on the challenging instance (21, 8) and about 54 hours on the challenging instance (23, 9). The algorithm has been run on a single 2.4GHz PC with 3GB RAM. The implementation of PMS5 is freely available on the web at http://www.pms.engr.uconn.edu/downloads/PMS5.zip. CONCLUSIONS: We present an efficient algorithm PMS5 that uses some novel ideas and combines them with well-known algorithm PMS1 and PMSPrune. PMS5 can tackle the large challenging instances (21, 8) and (23, 9). Therefore, we hope that PMS5 will help biologists discover longer motifs in the futures. |
format | Online Article Text |
id | pubmed-3269969 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32699692012-02-13 PMS5: an efficient exact algorithm for the (ℓ, d)-motif finding problem Dinh, Hieu Rajasekaran, Sanguthevar Kundeti, Vamsi K BMC Bioinformatics Research Article BACKGROUND: Motifs are patterns found in biological sequences that are vital for understanding gene function, human disease, drug design, etc. They are helpful in finding transcriptional regulatory elements, transcription factor binding sites, and so on. As a result, the problem of identifying motifs is very crucial in biology. RESULTS: Many facets of the motif search problem have been identified in the literature. One of them is (ℓ, d)-motif search (or Planted Motif Search (PMS)). The PMS problem has been well investigated and shown to be NP-hard. Any algorithm for PMS that always finds all the (ℓ, d)-motifs on a given input set is called an exact algorithm. In this paper we focus on exact algorithms only. All the known exact algorithms for PMS take exponential time in some of the underlying parameters in the worst case scenario. But it does not mean that we cannot design exact algorithms for solving practical instances within a reasonable amount of time. In this paper, we propose a fast algorithm that can solve the well-known challenging instances of PMS: (21, 8) and (23, 9). No prior exact algorithm could solve these instances. In particular, our proposed algorithm takes about 10 hours on the challenging instance (21, 8) and about 54 hours on the challenging instance (23, 9). The algorithm has been run on a single 2.4GHz PC with 3GB RAM. The implementation of PMS5 is freely available on the web at http://www.pms.engr.uconn.edu/downloads/PMS5.zip. CONCLUSIONS: We present an efficient algorithm PMS5 that uses some novel ideas and combines them with well-known algorithm PMS1 and PMSPrune. PMS5 can tackle the large challenging instances (21, 8) and (23, 9). Therefore, we hope that PMS5 will help biologists discover longer motifs in the futures. BioMed Central 2011-10-24 /pmc/articles/PMC3269969/ /pubmed/22024209 http://dx.doi.org/10.1186/1471-2105-12-410 Text en Copyright ©2011 Dinh et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Dinh, Hieu Rajasekaran, Sanguthevar Kundeti, Vamsi K PMS5: an efficient exact algorithm for the (ℓ, d)-motif finding problem |
title | PMS5: an efficient exact algorithm for the (ℓ, d)-motif finding problem |
title_full | PMS5: an efficient exact algorithm for the (ℓ, d)-motif finding problem |
title_fullStr | PMS5: an efficient exact algorithm for the (ℓ, d)-motif finding problem |
title_full_unstemmed | PMS5: an efficient exact algorithm for the (ℓ, d)-motif finding problem |
title_short | PMS5: an efficient exact algorithm for the (ℓ, d)-motif finding problem |
title_sort | pms5: an efficient exact algorithm for the (ℓ, d)-motif finding problem |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3269969/ https://www.ncbi.nlm.nih.gov/pubmed/22024209 http://dx.doi.org/10.1186/1471-2105-12-410 |
work_keys_str_mv | AT dinhhieu pms5anefficientexactalgorithmfortheldmotiffindingproblem AT rajasekaransanguthevar pms5anefficientexactalgorithmfortheldmotiffindingproblem AT kundetivamsik pms5anefficientexactalgorithmfortheldmotiffindingproblem |