
SMOTIF: efficient structured pattern and profile motif search

BACKGROUND: A structured motif allows variable length gaps between several components, where each component is a simple motif, which allows either no gaps or only fixed length gaps. The motif can either be represented as a pattern or a profile (also called positional weight matrix). We propose an ef...


Bibliographic Details
Main Authors: Zhang, Yongqiang, Zaki, Mohammed J
Format: Text
Language: English
Published: BioMed Central 2006
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1679804/
https://www.ncbi.nlm.nih.gov/pubmed/17118189
http://dx.doi.org/10.1186/1748-7188-1-22
_version_ 1782131142195937280
author Zhang, Yongqiang
Zaki, Mohammed J
author_facet Zhang, Yongqiang
Zaki, Mohammed J
author_sort Zhang, Yongqiang
collection PubMed
description BACKGROUND: A structured motif allows variable length gaps between several components, where each component is a simple motif, which allows either no gaps or only fixed length gaps. The motif can either be represented as a pattern or a profile (also called positional weight matrix). We propose an efficient algorithm, called SMOTIF, to solve the structured motif search problem, i.e., given one or more sequences and a structured motif, SMOTIF searches the sequences for all occurrences of the motif. Potential applications include searching for long terminal repeat (LTR) retrotransposons and composite regulatory binding sites in DNA sequences. RESULTS: SMOTIF can search for both pattern and profile motifs, and it is efficient in terms of both time and space; it outperforms SMARTFINDER, a state-of-the-art algorithm for structured motif search. Experimental results show that SMOTIF is about 7 times faster and consumes 100 times less memory than SMARTFINDER. It can effectively search for LTR retrotransposons and is well suited to searching for motifs with long range gaps. It is also successful in finding potential composite transcription factor binding sites. CONCLUSION: SMOTIF is a useful and efficient tool in searching for structured pattern and profile motifs. The algorithm is available as open-source at: .
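The search problem described in the abstract can be illustrated with a small sketch. This is a naive, brute-force illustration of what a structured pattern motif match means, not the SMOTIF algorithm itself. The function names, the use of `N` as a wildcard base, and the convention that a gap counts the characters between the end of one component and the start of the next are all assumptions made for this example.

```python
import re
from typing import List, Tuple


def find_component(seq: str, component: str) -> List[int]:
    """Return every start position where a simple motif occurs.

    Assumption: 'N' in the component matches any single character,
    which models a fixed-length gap inside a simple motif.
    """
    # A zero-width lookahead lets finditer report overlapping matches.
    regex = re.compile("(?=(" + component.replace("N", ".") + "))")
    return [m.start() for m in regex.finditer(seq)]


def search_structured(
    seq: str,
    components: List[str],
    gaps: List[Tuple[int, int]],
) -> List[Tuple[int, ...]]:
    """Find all occurrences of a structured pattern motif.

    components: simple motifs, in order.
    gaps: (min, max) gap length allowed between consecutive components.
    Returns one tuple of component start positions per occurrence.
    """
    assert len(gaps) == len(components) - 1
    positions = [set(find_component(seq, c)) for c in components]
    results: List[Tuple[int, ...]] = []

    def extend(i: int, starts: List[int]) -> None:
        if i == len(components):
            results.append(tuple(starts))
            return
        prev_end = starts[-1] + len(components[i - 1])
        lo, hi = gaps[i - 1]
        # Try every start position allowed by the variable-length gap.
        for start in range(prev_end + lo, prev_end + hi + 1):
            if start in positions[i]:
                extend(i + 1, starts + [start])

    for first in sorted(positions[0]):
        extend(1, [first])
    return results


if __name__ == "__main__":
    dna = "ACGTTGCATTTTACGGCA"
    # Two components separated by a gap of 1 to 4 bases.
    print(search_structured(dna, ["ACG", "GCA"], [(1, 4)]))
```

Enumerating every admissible gap length like this is simple to follow but does no pruning or indexing, so it says nothing about the time and memory behaviour reported in the abstract.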
format Text
id pubmed-1679804
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-1679804 2006-12-05 SMOTIF: efficient structured pattern and profile motif search Zhang, Yongqiang Zaki, Mohammed J Algorithms Mol Biol Research BACKGROUND: A structured motif allows variable length gaps between several components, where each component is a simple motif, which allows either no gaps or only fixed length gaps. The motif can either be represented as a pattern or a profile (also called positional weight matrix). We propose an efficient algorithm, called SMOTIF, to solve the structured motif search problem, i.e., given one or more sequences and a structured motif, SMOTIF searches the sequences for all occurrences of the motif. Potential applications include searching for long terminal repeat (LTR) retrotransposons and composite regulatory binding sites in DNA sequences. RESULTS: SMOTIF can search for both pattern and profile motifs, and it is efficient in terms of both time and space; it outperforms SMARTFINDER, a state-of-the-art algorithm for structured motif search. Experimental results show that SMOTIF is about 7 times faster and consumes 100 times less memory than SMARTFINDER. It can effectively search for LTR retrotransposons and is well suited to searching for motifs with long range gaps. It is also successful in finding potential composite transcription factor binding sites. CONCLUSION: SMOTIF is a useful and efficient tool in searching for structured pattern and profile motifs. The algorithm is available as open-source at: . BioMed Central 2006-11-21 /pmc/articles/PMC1679804/ /pubmed/17118189 http://dx.doi.org/10.1186/1748-7188-1-22 Text en Copyright © 2006 Zhang and Zaki; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Zhang, Yongqiang
Zaki, Mohammed J
SMOTIF: efficient structured pattern and profile motif search
title SMOTIF: efficient structured pattern and profile motif search
title_full SMOTIF: efficient structured pattern and profile motif search
title_fullStr SMOTIF: efficient structured pattern and profile motif search
title_full_unstemmed SMOTIF: efficient structured pattern and profile motif search
title_short SMOTIF: efficient structured pattern and profile motif search
title_sort smotif: efficient structured pattern and profile motif search
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1679804/
https://www.ncbi.nlm.nih.gov/pubmed/17118189
http://dx.doi.org/10.1186/1748-7188-1-22
work_keys_str_mv AT zhangyongqiang smotifefficientstructuredpatternandprofilemotifsearch
AT zakimohammedj smotifefficientstructuredpatternandprofilemotifsearch