Cargando…

DRIMust: a web server for discovering rank imbalanced motifs using suffix trees

Cellular regulation mechanisms that involve proteins and other active molecules interacting with specific targets often involve the recognition of sequence patterns. Short sequence elements on DNA, RNA and proteins play a central role in mediating such molecular recognition events. Studies that focu...

Descripción completa

Detalles Bibliográficos
Autores principales: Leibovich, Limor, Paz, Inbal, Yakhini, Zohar, Mandel-Gutfreund, Yael
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692051/
https://www.ncbi.nlm.nih.gov/pubmed/23685432
http://dx.doi.org/10.1093/nar/gkt407
_version_ 1782274558288461824
author Leibovich, Limor
Paz, Inbal
Yakhini, Zohar
Mandel-Gutfreund, Yael
author_facet Leibovich, Limor
Paz, Inbal
Yakhini, Zohar
Mandel-Gutfreund, Yael
author_sort Leibovich, Limor
collection PubMed
description Cellular regulation mechanisms that involve proteins and other active molecules interacting with specific targets often involve the recognition of sequence patterns. Short sequence elements on DNA, RNA and proteins play a central role in mediating such molecular recognition events. Studies that focus on measuring and investigating sequence-based recognition processes make use of statistical and computational tools that support the identification and understanding of sequence motifs. We present a new web application, named DRIMust, freely accessible through the website http://drimust.technion.ac.il for de novo motif discovery services. The DRIMust algorithm is based on the minimum hypergeometric statistical framework and uses suffix trees for an efficient enumeration of motif candidates. DRIMust takes as input ranked lists of sequences in FASTA format and returns motifs that are over-represented at the top of the list, where the determination of the threshold that defines top is data driven. The resulting motifs are presented individually with an accurate P-value indication and as a Position Specific Scoring Matrix. Comparing DRIMust with other state-of-the-art tools demonstrated significant advantage to DRIMust, both in result accuracy and in short running times. Overall, DRIMust is unique in combining efficient search on large ranked lists with rigorous P-value assessment for the detected motifs.
format Online
Article
Text
id pubmed-3692051
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-36920512013-06-25 DRIMust: a web server for discovering rank imbalanced motifs using suffix trees Leibovich, Limor Paz, Inbal Yakhini, Zohar Mandel-Gutfreund, Yael Nucleic Acids Res Articles Cellular regulation mechanisms that involve proteins and other active molecules interacting with specific targets often involve the recognition of sequence patterns. Short sequence elements on DNA, RNA and proteins play a central role in mediating such molecular recognition events. Studies that focus on measuring and investigating sequence-based recognition processes make use of statistical and computational tools that support the identification and understanding of sequence motifs. We present a new web application, named DRIMust, freely accessible through the website http://drimust.technion.ac.il for de novo motif discovery services. The DRIMust algorithm is based on the minimum hypergeometric statistical framework and uses suffix trees for an efficient enumeration of motif candidates. DRIMust takes as input ranked lists of sequences in FASTA format and returns motifs that are over-represented at the top of the list, where the determination of the threshold that defines top is data driven. The resulting motifs are presented individually with an accurate P-value indication and as a Position Specific Scoring Matrix. Comparing DRIMust with other state-of-the-art tools demonstrated significant advantage to DRIMust, both in result accuracy and in short running times. Overall, DRIMust is unique in combining efficient search on large ranked lists with rigorous P-value assessment for the detected motifs. Oxford University Press 2013-07 2013-05-17 /pmc/articles/PMC3692051/ /pubmed/23685432 http://dx.doi.org/10.1093/nar/gkt407 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Leibovich, Limor
Paz, Inbal
Yakhini, Zohar
Mandel-Gutfreund, Yael
DRIMust: a web server for discovering rank imbalanced motifs using suffix trees
title DRIMust: a web server for discovering rank imbalanced motifs using suffix trees
title_full DRIMust: a web server for discovering rank imbalanced motifs using suffix trees
title_fullStr DRIMust: a web server for discovering rank imbalanced motifs using suffix trees
title_full_unstemmed DRIMust: a web server for discovering rank imbalanced motifs using suffix trees
title_short DRIMust: a web server for discovering rank imbalanced motifs using suffix trees
title_sort drimust: a web server for discovering rank imbalanced motifs using suffix trees
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692051/
https://www.ncbi.nlm.nih.gov/pubmed/23685432
http://dx.doi.org/10.1093/nar/gkt407
work_keys_str_mv AT leibovichlimor drimustawebserverfordiscoveringrankimbalancedmotifsusingsuffixtrees
AT pazinbal drimustawebserverfordiscoveringrankimbalancedmotifsusingsuffixtrees
AT yakhinizohar drimustawebserverfordiscoveringrankimbalancedmotifsusingsuffixtrees
AT mandelgutfreundyael drimustawebserverfordiscoveringrankimbalancedmotifsusingsuffixtrees