Cargando…
DRIMust: a web server for discovering rank imbalanced motifs using suffix trees
Cellular regulation mechanisms that involve proteins and other active molecules interacting with specific targets often involve the recognition of sequence patterns. Short sequence elements on DNA, RNA and proteins play a central role in mediating such molecular recognition events. Studies that focu...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692051/ https://www.ncbi.nlm.nih.gov/pubmed/23685432 http://dx.doi.org/10.1093/nar/gkt407 |
_version_ | 1782274558288461824 |
---|---|
author | Leibovich, Limor Paz, Inbal Yakhini, Zohar Mandel-Gutfreund, Yael |
author_facet | Leibovich, Limor Paz, Inbal Yakhini, Zohar Mandel-Gutfreund, Yael |
author_sort | Leibovich, Limor |
collection | PubMed |
description | Cellular regulation mechanisms that involve proteins and other active molecules interacting with specific targets often involve the recognition of sequence patterns. Short sequence elements on DNA, RNA and proteins play a central role in mediating such molecular recognition events. Studies that focus on measuring and investigating sequence-based recognition processes make use of statistical and computational tools that support the identification and understanding of sequence motifs. We present a new web application, named DRIMust, freely accessible through the website http://drimust.technion.ac.il for de novo motif discovery services. The DRIMust algorithm is based on the minimum hypergeometric statistical framework and uses suffix trees for an efficient enumeration of motif candidates. DRIMust takes as input ranked lists of sequences in FASTA format and returns motifs that are over-represented at the top of the list, where the determination of the threshold that defines top is data driven. The resulting motifs are presented individually with an accurate P-value indication and as a Position Specific Scoring Matrix. Comparing DRIMust with other state-of-the-art tools demonstrated significant advantage to DRIMust, both in result accuracy and in short running times. Overall, DRIMust is unique in combining efficient search on large ranked lists with rigorous P-value assessment for the detected motifs. |
format | Online Article Text |
id | pubmed-3692051 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-36920512013-06-25 DRIMust: a web server for discovering rank imbalanced motifs using suffix trees Leibovich, Limor Paz, Inbal Yakhini, Zohar Mandel-Gutfreund, Yael Nucleic Acids Res Articles Cellular regulation mechanisms that involve proteins and other active molecules interacting with specific targets often involve the recognition of sequence patterns. Short sequence elements on DNA, RNA and proteins play a central role in mediating such molecular recognition events. Studies that focus on measuring and investigating sequence-based recognition processes make use of statistical and computational tools that support the identification and understanding of sequence motifs. We present a new web application, named DRIMust, freely accessible through the website http://drimust.technion.ac.il for de novo motif discovery services. The DRIMust algorithm is based on the minimum hypergeometric statistical framework and uses suffix trees for an efficient enumeration of motif candidates. DRIMust takes as input ranked lists of sequences in FASTA format and returns motifs that are over-represented at the top of the list, where the determination of the threshold that defines top is data driven. The resulting motifs are presented individually with an accurate P-value indication and as a Position Specific Scoring Matrix. Comparing DRIMust with other state-of-the-art tools demonstrated significant advantage to DRIMust, both in result accuracy and in short running times. Overall, DRIMust is unique in combining efficient search on large ranked lists with rigorous P-value assessment for the detected motifs. Oxford University Press 2013-07 2013-05-17 /pmc/articles/PMC3692051/ /pubmed/23685432 http://dx.doi.org/10.1093/nar/gkt407 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Articles Leibovich, Limor Paz, Inbal Yakhini, Zohar Mandel-Gutfreund, Yael DRIMust: a web server for discovering rank imbalanced motifs using suffix trees |
title | DRIMust: a web server for discovering rank imbalanced motifs using suffix trees |
title_full | DRIMust: a web server for discovering rank imbalanced motifs using suffix trees |
title_fullStr | DRIMust: a web server for discovering rank imbalanced motifs using suffix trees |
title_full_unstemmed | DRIMust: a web server for discovering rank imbalanced motifs using suffix trees |
title_short | DRIMust: a web server for discovering rank imbalanced motifs using suffix trees |
title_sort | drimust: a web server for discovering rank imbalanced motifs using suffix trees |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692051/ https://www.ncbi.nlm.nih.gov/pubmed/23685432 http://dx.doi.org/10.1093/nar/gkt407 |
work_keys_str_mv | AT leibovichlimor drimustawebserverfordiscoveringrankimbalancedmotifsusingsuffixtrees AT pazinbal drimustawebserverfordiscoveringrankimbalancedmotifsusingsuffixtrees AT yakhinizohar drimustawebserverfordiscoveringrankimbalancedmotifsusingsuffixtrees AT mandelgutfreundyael drimustawebserverfordiscoveringrankimbalancedmotifsusingsuffixtrees |