Cargando…
Accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes
We present a new computational method for solving a classical problem, the identification problem of cis-regulatory motifs in a given set of promoter sequences, based on one key new idea. Instead of scoring candidate motifs individually like in all the existing motif-finding programs, our method sco...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2811016/ https://www.ncbi.nlm.nih.gov/pubmed/19906734 http://dx.doi.org/10.1093/nar/gkp907 |
_version_ | 1782176720044949504 |
---|---|
author | Li, Guojun Liu, Bingqiang Xu, Ying |
author_facet | Li, Guojun Liu, Bingqiang Xu, Ying |
author_sort | Li, Guojun |
collection | PubMed |
description | We present a new computational method for solving a classical problem, the identification problem of cis-regulatory motifs in a given set of promoter sequences, based on one key new idea. Instead of scoring candidate motifs individually like in all the existing motif-finding programs, our method scores groups of candidate motifs with similar sequences, called motif closures, using a P-value, which has substantially improved the prediction reliability over the existing methods. Our new P-value scoring scheme is sequence length independent, hence allowing direct comparisons among predicted motifs with different lengths on the same footing. We have implemented this method as a Motif Recognition Computer (MREC) program, and have extensively tested MREC on both simulated and biological data from prokaryotic genomes. Our test results indicate that MREC can accurately pick out the actual motif with the correct length as the best scoring candidate for the vast majority of the cases in our test set. We compared our prediction results with two motif-finding programs Cosmo and MEME, and found that MREC outperforms both programs across all the test cases by a large margin. The MREC program is available at http://csbl.bmb.uga.edu/~bingqiang/MREC1/. |
format | Text |
id | pubmed-2811016 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-28110162010-01-26 Accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes Li, Guojun Liu, Bingqiang Xu, Ying Nucleic Acids Res Methods Online We present a new computational method for solving a classical problem, the identification problem of cis-regulatory motifs in a given set of promoter sequences, based on one key new idea. Instead of scoring candidate motifs individually like in all the existing motif-finding programs, our method scores groups of candidate motifs with similar sequences, called motif closures, using a P-value, which has substantially improved the prediction reliability over the existing methods. Our new P-value scoring scheme is sequence length independent, hence allowing direct comparisons among predicted motifs with different lengths on the same footing. We have implemented this method as a Motif Recognition Computer (MREC) program, and have extensively tested MREC on both simulated and biological data from prokaryotic genomes. Our test results indicate that MREC can accurately pick out the actual motif with the correct length as the best scoring candidate for the vast majority of the cases in our test set. We compared our prediction results with two motif-finding programs Cosmo and MEME, and found that MREC outperforms both programs across all the test cases by a large margin. The MREC program is available at http://csbl.bmb.uga.edu/~bingqiang/MREC1/. Oxford University Press 2010-01 2009-11-11 /pmc/articles/PMC2811016/ /pubmed/19906734 http://dx.doi.org/10.1093/nar/gkp907 Text en © The Author(s) 2009. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Li, Guojun Liu, Bingqiang Xu, Ying Accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes |
title | Accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes |
title_full | Accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes |
title_fullStr | Accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes |
title_full_unstemmed | Accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes |
title_short | Accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes |
title_sort | accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2811016/ https://www.ncbi.nlm.nih.gov/pubmed/19906734 http://dx.doi.org/10.1093/nar/gkp907 |
work_keys_str_mv | AT liguojun accuraterecognitionofcisregulatorymotifswiththecorrectlengthsinprokaryoticgenomes AT liubingqiang accuraterecognitionofcisregulatorymotifswiththecorrectlengthsinprokaryoticgenomes AT xuying accuraterecognitionofcisregulatorymotifswiththecorrectlengthsinprokaryoticgenomes |