Cargando…

Accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes

We present a new computational method for solving a classical problem, the identification problem of cis-regulatory motifs in a given set of promoter sequences, based on one key new idea. Instead of scoring candidate motifs individually like in all the existing motif-finding programs, our method sco...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Guojun, Liu, Bingqiang, Xu, Ying
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2811016/
https://www.ncbi.nlm.nih.gov/pubmed/19906734
http://dx.doi.org/10.1093/nar/gkp907
_version_ 1782176720044949504
author Li, Guojun
Liu, Bingqiang
Xu, Ying
author_facet Li, Guojun
Liu, Bingqiang
Xu, Ying
author_sort Li, Guojun
collection PubMed
description We present a new computational method for solving a classical problem, the identification problem of cis-regulatory motifs in a given set of promoter sequences, based on one key new idea. Instead of scoring candidate motifs individually like in all the existing motif-finding programs, our method scores groups of candidate motifs with similar sequences, called motif closures, using a P-value, which has substantially improved the prediction reliability over the existing methods. Our new P-value scoring scheme is sequence length independent, hence allowing direct comparisons among predicted motifs with different lengths on the same footing. We have implemented this method as a Motif Recognition Computer (MREC) program, and have extensively tested MREC on both simulated and biological data from prokaryotic genomes. Our test results indicate that MREC can accurately pick out the actual motif with the correct length as the best scoring candidate for the vast majority of the cases in our test set. We compared our prediction results with two motif-finding programs Cosmo and MEME, and found that MREC outperforms both programs across all the test cases by a large margin. The MREC program is available at http://csbl.bmb.uga.edu/~bingqiang/MREC1/.
format Text
id pubmed-2811016
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28110162010-01-26 Accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes Li, Guojun Liu, Bingqiang Xu, Ying Nucleic Acids Res Methods Online We present a new computational method for solving a classical problem, the identification problem of cis-regulatory motifs in a given set of promoter sequences, based on one key new idea. Instead of scoring candidate motifs individually like in all the existing motif-finding programs, our method scores groups of candidate motifs with similar sequences, called motif closures, using a P-value, which has substantially improved the prediction reliability over the existing methods. Our new P-value scoring scheme is sequence length independent, hence allowing direct comparisons among predicted motifs with different lengths on the same footing. We have implemented this method as a Motif Recognition Computer (MREC) program, and have extensively tested MREC on both simulated and biological data from prokaryotic genomes. Our test results indicate that MREC can accurately pick out the actual motif with the correct length as the best scoring candidate for the vast majority of the cases in our test set. We compared our prediction results with two motif-finding programs Cosmo and MEME, and found that MREC outperforms both programs across all the test cases by a large margin. The MREC program is available at http://csbl.bmb.uga.edu/~bingqiang/MREC1/. Oxford University Press 2010-01 2009-11-11 /pmc/articles/PMC2811016/ /pubmed/19906734 http://dx.doi.org/10.1093/nar/gkp907 Text en © The Author(s) 2009. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Li, Guojun
Liu, Bingqiang
Xu, Ying
Accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes
title Accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes
title_full Accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes
title_fullStr Accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes
title_full_unstemmed Accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes
title_short Accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes
title_sort accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2811016/
https://www.ncbi.nlm.nih.gov/pubmed/19906734
http://dx.doi.org/10.1093/nar/gkp907
work_keys_str_mv AT liguojun accuraterecognitionofcisregulatorymotifswiththecorrectlengthsinprokaryoticgenomes
AT liubingqiang accuraterecognitionofcisregulatorymotifswiththecorrectlengthsinprokaryoticgenomes
AT xuying accuraterecognitionofcisregulatorymotifswiththecorrectlengthsinprokaryoticgenomes