Cargando…
MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs
The identification of cis-regulatory modules (CRMs) can greatly advance our understanding of eukaryotic regulatory mechanism. Current methods to predict CRMs from known motifs either depend on multiple alignments or can only deal with a small number of known motifs provided by users. These methods a...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2490743/ https://www.ncbi.nlm.nih.gov/pubmed/18606616 http://dx.doi.org/10.1093/nar/gkn407 |
_version_ | 1782158147247407104 |
---|---|
author | Hu, Jianfei Hu, Haiyan Li, Xiaoman |
author_facet | Hu, Jianfei Hu, Haiyan Li, Xiaoman |
author_sort | Hu, Jianfei |
collection | PubMed |
description | The identification of cis-regulatory modules (CRMs) can greatly advance our understanding of eukaryotic regulatory mechanism. Current methods to predict CRMs from known motifs either depend on multiple alignments or can only deal with a small number of known motifs provided by users. These methods are problematic when binding sites are not well aligned in multiple alignments or when the number of input known motifs is large. We thus developed a new CRM identification method MOPAT (motif pair tree), which identifies CRMs through the identification of motif modules, groups of motifs co-ccurring in multiple CRMs. It can identify ‘orthologous’ CRMs without multiple alignments. It can also find CRMs given a large number of known motifs. We have applied this method to mouse developmental genes, and have evaluated the predicted CRMs and motif modules by microarray expression data and known interacting motif pairs. We show that the expression profiles of the genes containing CRMs of the same motif module correlate significantly better than those of a random set of genes do. We also show that the known interacting motif pairs are significantly included in our predictions. Compared with several current methods, our method shows better performance in identifying meaningful CRMs. |
format | Text |
id | pubmed-2490743 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-24907432008-08-01 MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs Hu, Jianfei Hu, Haiyan Li, Xiaoman Nucleic Acids Res Computational Biology The identification of cis-regulatory modules (CRMs) can greatly advance our understanding of eukaryotic regulatory mechanism. Current methods to predict CRMs from known motifs either depend on multiple alignments or can only deal with a small number of known motifs provided by users. These methods are problematic when binding sites are not well aligned in multiple alignments or when the number of input known motifs is large. We thus developed a new CRM identification method MOPAT (motif pair tree), which identifies CRMs through the identification of motif modules, groups of motifs co-ccurring in multiple CRMs. It can identify ‘orthologous’ CRMs without multiple alignments. It can also find CRMs given a large number of known motifs. We have applied this method to mouse developmental genes, and have evaluated the predicted CRMs and motif modules by microarray expression data and known interacting motif pairs. We show that the expression profiles of the genes containing CRMs of the same motif module correlate significantly better than those of a random set of genes do. We also show that the known interacting motif pairs are significantly included in our predictions. Compared with several current methods, our method shows better performance in identifying meaningful CRMs. Oxford University Press 2008-08 2008-07-07 /pmc/articles/PMC2490743/ /pubmed/18606616 http://dx.doi.org/10.1093/nar/gkn407 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Computational Biology Hu, Jianfei Hu, Haiyan Li, Xiaoman MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs |
title | MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs |
title_full | MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs |
title_fullStr | MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs |
title_full_unstemmed | MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs |
title_short | MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs |
title_sort | mopat: a graph-based method to predict recurrent cis-regulatory modules from known motifs |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2490743/ https://www.ncbi.nlm.nih.gov/pubmed/18606616 http://dx.doi.org/10.1093/nar/gkn407 |
work_keys_str_mv | AT hujianfei mopatagraphbasedmethodtopredictrecurrentcisregulatorymodulesfromknownmotifs AT huhaiyan mopatagraphbasedmethodtopredictrecurrentcisregulatorymodulesfromknownmotifs AT lixiaoman mopatagraphbasedmethodtopredictrecurrentcisregulatorymodulesfromknownmotifs |