Cargando…
A mutation degree model for the identification of transcriptional regulatory elements
BACKGROUND: Current approaches for identifying transcriptional regulatory elements are mainly via the combination of two properties, the evolutionary conservation and the overrepresentation of functional elements in the promoters of co-regulated genes. Despite the development of many motif detection...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3228546/ https://www.ncbi.nlm.nih.gov/pubmed/21708002 http://dx.doi.org/10.1186/1471-2105-12-262 |
_version_ | 1782217830715883520 |
---|---|
author | Zhang, Changqing Wang, Jin Hua, Xu Fang, Jinggui Zhu, Huaiqiu Gao, Xiang |
author_facet | Zhang, Changqing Wang, Jin Hua, Xu Fang, Jinggui Zhu, Huaiqiu Gao, Xiang |
author_sort | Zhang, Changqing |
collection | PubMed |
description | BACKGROUND: Current approaches for identifying transcriptional regulatory elements are mainly via the combination of two properties, the evolutionary conservation and the overrepresentation of functional elements in the promoters of co-regulated genes. Despite the development of many motif detection algorithms, the discovery of conserved motifs in a wide range of phylogenetically related promoters is still a challenge, especially for the short motifs embedded in distantly related gene promoters or very closely related promoters, or in the situation that there are not enough orthologous genes available. RESULTS: A mutation degree model is proposed and a new word counting method is developed for the identification of transcriptional regulatory elements from a set of co-expressed genes. The new method comprises two parts: 1) identifying overrepresented oligo-nucleotides in promoters of co-expressed genes, 2) estimating the conservation of the oligo-nucleotides in promoters of phylogenetically related genes by the mutation degree model. Compared with the performance of other algorithms, our method shows the advantages of low false positive rate and higher specificity, especially the robustness to noisy data. Applying the method to co-expressed gene sets from Arabidopsis, most of known cis-elements were successfully detected. The tool and example are available at http://mcube.nju.edu.cn/jwang/lab/soft/ocw/OCW.html. CONCLUSIONS: The mutation degree model proposed in this paper is adapted to phylogenetic data of different qualities, and to a wide range of evolutionary distances. The new word-counting method based on this model has the advantage of better performance in detecting short sequence of cis-elements from co-expressed genes of eukaryotes and is robust to less complete phylogenetic data. |
format | Online Article Text |
id | pubmed-3228546 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32285462011-12-02 A mutation degree model for the identification of transcriptional regulatory elements Zhang, Changqing Wang, Jin Hua, Xu Fang, Jinggui Zhu, Huaiqiu Gao, Xiang BMC Bioinformatics Methodology Article BACKGROUND: Current approaches for identifying transcriptional regulatory elements are mainly via the combination of two properties, the evolutionary conservation and the overrepresentation of functional elements in the promoters of co-regulated genes. Despite the development of many motif detection algorithms, the discovery of conserved motifs in a wide range of phylogenetically related promoters is still a challenge, especially for the short motifs embedded in distantly related gene promoters or very closely related promoters, or in the situation that there are not enough orthologous genes available. RESULTS: A mutation degree model is proposed and a new word counting method is developed for the identification of transcriptional regulatory elements from a set of co-expressed genes. The new method comprises two parts: 1) identifying overrepresented oligo-nucleotides in promoters of co-expressed genes, 2) estimating the conservation of the oligo-nucleotides in promoters of phylogenetically related genes by the mutation degree model. Compared with the performance of other algorithms, our method shows the advantages of low false positive rate and higher specificity, especially the robustness to noisy data. Applying the method to co-expressed gene sets from Arabidopsis, most of known cis-elements were successfully detected. The tool and example are available at http://mcube.nju.edu.cn/jwang/lab/soft/ocw/OCW.html. CONCLUSIONS: The mutation degree model proposed in this paper is adapted to phylogenetic data of different qualities, and to a wide range of evolutionary distances. The new word-counting method based on this model has the advantage of better performance in detecting short sequence of cis-elements from co-expressed genes of eukaryotes and is robust to less complete phylogenetic data. BioMed Central 2011-06-27 /pmc/articles/PMC3228546/ /pubmed/21708002 http://dx.doi.org/10.1186/1471-2105-12-262 Text en Copyright ©2011 Zhang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Zhang, Changqing Wang, Jin Hua, Xu Fang, Jinggui Zhu, Huaiqiu Gao, Xiang A mutation degree model for the identification of transcriptional regulatory elements |
title | A mutation degree model for the identification of transcriptional regulatory elements |
title_full | A mutation degree model for the identification of transcriptional regulatory elements |
title_fullStr | A mutation degree model for the identification of transcriptional regulatory elements |
title_full_unstemmed | A mutation degree model for the identification of transcriptional regulatory elements |
title_short | A mutation degree model for the identification of transcriptional regulatory elements |
title_sort | mutation degree model for the identification of transcriptional regulatory elements |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3228546/ https://www.ncbi.nlm.nih.gov/pubmed/21708002 http://dx.doi.org/10.1186/1471-2105-12-262 |
work_keys_str_mv | AT zhangchangqing amutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements AT wangjin amutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements AT huaxu amutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements AT fangjinggui amutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements AT zhuhuaiqiu amutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements AT gaoxiang amutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements AT zhangchangqing mutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements AT wangjin mutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements AT huaxu mutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements AT fangjinggui mutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements AT zhuhuaiqiu mutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements AT gaoxiang mutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements |