Cargando…

A mutation degree model for the identification of transcriptional regulatory elements

BACKGROUND: Current approaches for identifying transcriptional regulatory elements are mainly via the combination of two properties, the evolutionary conservation and the overrepresentation of functional elements in the promoters of co-regulated genes. Despite the development of many motif detection...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Changqing, Wang, Jin, Hua, Xu, Fang, Jinggui, Zhu, Huaiqiu, Gao, Xiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3228546/
https://www.ncbi.nlm.nih.gov/pubmed/21708002
http://dx.doi.org/10.1186/1471-2105-12-262
_version_ 1782217830715883520
author Zhang, Changqing
Wang, Jin
Hua, Xu
Fang, Jinggui
Zhu, Huaiqiu
Gao, Xiang
author_facet Zhang, Changqing
Wang, Jin
Hua, Xu
Fang, Jinggui
Zhu, Huaiqiu
Gao, Xiang
author_sort Zhang, Changqing
collection PubMed
description BACKGROUND: Current approaches for identifying transcriptional regulatory elements are mainly via the combination of two properties, the evolutionary conservation and the overrepresentation of functional elements in the promoters of co-regulated genes. Despite the development of many motif detection algorithms, the discovery of conserved motifs in a wide range of phylogenetically related promoters is still a challenge, especially for the short motifs embedded in distantly related gene promoters or very closely related promoters, or in the situation that there are not enough orthologous genes available. RESULTS: A mutation degree model is proposed and a new word counting method is developed for the identification of transcriptional regulatory elements from a set of co-expressed genes. The new method comprises two parts: 1) identifying overrepresented oligo-nucleotides in promoters of co-expressed genes, 2) estimating the conservation of the oligo-nucleotides in promoters of phylogenetically related genes by the mutation degree model. Compared with the performance of other algorithms, our method shows the advantages of low false positive rate and higher specificity, especially the robustness to noisy data. Applying the method to co-expressed gene sets from Arabidopsis, most of known cis-elements were successfully detected. The tool and example are available at http://mcube.nju.edu.cn/jwang/lab/soft/ocw/OCW.html. CONCLUSIONS: The mutation degree model proposed in this paper is adapted to phylogenetic data of different qualities, and to a wide range of evolutionary distances. The new word-counting method based on this model has the advantage of better performance in detecting short sequence of cis-elements from co-expressed genes of eukaryotes and is robust to less complete phylogenetic data.
format Online
Article
Text
id pubmed-3228546
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32285462011-12-02 A mutation degree model for the identification of transcriptional regulatory elements Zhang, Changqing Wang, Jin Hua, Xu Fang, Jinggui Zhu, Huaiqiu Gao, Xiang BMC Bioinformatics Methodology Article BACKGROUND: Current approaches for identifying transcriptional regulatory elements are mainly via the combination of two properties, the evolutionary conservation and the overrepresentation of functional elements in the promoters of co-regulated genes. Despite the development of many motif detection algorithms, the discovery of conserved motifs in a wide range of phylogenetically related promoters is still a challenge, especially for the short motifs embedded in distantly related gene promoters or very closely related promoters, or in the situation that there are not enough orthologous genes available. RESULTS: A mutation degree model is proposed and a new word counting method is developed for the identification of transcriptional regulatory elements from a set of co-expressed genes. The new method comprises two parts: 1) identifying overrepresented oligo-nucleotides in promoters of co-expressed genes, 2) estimating the conservation of the oligo-nucleotides in promoters of phylogenetically related genes by the mutation degree model. Compared with the performance of other algorithms, our method shows the advantages of low false positive rate and higher specificity, especially the robustness to noisy data. Applying the method to co-expressed gene sets from Arabidopsis, most of known cis-elements were successfully detected. The tool and example are available at http://mcube.nju.edu.cn/jwang/lab/soft/ocw/OCW.html. CONCLUSIONS: The mutation degree model proposed in this paper is adapted to phylogenetic data of different qualities, and to a wide range of evolutionary distances. The new word-counting method based on this model has the advantage of better performance in detecting short sequence of cis-elements from co-expressed genes of eukaryotes and is robust to less complete phylogenetic data. BioMed Central 2011-06-27 /pmc/articles/PMC3228546/ /pubmed/21708002 http://dx.doi.org/10.1186/1471-2105-12-262 Text en Copyright ©2011 Zhang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Zhang, Changqing
Wang, Jin
Hua, Xu
Fang, Jinggui
Zhu, Huaiqiu
Gao, Xiang
A mutation degree model for the identification of transcriptional regulatory elements
title A mutation degree model for the identification of transcriptional regulatory elements
title_full A mutation degree model for the identification of transcriptional regulatory elements
title_fullStr A mutation degree model for the identification of transcriptional regulatory elements
title_full_unstemmed A mutation degree model for the identification of transcriptional regulatory elements
title_short A mutation degree model for the identification of transcriptional regulatory elements
title_sort mutation degree model for the identification of transcriptional regulatory elements
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3228546/
https://www.ncbi.nlm.nih.gov/pubmed/21708002
http://dx.doi.org/10.1186/1471-2105-12-262
work_keys_str_mv AT zhangchangqing amutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements
AT wangjin amutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements
AT huaxu amutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements
AT fangjinggui amutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements
AT zhuhuaiqiu amutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements
AT gaoxiang amutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements
AT zhangchangqing mutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements
AT wangjin mutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements
AT huaxu mutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements
AT fangjinggui mutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements
AT zhuhuaiqiu mutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements
AT gaoxiang mutationdegreemodelfortheidentificationoftranscriptionalregulatoryelements