Cargando…
GPMiner: an integrated system for mining combinatorial cis-regulatory elements in mammalian gene group
BACKGROUND: Sequence features in promoter regions are involved in regulating gene transcription initiation. Although numerous computational methods have been developed for predicting transcriptional start sites (TSSs) or transcription factor (TF) binding sites (TFBSs), they lack annotations for do n...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3587379/ https://www.ncbi.nlm.nih.gov/pubmed/22369687 http://dx.doi.org/10.1186/1471-2164-13-S1-S3 |
_version_ | 1782261392526540800 |
---|---|
author | Lee, Tzong-Yi Chang, Wen-Chi Hsu, Justin Bo-Kai Chang, Tzu-Hao Shien, Dray-Ming |
author_facet | Lee, Tzong-Yi Chang, Wen-Chi Hsu, Justin Bo-Kai Chang, Tzu-Hao Shien, Dray-Ming |
author_sort | Lee, Tzong-Yi |
collection | PubMed |
description | BACKGROUND: Sequence features in promoter regions are involved in regulating gene transcription initiation. Although numerous computational methods have been developed for predicting transcriptional start sites (TSSs) or transcription factor (TF) binding sites (TFBSs), they lack annotations for do not consider some important regulatory features such as CpG islands, tandem repeats, the TATA box, CCAAT box, GC box, over-represented oligonucleotides, DNA stability, and GC content. Additionally, the combinatorial interaction of TFs regulates the gene group that is associated with same expression pattern. To investigate gene transcriptional regulation, an integrated system that annotates regulatory features in a promoter sequence and detects co-regulation of TFs in a group of genes is needed. RESULTS: This work identifies TSSs and regulatory features in a promoter sequence, and recognizes co-occurrence of cis-regulatory elements in co-expressed genes using a novel system. Three well-known TSS prediction tools are incorporated with orthologous conserved features, such as CpG islands, nucleotide composition, over-represented hexamer nucleotides, and DNA stability, to construct the novel Gene Promoter Miner (GPMiner) using a support vector machine (SVM). According to five-fold cross-validation results, the predictive sensitivity and specificity are both roughly 80%. The proposed system allows users to input a group of gene names/symbols, enabling the co-occurrence of TFBSs to be determined. Additionally, an input sequence can also be analyzed for homogeneity of experimental mammalian promoter sequences, and conserved regulatory features between homologous promoters can be observed through cross-species analysis. After identifying promoter regions, regulatory features are visualized graphically to facilitate gene promoter observations. CONCLUSIONS: The GPMiner, which has a user-friendly input/output interface, has numerous benefits in analyzing human and mouse promoters. The proposed system is freely available at http://GPMiner.mbc.nctu.edu.tw/. |
format | Online Article Text |
id | pubmed-3587379 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35873792013-03-11 GPMiner: an integrated system for mining combinatorial cis-regulatory elements in mammalian gene group Lee, Tzong-Yi Chang, Wen-Chi Hsu, Justin Bo-Kai Chang, Tzu-Hao Shien, Dray-Ming BMC Genomics Proceedings BACKGROUND: Sequence features in promoter regions are involved in regulating gene transcription initiation. Although numerous computational methods have been developed for predicting transcriptional start sites (TSSs) or transcription factor (TF) binding sites (TFBSs), they lack annotations for do not consider some important regulatory features such as CpG islands, tandem repeats, the TATA box, CCAAT box, GC box, over-represented oligonucleotides, DNA stability, and GC content. Additionally, the combinatorial interaction of TFs regulates the gene group that is associated with same expression pattern. To investigate gene transcriptional regulation, an integrated system that annotates regulatory features in a promoter sequence and detects co-regulation of TFs in a group of genes is needed. RESULTS: This work identifies TSSs and regulatory features in a promoter sequence, and recognizes co-occurrence of cis-regulatory elements in co-expressed genes using a novel system. Three well-known TSS prediction tools are incorporated with orthologous conserved features, such as CpG islands, nucleotide composition, over-represented hexamer nucleotides, and DNA stability, to construct the novel Gene Promoter Miner (GPMiner) using a support vector machine (SVM). According to five-fold cross-validation results, the predictive sensitivity and specificity are both roughly 80%. The proposed system allows users to input a group of gene names/symbols, enabling the co-occurrence of TFBSs to be determined. Additionally, an input sequence can also be analyzed for homogeneity of experimental mammalian promoter sequences, and conserved regulatory features between homologous promoters can be observed through cross-species analysis. After identifying promoter regions, regulatory features are visualized graphically to facilitate gene promoter observations. CONCLUSIONS: The GPMiner, which has a user-friendly input/output interface, has numerous benefits in analyzing human and mouse promoters. The proposed system is freely available at http://GPMiner.mbc.nctu.edu.tw/. BioMed Central 2012-01-17 /pmc/articles/PMC3587379/ /pubmed/22369687 http://dx.doi.org/10.1186/1471-2164-13-S1-S3 Text en Copyright ©2012 Lee et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Lee, Tzong-Yi Chang, Wen-Chi Hsu, Justin Bo-Kai Chang, Tzu-Hao Shien, Dray-Ming GPMiner: an integrated system for mining combinatorial cis-regulatory elements in mammalian gene group |
title | GPMiner: an integrated system for mining combinatorial cis-regulatory elements in mammalian gene group |
title_full | GPMiner: an integrated system for mining combinatorial cis-regulatory elements in mammalian gene group |
title_fullStr | GPMiner: an integrated system for mining combinatorial cis-regulatory elements in mammalian gene group |
title_full_unstemmed | GPMiner: an integrated system for mining combinatorial cis-regulatory elements in mammalian gene group |
title_short | GPMiner: an integrated system for mining combinatorial cis-regulatory elements in mammalian gene group |
title_sort | gpminer: an integrated system for mining combinatorial cis-regulatory elements in mammalian gene group |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3587379/ https://www.ncbi.nlm.nih.gov/pubmed/22369687 http://dx.doi.org/10.1186/1471-2164-13-S1-S3 |
work_keys_str_mv | AT leetzongyi gpmineranintegratedsystemforminingcombinatorialcisregulatoryelementsinmammaliangenegroup AT changwenchi gpmineranintegratedsystemforminingcombinatorialcisregulatoryelementsinmammaliangenegroup AT hsujustinbokai gpmineranintegratedsystemforminingcombinatorialcisregulatoryelementsinmammaliangenegroup AT changtzuhao gpmineranintegratedsystemforminingcombinatorialcisregulatoryelementsinmammaliangenegroup AT shiendrayming gpmineranintegratedsystemforminingcombinatorialcisregulatoryelementsinmammaliangenegroup |