Cargando…
Motif Discovery in Tissue-Specific Regulatory Sequences Using Directed Information
Motif discovery for the identification of functional regulatory elements underlying gene expression is a challenging problem. Sequence inspection often leads to discovery of novel motifs (including transcription factor sites) with previously uncharacterized function in gene expression. Coupled with...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171326/ https://www.ncbi.nlm.nih.gov/pubmed/18340376 http://dx.doi.org/10.1155/2007/13853 |
_version_ | 1782211737371541504 |
---|---|
author | Rao, Arvind Hero, Alfred O States, David J Engel, James Douglas |
author_facet | Rao, Arvind Hero, Alfred O States, David J Engel, James Douglas |
author_sort | Rao, Arvind |
collection | PubMed |
description | Motif discovery for the identification of functional regulatory elements underlying gene expression is a challenging problem. Sequence inspection often leads to discovery of novel motifs (including transcription factor sites) with previously uncharacterized function in gene expression. Coupled with the complexity underlying tissue-specific gene expression, there are several motifs that are putatively responsible for expression in a certain cell type. This has important implications in understanding fundamental biological processes such as development and disease progression. In this work, we present an approach to the identification of motifs (not necessarily transcription factor sites) and examine its application to some questions in current bioinformatics research. These motifs are seen to discriminate tissue-specific gene promoter or regulatory regions from those that are not tissue-specific. There are two main contributions of this work. Firstly, we propose the use of directed information for such classification constrained motif discovery, and then use the selected features with a support vector machine (SVM) classifier to find the tissue specificity of any sequence of interest. Such analysis yields several novel interesting motifs that merit further experimental characterization. Furthermore, this approach leads to a principled framework for the prospective examination of any chosen motif to be discriminatory motif for a group of coexpressed/coregulated genes, thereby integrating sequence and expression perspectives. We hypothesize that the discovery of these motifs would enable the large-scale investigation for the tissue-specific regulatory role of any conserved sequence element identified from genome-wide studies. |
format | Online Article Text |
id | pubmed-3171326 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | Springer |
record_format | MEDLINE/PubMed |
spelling | pubmed-31713262011-09-13 Motif Discovery in Tissue-Specific Regulatory Sequences Using Directed Information Rao, Arvind Hero, Alfred O States, David J Engel, James Douglas EURASIP J Bioinform Syst Biol Research Article Motif discovery for the identification of functional regulatory elements underlying gene expression is a challenging problem. Sequence inspection often leads to discovery of novel motifs (including transcription factor sites) with previously uncharacterized function in gene expression. Coupled with the complexity underlying tissue-specific gene expression, there are several motifs that are putatively responsible for expression in a certain cell type. This has important implications in understanding fundamental biological processes such as development and disease progression. In this work, we present an approach to the identification of motifs (not necessarily transcription factor sites) and examine its application to some questions in current bioinformatics research. These motifs are seen to discriminate tissue-specific gene promoter or regulatory regions from those that are not tissue-specific. There are two main contributions of this work. Firstly, we propose the use of directed information for such classification constrained motif discovery, and then use the selected features with a support vector machine (SVM) classifier to find the tissue specificity of any sequence of interest. Such analysis yields several novel interesting motifs that merit further experimental characterization. Furthermore, this approach leads to a principled framework for the prospective examination of any chosen motif to be discriminatory motif for a group of coexpressed/coregulated genes, thereby integrating sequence and expression perspectives. We hypothesize that the discovery of these motifs would enable the large-scale investigation for the tissue-specific regulatory role of any conserved sequence element identified from genome-wide studies. Springer 2007-12-24 /pmc/articles/PMC3171326/ /pubmed/18340376 http://dx.doi.org/10.1155/2007/13853 Text en Copyright © 2007 Arvind Rao et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Rao, Arvind Hero, Alfred O States, David J Engel, James Douglas Motif Discovery in Tissue-Specific Regulatory Sequences Using Directed Information |
title | Motif Discovery in Tissue-Specific Regulatory Sequences Using Directed Information |
title_full | Motif Discovery in Tissue-Specific Regulatory Sequences Using Directed Information |
title_fullStr | Motif Discovery in Tissue-Specific Regulatory Sequences Using Directed Information |
title_full_unstemmed | Motif Discovery in Tissue-Specific Regulatory Sequences Using Directed Information |
title_short | Motif Discovery in Tissue-Specific Regulatory Sequences Using Directed Information |
title_sort | motif discovery in tissue-specific regulatory sequences using directed information |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171326/ https://www.ncbi.nlm.nih.gov/pubmed/18340376 http://dx.doi.org/10.1155/2007/13853 |
work_keys_str_mv | AT raoarvind motifdiscoveryintissuespecificregulatorysequencesusingdirectedinformation AT heroalfredo motifdiscoveryintissuespecificregulatorysequencesusingdirectedinformation AT statesdavidj motifdiscoveryintissuespecificregulatorysequencesusingdirectedinformation AT engeljamesdouglas motifdiscoveryintissuespecificregulatorysequencesusingdirectedinformation |