Cargando…
A specialized learner for inferring structured cis-regulatory modules
BACKGROUND: The process of transcription is controlled by systems of transcription factors, which bind to specific patterns of binding sites in the transcriptional control regions of genes, called cis-regulatory modules (CRMs). We present an expressive and easily comprehensible CRM representation wh...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1762027/ https://www.ncbi.nlm.nih.gov/pubmed/17147812 http://dx.doi.org/10.1186/1471-2105-7-528 |
_version_ | 1782131525189369856 |
---|---|
author | Noto, Keith Craven, Mark |
author_facet | Noto, Keith Craven, Mark |
author_sort | Noto, Keith |
collection | PubMed |
description | BACKGROUND: The process of transcription is controlled by systems of transcription factors, which bind to specific patterns of binding sites in the transcriptional control regions of genes, called cis-regulatory modules (CRMs). We present an expressive and easily comprehensible CRM representation which is capable of capturing several aspects of a CRM's structure and distinguishing between DNA sequences which do or do not contain it. We also present a learning algorithm tailored for this domain, and a novel method to avoid overfitting by controlling the expressivity of the model. RESULTS: We are able to find statistically significant CRMs more often then a current state-of-the-art approach on the same data sets. We also show experimentally that each aspect of our expressive CRM model space makes a positive contribution to the learned models on yeast and fly data. CONCLUSION: Structural aspects are an important part of CRMs, both in terms of interpreting them biologically and learning them accurately. Source code for our algorithm is available at: |
format | Text |
id | pubmed-1762027 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-17620272007-01-10 A specialized learner for inferring structured cis-regulatory modules Noto, Keith Craven, Mark BMC Bioinformatics Methodology Article BACKGROUND: The process of transcription is controlled by systems of transcription factors, which bind to specific patterns of binding sites in the transcriptional control regions of genes, called cis-regulatory modules (CRMs). We present an expressive and easily comprehensible CRM representation which is capable of capturing several aspects of a CRM's structure and distinguishing between DNA sequences which do or do not contain it. We also present a learning algorithm tailored for this domain, and a novel method to avoid overfitting by controlling the expressivity of the model. RESULTS: We are able to find statistically significant CRMs more often then a current state-of-the-art approach on the same data sets. We also show experimentally that each aspect of our expressive CRM model space makes a positive contribution to the learned models on yeast and fly data. CONCLUSION: Structural aspects are an important part of CRMs, both in terms of interpreting them biologically and learning them accurately. Source code for our algorithm is available at: BioMed Central 2006-12-05 /pmc/articles/PMC1762027/ /pubmed/17147812 http://dx.doi.org/10.1186/1471-2105-7-528 Text en Copyright © 2006 Noto and Craven; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Noto, Keith Craven, Mark A specialized learner for inferring structured cis-regulatory modules |
title | A specialized learner for inferring structured cis-regulatory modules |
title_full | A specialized learner for inferring structured cis-regulatory modules |
title_fullStr | A specialized learner for inferring structured cis-regulatory modules |
title_full_unstemmed | A specialized learner for inferring structured cis-regulatory modules |
title_short | A specialized learner for inferring structured cis-regulatory modules |
title_sort | specialized learner for inferring structured cis-regulatory modules |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1762027/ https://www.ncbi.nlm.nih.gov/pubmed/17147812 http://dx.doi.org/10.1186/1471-2105-7-528 |
work_keys_str_mv | AT notokeith aspecializedlearnerforinferringstructuredcisregulatorymodules AT cravenmark aspecializedlearnerforinferringstructuredcisregulatorymodules AT notokeith specializedlearnerforinferringstructuredcisregulatorymodules AT cravenmark specializedlearnerforinferringstructuredcisregulatorymodules |