Cargando…

Transcriptional programs: Modelling higher order structure in transcriptional control

BACKGROUND: Transcriptional regulation is an important part of regulatory control in eukaryotes. Even if binding motifs for transcription factors are known, the task of finding binding sites by scanning sequences is plagued by false positives. One way to improve the detection of binding sites from m...

Descripción completa

Detalles Bibliográficos
Autores principales: Reid, John E, Ott, Sascha, Wernisch, Lorenz
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2725141/
https://www.ncbi.nlm.nih.gov/pubmed/19607663
http://dx.doi.org/10.1186/1471-2105-10-218
_version_ 1782170490991804416
author Reid, John E
Ott, Sascha
Wernisch, Lorenz
author_facet Reid, John E
Ott, Sascha
Wernisch, Lorenz
author_sort Reid, John E
collection PubMed
description BACKGROUND: Transcriptional regulation is an important part of regulatory control in eukaryotes. Even if binding motifs for transcription factors are known, the task of finding binding sites by scanning sequences is plagued by false positives. One way to improve the detection of binding sites from motifs is by taking cooperativity of transcription factor binding into account. We propose a non-parametric probabilistic model, similar to a document topic model, for detecting transcriptional programs, groups of cooperative transcription factors and co-regulated genes. The analysis results in transcriptional programs which generalise both transcriptional modules and TF-target gene incidence matrices and provide a higher-level summary of these structures. The method is independent of prior specification of training sets of genes, for example, via gene expression data. The analysis is based on known binding motifs. RESULTS: We applied our method to putative regulatory regions of 18,445 Mus musculus genes. We discovered just 68 transcriptional programs that effectively summarised the action of 149 transcription factors on these genes. Several of these programs were significantly enriched for known biological processes and signalling pathways. One transcriptional program has a significant overlap with a reference set of cell cycle specific transcription factors. CONCLUSION: Our method is able to pick out higher order structure from noisy sequence analyses. The transcriptional programs it identifies potentially represent common mechanisms of regulatory control across the genome. It simultaneously predicts which genes are co-regulated and which sets of transcription factors cooperate to achieve this co-regulation. The programs we discovered enable biologists to choose new genes and transcription factors to study in specific transcriptional regulatory systems.
format Text
id pubmed-2725141
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27251412009-08-12 Transcriptional programs: Modelling higher order structure in transcriptional control Reid, John E Ott, Sascha Wernisch, Lorenz BMC Bioinformatics Research Article BACKGROUND: Transcriptional regulation is an important part of regulatory control in eukaryotes. Even if binding motifs for transcription factors are known, the task of finding binding sites by scanning sequences is plagued by false positives. One way to improve the detection of binding sites from motifs is by taking cooperativity of transcription factor binding into account. We propose a non-parametric probabilistic model, similar to a document topic model, for detecting transcriptional programs, groups of cooperative transcription factors and co-regulated genes. The analysis results in transcriptional programs which generalise both transcriptional modules and TF-target gene incidence matrices and provide a higher-level summary of these structures. The method is independent of prior specification of training sets of genes, for example, via gene expression data. The analysis is based on known binding motifs. RESULTS: We applied our method to putative regulatory regions of 18,445 Mus musculus genes. We discovered just 68 transcriptional programs that effectively summarised the action of 149 transcription factors on these genes. Several of these programs were significantly enriched for known biological processes and signalling pathways. One transcriptional program has a significant overlap with a reference set of cell cycle specific transcription factors. CONCLUSION: Our method is able to pick out higher order structure from noisy sequence analyses. The transcriptional programs it identifies potentially represent common mechanisms of regulatory control across the genome. It simultaneously predicts which genes are co-regulated and which sets of transcription factors cooperate to achieve this co-regulation. The programs we discovered enable biologists to choose new genes and transcription factors to study in specific transcriptional regulatory systems. BioMed Central 2009-07-16 /pmc/articles/PMC2725141/ /pubmed/19607663 http://dx.doi.org/10.1186/1471-2105-10-218 Text en Copyright © 2009 Reid et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Reid, John E
Ott, Sascha
Wernisch, Lorenz
Transcriptional programs: Modelling higher order structure in transcriptional control
title Transcriptional programs: Modelling higher order structure in transcriptional control
title_full Transcriptional programs: Modelling higher order structure in transcriptional control
title_fullStr Transcriptional programs: Modelling higher order structure in transcriptional control
title_full_unstemmed Transcriptional programs: Modelling higher order structure in transcriptional control
title_short Transcriptional programs: Modelling higher order structure in transcriptional control
title_sort transcriptional programs: modelling higher order structure in transcriptional control
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2725141/
https://www.ncbi.nlm.nih.gov/pubmed/19607663
http://dx.doi.org/10.1186/1471-2105-10-218
work_keys_str_mv AT reidjohne transcriptionalprogramsmodellinghigherorderstructureintranscriptionalcontrol
AT ottsascha transcriptionalprogramsmodellinghigherorderstructureintranscriptionalcontrol
AT wernischlorenz transcriptionalprogramsmodellinghigherorderstructureintranscriptionalcontrol