Cargando…

Genome-Wide Signatures of Transcription Factor Activity: Connecting Transcription Factors, Disease, and Small Molecules

Identifying transcription factors (TF) involved in producing a genome-wide transcriptional profile is an essential step in building mechanistic model that can explain observed gene expression data. We developed a statistical framework for constructing genome-wide signatures of TF activity, and for u...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Jing, Hu, Zhen, Phatak, Mukta, Reichard, John, Freudenberg, Johannes M., Sivaganesan, Siva, Medvedovic, Mario
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3764016/
https://www.ncbi.nlm.nih.gov/pubmed/24039560
http://dx.doi.org/10.1371/journal.pcbi.1003198
_version_ 1782283074149548032
author Chen, Jing
Hu, Zhen
Phatak, Mukta
Reichard, John
Freudenberg, Johannes M.
Sivaganesan, Siva
Medvedovic, Mario
author_facet Chen, Jing
Hu, Zhen
Phatak, Mukta
Reichard, John
Freudenberg, Johannes M.
Sivaganesan, Siva
Medvedovic, Mario
author_sort Chen, Jing
collection PubMed
description Identifying transcription factors (TF) involved in producing a genome-wide transcriptional profile is an essential step in building mechanistic model that can explain observed gene expression data. We developed a statistical framework for constructing genome-wide signatures of TF activity, and for using such signatures in the analysis of gene expression data produced by complex transcriptional regulatory programs. Our framework integrates ChIP-seq data and appropriately matched gene expression profiles to identify True REGulatory (TREG) TF-gene interactions. It provides genome-wide quantification of the likelihood of regulatory TF-gene interaction that can be used to either identify regulated genes, or as genome-wide signature of TF activity. To effectively use ChIP-seq data, we introduce a novel statistical model that integrates information from all binding “peaks” within 2 Mb window around a gene's transcription start site (TSS), and provides gene-level binding scores and probabilities of regulatory interaction. In the second step we integrate these binding scores and regulatory probabilities with gene expression data to assess the likelihood of True REGulatory (TREG) TF-gene interactions. We demonstrate the advantages of TREG framework in identifying genes regulated by two TFs with widely different distribution of functional binding events (ERα and E2f1). We also show that TREG signatures of TF activity vastly improve our ability to detect involvement of ERα in producing complex diseases-related transcriptional profiles. Through a large study of disease-related transcriptional signatures and transcriptional signatures of drug activity, we demonstrate that increase in statistical power associated with the use of TREG signatures makes the crucial difference in identifying key targets for treatment, and drugs to use for treatment. All methods are implemented in an open-source R package treg. The package also contains all data used in the analysis including 494 TREG binding profiles based on ENCODE ChIP-seq data. The treg package can be downloaded at http://GenomicsPortals.org.
format Online
Article
Text
id pubmed-3764016
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37640162013-09-13 Genome-Wide Signatures of Transcription Factor Activity: Connecting Transcription Factors, Disease, and Small Molecules Chen, Jing Hu, Zhen Phatak, Mukta Reichard, John Freudenberg, Johannes M. Sivaganesan, Siva Medvedovic, Mario PLoS Comput Biol Research Article Identifying transcription factors (TF) involved in producing a genome-wide transcriptional profile is an essential step in building mechanistic model that can explain observed gene expression data. We developed a statistical framework for constructing genome-wide signatures of TF activity, and for using such signatures in the analysis of gene expression data produced by complex transcriptional regulatory programs. Our framework integrates ChIP-seq data and appropriately matched gene expression profiles to identify True REGulatory (TREG) TF-gene interactions. It provides genome-wide quantification of the likelihood of regulatory TF-gene interaction that can be used to either identify regulated genes, or as genome-wide signature of TF activity. To effectively use ChIP-seq data, we introduce a novel statistical model that integrates information from all binding “peaks” within 2 Mb window around a gene's transcription start site (TSS), and provides gene-level binding scores and probabilities of regulatory interaction. In the second step we integrate these binding scores and regulatory probabilities with gene expression data to assess the likelihood of True REGulatory (TREG) TF-gene interactions. We demonstrate the advantages of TREG framework in identifying genes regulated by two TFs with widely different distribution of functional binding events (ERα and E2f1). We also show that TREG signatures of TF activity vastly improve our ability to detect involvement of ERα in producing complex diseases-related transcriptional profiles. Through a large study of disease-related transcriptional signatures and transcriptional signatures of drug activity, we demonstrate that increase in statistical power associated with the use of TREG signatures makes the crucial difference in identifying key targets for treatment, and drugs to use for treatment. All methods are implemented in an open-source R package treg. The package also contains all data used in the analysis including 494 TREG binding profiles based on ENCODE ChIP-seq data. The treg package can be downloaded at http://GenomicsPortals.org. Public Library of Science 2013-09-05 /pmc/articles/PMC3764016/ /pubmed/24039560 http://dx.doi.org/10.1371/journal.pcbi.1003198 Text en © 2013 Chen et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Chen, Jing
Hu, Zhen
Phatak, Mukta
Reichard, John
Freudenberg, Johannes M.
Sivaganesan, Siva
Medvedovic, Mario
Genome-Wide Signatures of Transcription Factor Activity: Connecting Transcription Factors, Disease, and Small Molecules
title Genome-Wide Signatures of Transcription Factor Activity: Connecting Transcription Factors, Disease, and Small Molecules
title_full Genome-Wide Signatures of Transcription Factor Activity: Connecting Transcription Factors, Disease, and Small Molecules
title_fullStr Genome-Wide Signatures of Transcription Factor Activity: Connecting Transcription Factors, Disease, and Small Molecules
title_full_unstemmed Genome-Wide Signatures of Transcription Factor Activity: Connecting Transcription Factors, Disease, and Small Molecules
title_short Genome-Wide Signatures of Transcription Factor Activity: Connecting Transcription Factors, Disease, and Small Molecules
title_sort genome-wide signatures of transcription factor activity: connecting transcription factors, disease, and small molecules
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3764016/
https://www.ncbi.nlm.nih.gov/pubmed/24039560
http://dx.doi.org/10.1371/journal.pcbi.1003198
work_keys_str_mv AT chenjing genomewidesignaturesoftranscriptionfactoractivityconnectingtranscriptionfactorsdiseaseandsmallmolecules
AT huzhen genomewidesignaturesoftranscriptionfactoractivityconnectingtranscriptionfactorsdiseaseandsmallmolecules
AT phatakmukta genomewidesignaturesoftranscriptionfactoractivityconnectingtranscriptionfactorsdiseaseandsmallmolecules
AT reichardjohn genomewidesignaturesoftranscriptionfactoractivityconnectingtranscriptionfactorsdiseaseandsmallmolecules
AT freudenbergjohannesm genomewidesignaturesoftranscriptionfactoractivityconnectingtranscriptionfactorsdiseaseandsmallmolecules
AT sivaganesansiva genomewidesignaturesoftranscriptionfactoractivityconnectingtranscriptionfactorsdiseaseandsmallmolecules
AT medvedovicmario genomewidesignaturesoftranscriptionfactoractivityconnectingtranscriptionfactorsdiseaseandsmallmolecules