Cargando…

PC-TraFF: identification of potentially collaborating transcription factors using pointwise mutual information

BACKGROUND: Transcription factors (TFs) are important regulatory proteins that govern transcriptional regulation. Today, it is known that in higher organisms different TFs have to cooperate rather than acting individually in order to control complex genetic programs. The identification of these inte...

Descripción completa

Detalles Bibliográficos
Autores principales: Meckbach, Cornelia, Tacke, Rebecca, Hua, Xu, Waack, Stephan, Wingender, Edgar, Gültas, Mehmet
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4667426/
https://www.ncbi.nlm.nih.gov/pubmed/26627005
http://dx.doi.org/10.1186/s12859-015-0827-2
_version_ 1782403825894686720
author Meckbach, Cornelia
Tacke, Rebecca
Hua, Xu
Waack, Stephan
Wingender, Edgar
Gültas, Mehmet
author_facet Meckbach, Cornelia
Tacke, Rebecca
Hua, Xu
Waack, Stephan
Wingender, Edgar
Gültas, Mehmet
author_sort Meckbach, Cornelia
collection PubMed
description BACKGROUND: Transcription factors (TFs) are important regulatory proteins that govern transcriptional regulation. Today, it is known that in higher organisms different TFs have to cooperate rather than acting individually in order to control complex genetic programs. The identification of these interactions is an important challenge for understanding the molecular mechanisms of regulating biological processes. In this study, we present a new method based on pointwise mutual information, PC-TraFF, which considers the genome as a document, the sequences as sentences, and TF binding sites (TFBSs) as words to identify interacting TFs in a set of sequences. RESULTS: To demonstrate the effectiveness of PC-TraFF, we performed a genome-wide analysis and a breast cancer-associated sequence set analysis for protein coding and miRNA genes. Our results show that in any of these sequence sets, PC-TraFF is able to identify important interacting TF pairs, for most of which we found support by previously published experimental results. Further, we made a pairwise comparison between PC-TraFF and three conventional methods. The outcome of this comparison study strongly suggests that all these methods focus on different important aspects of interaction between TFs and thus the pairwise overlap between any of them is only marginal. CONCLUSIONS: In this study, adopting the idea from the field of linguistics in the field of bioinformatics, we develop a new information theoretic method, PC-TraFF, for the identification of potentially collaborating transcription factors based on the idiosyncrasy of their binding site distributions on the genome. The results of our study show that PC-TraFF can succesfully identify known interacting TF pairs and thus its currently biologically uncorfirmed predictions could provide new hypotheses for further experimental validation. Additionally, the comparison of the results of PC-TraFF with the results of previous methods demonstrates that different methods with their specific scopes can perfectly supplement each other. Overall, our analyses indicate that PC-TraFF is a time-efficient method where its algorithm has a tractable computational time and memory consumption. The PC-TraFF server is freely accessible at http://pctraff.bioinf.med.uni-goettingen.de/ ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0827-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4667426
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46674262015-12-03 PC-TraFF: identification of potentially collaborating transcription factors using pointwise mutual information Meckbach, Cornelia Tacke, Rebecca Hua, Xu Waack, Stephan Wingender, Edgar Gültas, Mehmet BMC Bioinformatics Methodology Article BACKGROUND: Transcription factors (TFs) are important regulatory proteins that govern transcriptional regulation. Today, it is known that in higher organisms different TFs have to cooperate rather than acting individually in order to control complex genetic programs. The identification of these interactions is an important challenge for understanding the molecular mechanisms of regulating biological processes. In this study, we present a new method based on pointwise mutual information, PC-TraFF, which considers the genome as a document, the sequences as sentences, and TF binding sites (TFBSs) as words to identify interacting TFs in a set of sequences. RESULTS: To demonstrate the effectiveness of PC-TraFF, we performed a genome-wide analysis and a breast cancer-associated sequence set analysis for protein coding and miRNA genes. Our results show that in any of these sequence sets, PC-TraFF is able to identify important interacting TF pairs, for most of which we found support by previously published experimental results. Further, we made a pairwise comparison between PC-TraFF and three conventional methods. The outcome of this comparison study strongly suggests that all these methods focus on different important aspects of interaction between TFs and thus the pairwise overlap between any of them is only marginal. CONCLUSIONS: In this study, adopting the idea from the field of linguistics in the field of bioinformatics, we develop a new information theoretic method, PC-TraFF, for the identification of potentially collaborating transcription factors based on the idiosyncrasy of their binding site distributions on the genome. The results of our study show that PC-TraFF can succesfully identify known interacting TF pairs and thus its currently biologically uncorfirmed predictions could provide new hypotheses for further experimental validation. Additionally, the comparison of the results of PC-TraFF with the results of previous methods demonstrates that different methods with their specific scopes can perfectly supplement each other. Overall, our analyses indicate that PC-TraFF is a time-efficient method where its algorithm has a tractable computational time and memory consumption. The PC-TraFF server is freely accessible at http://pctraff.bioinf.med.uni-goettingen.de/ ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0827-2) contains supplementary material, which is available to authorized users. BioMed Central 2015-12-01 /pmc/articles/PMC4667426/ /pubmed/26627005 http://dx.doi.org/10.1186/s12859-015-0827-2 Text en © Meckbach et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Meckbach, Cornelia
Tacke, Rebecca
Hua, Xu
Waack, Stephan
Wingender, Edgar
Gültas, Mehmet
PC-TraFF: identification of potentially collaborating transcription factors using pointwise mutual information
title PC-TraFF: identification of potentially collaborating transcription factors using pointwise mutual information
title_full PC-TraFF: identification of potentially collaborating transcription factors using pointwise mutual information
title_fullStr PC-TraFF: identification of potentially collaborating transcription factors using pointwise mutual information
title_full_unstemmed PC-TraFF: identification of potentially collaborating transcription factors using pointwise mutual information
title_short PC-TraFF: identification of potentially collaborating transcription factors using pointwise mutual information
title_sort pc-traff: identification of potentially collaborating transcription factors using pointwise mutual information
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4667426/
https://www.ncbi.nlm.nih.gov/pubmed/26627005
http://dx.doi.org/10.1186/s12859-015-0827-2
work_keys_str_mv AT meckbachcornelia pctraffidentificationofpotentiallycollaboratingtranscriptionfactorsusingpointwisemutualinformation
AT tackerebecca pctraffidentificationofpotentiallycollaboratingtranscriptionfactorsusingpointwisemutualinformation
AT huaxu pctraffidentificationofpotentiallycollaboratingtranscriptionfactorsusingpointwisemutualinformation
AT waackstephan pctraffidentificationofpotentiallycollaboratingtranscriptionfactorsusingpointwisemutualinformation
AT wingenderedgar pctraffidentificationofpotentiallycollaboratingtranscriptionfactorsusingpointwisemutualinformation
AT gultasmehmet pctraffidentificationofpotentiallycollaboratingtranscriptionfactorsusingpointwisemutualinformation