Cargando…
A proximity-based graph clustering method for the identification and application of transcription factor clusters
BACKGROUND: Transcription factors (TFs) form a complex regulatory network within the cell that is crucial to cell functioning and human health. While methods to establish where a TF binds to DNA are well established, these methods provide no information describing how TFs interact with one another w...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5706350/ https://www.ncbi.nlm.nih.gov/pubmed/29187152 http://dx.doi.org/10.1186/s12859-017-1935-y |
_version_ | 1783282214001180672 |
---|---|
author | Spadafore, Maxwell Najarian, Kayvan Boyle, Alan P. |
author_facet | Spadafore, Maxwell Najarian, Kayvan Boyle, Alan P. |
author_sort | Spadafore, Maxwell |
collection | PubMed |
description | BACKGROUND: Transcription factors (TFs) form a complex regulatory network within the cell that is crucial to cell functioning and human health. While methods to establish where a TF binds to DNA are well established, these methods provide no information describing how TFs interact with one another when they do bind. TFs tend to bind the genome in clusters, and current methods to identify these clusters are either limited in scope, unable to detect relationships beyond motif similarity, or not applied to TF-TF interactions. METHODS: Here, we present a proximity-based graph clustering approach to identify TF clusters using either ChIP-seq or motif search data. We use TF co-occurrence to construct a filtered, normalized adjacency matrix and use the Markov Clustering Algorithm to partition the graph while maintaining TF-cluster and cluster-cluster interactions. We then apply our graph structure beyond clustering, using it to increase the accuracy of motif-based TFBS searching for an example TF. RESULTS: We show that our method produces small, manageable clusters that encapsulate many known, experimentally validated transcription factor interactions and that our method is capable of capturing interactions that motif similarity methods might miss. Our graph structure is able to significantly increase the accuracy of motif TFBS searching, demonstrating that the TF-TF connections within the graph correlate with biological TF-TF interactions. CONCLUSION: The interactions identified by our method correspond to biological reality and allow for fast exploration of TF clustering and regulatory dynamics. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1935-y) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5706350 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-57063502017-12-05 A proximity-based graph clustering method for the identification and application of transcription factor clusters Spadafore, Maxwell Najarian, Kayvan Boyle, Alan P. BMC Bioinformatics Methodology Article BACKGROUND: Transcription factors (TFs) form a complex regulatory network within the cell that is crucial to cell functioning and human health. While methods to establish where a TF binds to DNA are well established, these methods provide no information describing how TFs interact with one another when they do bind. TFs tend to bind the genome in clusters, and current methods to identify these clusters are either limited in scope, unable to detect relationships beyond motif similarity, or not applied to TF-TF interactions. METHODS: Here, we present a proximity-based graph clustering approach to identify TF clusters using either ChIP-seq or motif search data. We use TF co-occurrence to construct a filtered, normalized adjacency matrix and use the Markov Clustering Algorithm to partition the graph while maintaining TF-cluster and cluster-cluster interactions. We then apply our graph structure beyond clustering, using it to increase the accuracy of motif-based TFBS searching for an example TF. RESULTS: We show that our method produces small, manageable clusters that encapsulate many known, experimentally validated transcription factor interactions and that our method is capable of capturing interactions that motif similarity methods might miss. Our graph structure is able to significantly increase the accuracy of motif TFBS searching, demonstrating that the TF-TF connections within the graph correlate with biological TF-TF interactions. CONCLUSION: The interactions identified by our method correspond to biological reality and allow for fast exploration of TF clustering and regulatory dynamics. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1935-y) contains supplementary material, which is available to authorized users. BioMed Central 2017-11-29 /pmc/articles/PMC5706350/ /pubmed/29187152 http://dx.doi.org/10.1186/s12859-017-1935-y Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Spadafore, Maxwell Najarian, Kayvan Boyle, Alan P. A proximity-based graph clustering method for the identification and application of transcription factor clusters |
title | A proximity-based graph clustering method for the identification and application of transcription factor clusters |
title_full | A proximity-based graph clustering method for the identification and application of transcription factor clusters |
title_fullStr | A proximity-based graph clustering method for the identification and application of transcription factor clusters |
title_full_unstemmed | A proximity-based graph clustering method for the identification and application of transcription factor clusters |
title_short | A proximity-based graph clustering method for the identification and application of transcription factor clusters |
title_sort | proximity-based graph clustering method for the identification and application of transcription factor clusters |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5706350/ https://www.ncbi.nlm.nih.gov/pubmed/29187152 http://dx.doi.org/10.1186/s12859-017-1935-y |
work_keys_str_mv | AT spadaforemaxwell aproximitybasedgraphclusteringmethodfortheidentificationandapplicationoftranscriptionfactorclusters AT najariankayvan aproximitybasedgraphclusteringmethodfortheidentificationandapplicationoftranscriptionfactorclusters AT boylealanp aproximitybasedgraphclusteringmethodfortheidentificationandapplicationoftranscriptionfactorclusters AT spadaforemaxwell proximitybasedgraphclusteringmethodfortheidentificationandapplicationoftranscriptionfactorclusters AT najariankayvan proximitybasedgraphclusteringmethodfortheidentificationandapplicationoftranscriptionfactorclusters AT boylealanp proximitybasedgraphclusteringmethodfortheidentificationandapplicationoftranscriptionfactorclusters |