Cargando…

A proximity-based graph clustering method for the identification and application of transcription factor clusters

BACKGROUND: Transcription factors (TFs) form a complex regulatory network within the cell that is crucial to cell functioning and human health. While methods to establish where a TF binds to DNA are well established, these methods provide no information describing how TFs interact with one another w...

Descripción completa

Detalles Bibliográficos
Autores principales: Spadafore, Maxwell, Najarian, Kayvan, Boyle, Alan P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5706350/
https://www.ncbi.nlm.nih.gov/pubmed/29187152
http://dx.doi.org/10.1186/s12859-017-1935-y
_version_ 1783282214001180672
author Spadafore, Maxwell
Najarian, Kayvan
Boyle, Alan P.
author_facet Spadafore, Maxwell
Najarian, Kayvan
Boyle, Alan P.
author_sort Spadafore, Maxwell
collection PubMed
description BACKGROUND: Transcription factors (TFs) form a complex regulatory network within the cell that is crucial to cell functioning and human health. While methods to establish where a TF binds to DNA are well established, these methods provide no information describing how TFs interact with one another when they do bind. TFs tend to bind the genome in clusters, and current methods to identify these clusters are either limited in scope, unable to detect relationships beyond motif similarity, or not applied to TF-TF interactions. METHODS: Here, we present a proximity-based graph clustering approach to identify TF clusters using either ChIP-seq or motif search data. We use TF co-occurrence to construct a filtered, normalized adjacency matrix and use the Markov Clustering Algorithm to partition the graph while maintaining TF-cluster and cluster-cluster interactions. We then apply our graph structure beyond clustering, using it to increase the accuracy of motif-based TFBS searching for an example TF. RESULTS: We show that our method produces small, manageable clusters that encapsulate many known, experimentally validated transcription factor interactions and that our method is capable of capturing interactions that motif similarity methods might miss. Our graph structure is able to significantly increase the accuracy of motif TFBS searching, demonstrating that the TF-TF connections within the graph correlate with biological TF-TF interactions. CONCLUSION: The interactions identified by our method correspond to biological reality and allow for fast exploration of TF clustering and regulatory dynamics. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1935-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5706350
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-57063502017-12-05 A proximity-based graph clustering method for the identification and application of transcription factor clusters Spadafore, Maxwell Najarian, Kayvan Boyle, Alan P. BMC Bioinformatics Methodology Article BACKGROUND: Transcription factors (TFs) form a complex regulatory network within the cell that is crucial to cell functioning and human health. While methods to establish where a TF binds to DNA are well established, these methods provide no information describing how TFs interact with one another when they do bind. TFs tend to bind the genome in clusters, and current methods to identify these clusters are either limited in scope, unable to detect relationships beyond motif similarity, or not applied to TF-TF interactions. METHODS: Here, we present a proximity-based graph clustering approach to identify TF clusters using either ChIP-seq or motif search data. We use TF co-occurrence to construct a filtered, normalized adjacency matrix and use the Markov Clustering Algorithm to partition the graph while maintaining TF-cluster and cluster-cluster interactions. We then apply our graph structure beyond clustering, using it to increase the accuracy of motif-based TFBS searching for an example TF. RESULTS: We show that our method produces small, manageable clusters that encapsulate many known, experimentally validated transcription factor interactions and that our method is capable of capturing interactions that motif similarity methods might miss. Our graph structure is able to significantly increase the accuracy of motif TFBS searching, demonstrating that the TF-TF connections within the graph correlate with biological TF-TF interactions. CONCLUSION: The interactions identified by our method correspond to biological reality and allow for fast exploration of TF clustering and regulatory dynamics. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1935-y) contains supplementary material, which is available to authorized users. BioMed Central 2017-11-29 /pmc/articles/PMC5706350/ /pubmed/29187152 http://dx.doi.org/10.1186/s12859-017-1935-y Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Spadafore, Maxwell
Najarian, Kayvan
Boyle, Alan P.
A proximity-based graph clustering method for the identification and application of transcription factor clusters
title A proximity-based graph clustering method for the identification and application of transcription factor clusters
title_full A proximity-based graph clustering method for the identification and application of transcription factor clusters
title_fullStr A proximity-based graph clustering method for the identification and application of transcription factor clusters
title_full_unstemmed A proximity-based graph clustering method for the identification and application of transcription factor clusters
title_short A proximity-based graph clustering method for the identification and application of transcription factor clusters
title_sort proximity-based graph clustering method for the identification and application of transcription factor clusters
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5706350/
https://www.ncbi.nlm.nih.gov/pubmed/29187152
http://dx.doi.org/10.1186/s12859-017-1935-y
work_keys_str_mv AT spadaforemaxwell aproximitybasedgraphclusteringmethodfortheidentificationandapplicationoftranscriptionfactorclusters
AT najariankayvan aproximitybasedgraphclusteringmethodfortheidentificationandapplicationoftranscriptionfactorclusters
AT boylealanp aproximitybasedgraphclusteringmethodfortheidentificationandapplicationoftranscriptionfactorclusters
AT spadaforemaxwell proximitybasedgraphclusteringmethodfortheidentificationandapplicationoftranscriptionfactorclusters
AT najariankayvan proximitybasedgraphclusteringmethodfortheidentificationandapplicationoftranscriptionfactorclusters
AT boylealanp proximitybasedgraphclusteringmethodfortheidentificationandapplicationoftranscriptionfactorclusters