Cargando…

A proximity-based graph clustering method for the identification and application of transcription factor clusters

BACKGROUND: Transcription factors (TFs) form a complex regulatory network within the cell that is crucial to cell functioning and human health. While methods to establish where a TF binds to DNA are well established, these methods provide no information describing how TFs interact with one another w...

Descripción completa

Detalles Bibliográficos
Autores principales:	Spadafore, Maxwell, Najarian, Kayvan, Boyle, Alan P.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2017
Materias:	Methodology Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5706350/ https://www.ncbi.nlm.nih.gov/pubmed/29187152 http://dx.doi.org/10.1186/s12859-017-1935-y

_version_	1783282214001180672
author	Spadafore, Maxwell Najarian, Kayvan Boyle, Alan P.
author_facet	Spadafore, Maxwell Najarian, Kayvan Boyle, Alan P.
author_sort	Spadafore, Maxwell
collection	PubMed
description	BACKGROUND: Transcription factors (TFs) form a complex regulatory network within the cell that is crucial to cell functioning and human health. While methods to establish where a TF binds to DNA are well established, these methods provide no information describing how TFs interact with one another when they do bind. TFs tend to bind the genome in clusters, and current methods to identify these clusters are either limited in scope, unable to detect relationships beyond motif similarity, or not applied to TF-TF interactions. METHODS: Here, we present a proximity-based graph clustering approach to identify TF clusters using either ChIP-seq or motif search data. We use TF co-occurrence to construct a filtered, normalized adjacency matrix and use the Markov Clustering Algorithm to partition the graph while maintaining TF-cluster and cluster-cluster interactions. We then apply our graph structure beyond clustering, using it to increase the accuracy of motif-based TFBS searching for an example TF. RESULTS: We show that our method produces small, manageable clusters that encapsulate many known, experimentally validated transcription factor interactions and that our method is capable of capturing interactions that motif similarity methods might miss. Our graph structure is able to significantly increase the accuracy of motif TFBS searching, demonstrating that the TF-TF connections within the graph correlate with biological TF-TF interactions. CONCLUSION: The interactions identified by our method correspond to biological reality and allow for fast exploration of TF clustering and regulatory dynamics. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1935-y) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-5706350
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-57063502017-12-05 A proximity-based graph clustering method for the identification and application of transcription factor clusters Spadafore, Maxwell Najarian, Kayvan Boyle, Alan P. BMC Bioinformatics Methodology Article BACKGROUND: Transcription factors (TFs) form a complex regulatory network within the cell that is crucial to cell functioning and human health. While methods to establish where a TF binds to DNA are well established, these methods provide no information describing how TFs interact with one another when they do bind. TFs tend to bind the genome in clusters, and current methods to identify these clusters are either limited in scope, unable to detect relationships beyond motif similarity, or not applied to TF-TF interactions. METHODS: Here, we present a proximity-based graph clustering approach to identify TF clusters using either ChIP-seq or motif search data. We use TF co-occurrence to construct a filtered, normalized adjacency matrix and use the Markov Clustering Algorithm to partition the graph while maintaining TF-cluster and cluster-cluster interactions. We then apply our graph structure beyond clustering, using it to increase the accuracy of motif-based TFBS searching for an example TF. RESULTS: We show that our method produces small, manageable clusters that encapsulate many known, experimentally validated transcription factor interactions and that our method is capable of capturing interactions that motif similarity methods might miss. Our graph structure is able to significantly increase the accuracy of motif TFBS searching, demonstrating that the TF-TF connections within the graph correlate with biological TF-TF interactions. CONCLUSION: The interactions identified by our method correspond to biological reality and allow for fast exploration of TF clustering and regulatory dynamics. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1935-y) contains supplementary material, which is available to authorized users. BioMed Central 2017-11-29 /pmc/articles/PMC5706350/ /pubmed/29187152 http://dx.doi.org/10.1186/s12859-017-1935-y Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Methodology Article Spadafore, Maxwell Najarian, Kayvan Boyle, Alan P. A proximity-based graph clustering method for the identification and application of transcription factor clusters
title	A proximity-based graph clustering method for the identification and application of transcription factor clusters
title_full	A proximity-based graph clustering method for the identification and application of transcription factor clusters
title_fullStr	A proximity-based graph clustering method for the identification and application of transcription factor clusters
title_full_unstemmed	A proximity-based graph clustering method for the identification and application of transcription factor clusters
title_short	A proximity-based graph clustering method for the identification and application of transcription factor clusters
title_sort	proximity-based graph clustering method for the identification and application of transcription factor clusters
topic	Methodology Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5706350/ https://www.ncbi.nlm.nih.gov/pubmed/29187152 http://dx.doi.org/10.1186/s12859-017-1935-y
work_keys_str_mv	AT spadaforemaxwell aproximitybasedgraphclusteringmethodfortheidentificationandapplicationoftranscriptionfactorclusters AT najariankayvan aproximitybasedgraphclusteringmethodfortheidentificationandapplicationoftranscriptionfactorclusters AT boylealanp aproximitybasedgraphclusteringmethodfortheidentificationandapplicationoftranscriptionfactorclusters AT spadaforemaxwell proximitybasedgraphclusteringmethodfortheidentificationandapplicationoftranscriptionfactorclusters AT najariankayvan proximitybasedgraphclusteringmethodfortheidentificationandapplicationoftranscriptionfactorclusters AT boylealanp proximitybasedgraphclusteringmethodfortheidentificationandapplicationoftranscriptionfactorclusters

A proximity-based graph clustering method for the identification and application of transcription factor clusters

Ejemplares similares