Cargando…

Analysis of Co-Associated Transcription Factors via Ordered Adjacency Differences on Motif Distribution

Transcription factors (TFs) binding to specific DNA sequences or motifs, are elementary to the regulation of transcription. The gene is regulated by a combination of TFs in close proximity. Analysis of co-TFs is an important problem in understanding the mechanism of transcriptional regulation. Recen...

Descripción completa

Detalles Bibliográficos
Autores principales: Pan, Gaofeng, Tang, Jijun, Guo, Fei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5327392/
https://www.ncbi.nlm.nih.gov/pubmed/28240320
http://dx.doi.org/10.1038/srep43597
_version_ 1782510719697158144
author Pan, Gaofeng
Tang, Jijun
Guo, Fei
author_facet Pan, Gaofeng
Tang, Jijun
Guo, Fei
author_sort Pan, Gaofeng
collection PubMed
description Transcription factors (TFs) binding to specific DNA sequences or motifs, are elementary to the regulation of transcription. The gene is regulated by a combination of TFs in close proximity. Analysis of co-TFs is an important problem in understanding the mechanism of transcriptional regulation. Recently, ChIP-seq in mapping TF provides a large amount of experimental data to analyze co-TFs. Several studies show that if two TFs are co-associated, the relative distance between TFs exhibits a peak-like distribution. In order to analyze co-TFs, we develop a novel method to evaluate the associated situation between TFs. We design an adjacency score based on ordered differences, which can illustrate co-TF binding affinities for motif analysis. For all candidate motifs, we calculate corresponding adjacency scores, and then list descending-order motifs. From these lists, we can find co-TFs for candidate motifs. On ChIP-seq datasets, our method obtains best AUC results on five datasets, 0.9432 for NMYC, 0.9109 for KLF4, 0.9006 for ZFX, 0.8892 for ESRRB, 0.8920 for E2F1. Our method has great stability on large sample datasets. AUC results of our method on all datasets are above 0.8.
format Online
Article
Text
id pubmed-5327392
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-53273922017-03-03 Analysis of Co-Associated Transcription Factors via Ordered Adjacency Differences on Motif Distribution Pan, Gaofeng Tang, Jijun Guo, Fei Sci Rep Article Transcription factors (TFs) binding to specific DNA sequences or motifs, are elementary to the regulation of transcription. The gene is regulated by a combination of TFs in close proximity. Analysis of co-TFs is an important problem in understanding the mechanism of transcriptional regulation. Recently, ChIP-seq in mapping TF provides a large amount of experimental data to analyze co-TFs. Several studies show that if two TFs are co-associated, the relative distance between TFs exhibits a peak-like distribution. In order to analyze co-TFs, we develop a novel method to evaluate the associated situation between TFs. We design an adjacency score based on ordered differences, which can illustrate co-TF binding affinities for motif analysis. For all candidate motifs, we calculate corresponding adjacency scores, and then list descending-order motifs. From these lists, we can find co-TFs for candidate motifs. On ChIP-seq datasets, our method obtains best AUC results on five datasets, 0.9432 for NMYC, 0.9109 for KLF4, 0.9006 for ZFX, 0.8892 for ESRRB, 0.8920 for E2F1. Our method has great stability on large sample datasets. AUC results of our method on all datasets are above 0.8. Nature Publishing Group 2017-02-27 /pmc/articles/PMC5327392/ /pubmed/28240320 http://dx.doi.org/10.1038/srep43597 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Pan, Gaofeng
Tang, Jijun
Guo, Fei
Analysis of Co-Associated Transcription Factors via Ordered Adjacency Differences on Motif Distribution
title Analysis of Co-Associated Transcription Factors via Ordered Adjacency Differences on Motif Distribution
title_full Analysis of Co-Associated Transcription Factors via Ordered Adjacency Differences on Motif Distribution
title_fullStr Analysis of Co-Associated Transcription Factors via Ordered Adjacency Differences on Motif Distribution
title_full_unstemmed Analysis of Co-Associated Transcription Factors via Ordered Adjacency Differences on Motif Distribution
title_short Analysis of Co-Associated Transcription Factors via Ordered Adjacency Differences on Motif Distribution
title_sort analysis of co-associated transcription factors via ordered adjacency differences on motif distribution
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5327392/
https://www.ncbi.nlm.nih.gov/pubmed/28240320
http://dx.doi.org/10.1038/srep43597
work_keys_str_mv AT pangaofeng analysisofcoassociatedtranscriptionfactorsviaorderedadjacencydifferencesonmotifdistribution
AT tangjijun analysisofcoassociatedtranscriptionfactorsviaorderedadjacencydifferencesonmotifdistribution
AT guofei analysisofcoassociatedtranscriptionfactorsviaorderedadjacencydifferencesonmotifdistribution