Cargando…
Analysis of Co-Associated Transcription Factors via Ordered Adjacency Differences on Motif Distribution
Transcription factors (TFs) binding to specific DNA sequences or motifs, are elementary to the regulation of transcription. The gene is regulated by a combination of TFs in close proximity. Analysis of co-TFs is an important problem in understanding the mechanism of transcriptional regulation. Recen...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5327392/ https://www.ncbi.nlm.nih.gov/pubmed/28240320 http://dx.doi.org/10.1038/srep43597 |
_version_ | 1782510719697158144 |
---|---|
author | Pan, Gaofeng Tang, Jijun Guo, Fei |
author_facet | Pan, Gaofeng Tang, Jijun Guo, Fei |
author_sort | Pan, Gaofeng |
collection | PubMed |
description | Transcription factors (TFs) binding to specific DNA sequences or motifs, are elementary to the regulation of transcription. The gene is regulated by a combination of TFs in close proximity. Analysis of co-TFs is an important problem in understanding the mechanism of transcriptional regulation. Recently, ChIP-seq in mapping TF provides a large amount of experimental data to analyze co-TFs. Several studies show that if two TFs are co-associated, the relative distance between TFs exhibits a peak-like distribution. In order to analyze co-TFs, we develop a novel method to evaluate the associated situation between TFs. We design an adjacency score based on ordered differences, which can illustrate co-TF binding affinities for motif analysis. For all candidate motifs, we calculate corresponding adjacency scores, and then list descending-order motifs. From these lists, we can find co-TFs for candidate motifs. On ChIP-seq datasets, our method obtains best AUC results on five datasets, 0.9432 for NMYC, 0.9109 for KLF4, 0.9006 for ZFX, 0.8892 for ESRRB, 0.8920 for E2F1. Our method has great stability on large sample datasets. AUC results of our method on all datasets are above 0.8. |
format | Online Article Text |
id | pubmed-5327392 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-53273922017-03-03 Analysis of Co-Associated Transcription Factors via Ordered Adjacency Differences on Motif Distribution Pan, Gaofeng Tang, Jijun Guo, Fei Sci Rep Article Transcription factors (TFs) binding to specific DNA sequences or motifs, are elementary to the regulation of transcription. The gene is regulated by a combination of TFs in close proximity. Analysis of co-TFs is an important problem in understanding the mechanism of transcriptional regulation. Recently, ChIP-seq in mapping TF provides a large amount of experimental data to analyze co-TFs. Several studies show that if two TFs are co-associated, the relative distance between TFs exhibits a peak-like distribution. In order to analyze co-TFs, we develop a novel method to evaluate the associated situation between TFs. We design an adjacency score based on ordered differences, which can illustrate co-TF binding affinities for motif analysis. For all candidate motifs, we calculate corresponding adjacency scores, and then list descending-order motifs. From these lists, we can find co-TFs for candidate motifs. On ChIP-seq datasets, our method obtains best AUC results on five datasets, 0.9432 for NMYC, 0.9109 for KLF4, 0.9006 for ZFX, 0.8892 for ESRRB, 0.8920 for E2F1. Our method has great stability on large sample datasets. AUC results of our method on all datasets are above 0.8. Nature Publishing Group 2017-02-27 /pmc/articles/PMC5327392/ /pubmed/28240320 http://dx.doi.org/10.1038/srep43597 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Pan, Gaofeng Tang, Jijun Guo, Fei Analysis of Co-Associated Transcription Factors via Ordered Adjacency Differences on Motif Distribution |
title | Analysis of Co-Associated Transcription Factors via Ordered Adjacency Differences on Motif Distribution |
title_full | Analysis of Co-Associated Transcription Factors via Ordered Adjacency Differences on Motif Distribution |
title_fullStr | Analysis of Co-Associated Transcription Factors via Ordered Adjacency Differences on Motif Distribution |
title_full_unstemmed | Analysis of Co-Associated Transcription Factors via Ordered Adjacency Differences on Motif Distribution |
title_short | Analysis of Co-Associated Transcription Factors via Ordered Adjacency Differences on Motif Distribution |
title_sort | analysis of co-associated transcription factors via ordered adjacency differences on motif distribution |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5327392/ https://www.ncbi.nlm.nih.gov/pubmed/28240320 http://dx.doi.org/10.1038/srep43597 |
work_keys_str_mv | AT pangaofeng analysisofcoassociatedtranscriptionfactorsviaorderedadjacencydifferencesonmotifdistribution AT tangjijun analysisofcoassociatedtranscriptionfactorsviaorderedadjacencydifferencesonmotifdistribution AT guofei analysisofcoassociatedtranscriptionfactorsviaorderedadjacencydifferencesonmotifdistribution |