Cargando…
Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data
Transcription factors (TFs) play crucial roles in regulating gene expression through interactions with specific DNA sequences. Recently, the sequence motif of almost 400 human TFs have been identified using high-throughput SELEX sequencing. However, there remain a large number of TFs (∼800) with no...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5449588/ https://www.ncbi.nlm.nih.gov/pubmed/28472398 http://dx.doi.org/10.1093/nar/gkx358 |
_version_ | 1783239810798845952 |
---|---|
author | Zamanighomi, Mahdi Lin, Zhixiang Wang, Yong Jiang, Rui Wong, Wing Hung |
author_facet | Zamanighomi, Mahdi Lin, Zhixiang Wang, Yong Jiang, Rui Wong, Wing Hung |
author_sort | Zamanighomi, Mahdi |
collection | PubMed |
description | Transcription factors (TFs) play crucial roles in regulating gene expression through interactions with specific DNA sequences. Recently, the sequence motif of almost 400 human TFs have been identified using high-throughput SELEX sequencing. However, there remain a large number of TFs (∼800) with no high-throughput-derived binding motifs. Computational methods capable of associating known motifs to such TFs will avoid tremendous experimental efforts and enable deeper understanding of transcriptional regulatory functions. We present a method to associate known motifs to TFs (MATLAB code is available in Supplementary Materials). Our method is based on a probabilistic framework that not only exploits DNA-binding domains and specificities, but also integrates open chromatin, gene expression and genomic data to accurately infer monomeric and homodimeric binding motifs. Our analysis resulted in the assignment of motifs to 200 TFs with no SELEX-derived motifs, roughly a 50% increase compared to the existing coverage. |
format | Online Article Text |
id | pubmed-5449588 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-54495882017-06-05 Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data Zamanighomi, Mahdi Lin, Zhixiang Wang, Yong Jiang, Rui Wong, Wing Hung Nucleic Acids Res Computational Biology Transcription factors (TFs) play crucial roles in regulating gene expression through interactions with specific DNA sequences. Recently, the sequence motif of almost 400 human TFs have been identified using high-throughput SELEX sequencing. However, there remain a large number of TFs (∼800) with no high-throughput-derived binding motifs. Computational methods capable of associating known motifs to such TFs will avoid tremendous experimental efforts and enable deeper understanding of transcriptional regulatory functions. We present a method to associate known motifs to TFs (MATLAB code is available in Supplementary Materials). Our method is based on a probabilistic framework that not only exploits DNA-binding domains and specificities, but also integrates open chromatin, gene expression and genomic data to accurately infer monomeric and homodimeric binding motifs. Our analysis resulted in the assignment of motifs to 200 TFs with no SELEX-derived motifs, roughly a 50% increase compared to the existing coverage. Oxford University Press 2017-06-02 2017-05-03 /pmc/articles/PMC5449588/ /pubmed/28472398 http://dx.doi.org/10.1093/nar/gkx358 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Computational Biology Zamanighomi, Mahdi Lin, Zhixiang Wang, Yong Jiang, Rui Wong, Wing Hung Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data |
title | Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data |
title_full | Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data |
title_fullStr | Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data |
title_full_unstemmed | Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data |
title_short | Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data |
title_sort | predicting transcription factor binding motifs from dna-binding domains, chromatin accessibility and gene expression data |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5449588/ https://www.ncbi.nlm.nih.gov/pubmed/28472398 http://dx.doi.org/10.1093/nar/gkx358 |
work_keys_str_mv | AT zamanighomimahdi predictingtranscriptionfactorbindingmotifsfromdnabindingdomainschromatinaccessibilityandgeneexpressiondata AT linzhixiang predictingtranscriptionfactorbindingmotifsfromdnabindingdomainschromatinaccessibilityandgeneexpressiondata AT wangyong predictingtranscriptionfactorbindingmotifsfromdnabindingdomainschromatinaccessibilityandgeneexpressiondata AT jiangrui predictingtranscriptionfactorbindingmotifsfromdnabindingdomainschromatinaccessibilityandgeneexpressiondata AT wongwinghung predictingtranscriptionfactorbindingmotifsfromdnabindingdomainschromatinaccessibilityandgeneexpressiondata |