Cargando…

Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data

Transcription factors (TFs) play crucial roles in regulating gene expression through interactions with specific DNA sequences. Recently, the sequence motif of almost 400 human TFs have been identified using high-throughput SELEX sequencing. However, there remain a large number of TFs (∼800) with no...

Descripción completa

Detalles Bibliográficos
Autores principales: Zamanighomi, Mahdi, Lin, Zhixiang, Wang, Yong, Jiang, Rui, Wong, Wing Hung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5449588/
https://www.ncbi.nlm.nih.gov/pubmed/28472398
http://dx.doi.org/10.1093/nar/gkx358
_version_ 1783239810798845952
author Zamanighomi, Mahdi
Lin, Zhixiang
Wang, Yong
Jiang, Rui
Wong, Wing Hung
author_facet Zamanighomi, Mahdi
Lin, Zhixiang
Wang, Yong
Jiang, Rui
Wong, Wing Hung
author_sort Zamanighomi, Mahdi
collection PubMed
description Transcription factors (TFs) play crucial roles in regulating gene expression through interactions with specific DNA sequences. Recently, the sequence motif of almost 400 human TFs have been identified using high-throughput SELEX sequencing. However, there remain a large number of TFs (∼800) with no high-throughput-derived binding motifs. Computational methods capable of associating known motifs to such TFs will avoid tremendous experimental efforts and enable deeper understanding of transcriptional regulatory functions. We present a method to associate known motifs to TFs (MATLAB code is available in Supplementary Materials). Our method is based on a probabilistic framework that not only exploits DNA-binding domains and specificities, but also integrates open chromatin, gene expression and genomic data to accurately infer monomeric and homodimeric binding motifs. Our analysis resulted in the assignment of motifs to 200 TFs with no SELEX-derived motifs, roughly a 50% increase compared to the existing coverage.
format Online
Article
Text
id pubmed-5449588
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-54495882017-06-05 Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data Zamanighomi, Mahdi Lin, Zhixiang Wang, Yong Jiang, Rui Wong, Wing Hung Nucleic Acids Res Computational Biology Transcription factors (TFs) play crucial roles in regulating gene expression through interactions with specific DNA sequences. Recently, the sequence motif of almost 400 human TFs have been identified using high-throughput SELEX sequencing. However, there remain a large number of TFs (∼800) with no high-throughput-derived binding motifs. Computational methods capable of associating known motifs to such TFs will avoid tremendous experimental efforts and enable deeper understanding of transcriptional regulatory functions. We present a method to associate known motifs to TFs (MATLAB code is available in Supplementary Materials). Our method is based on a probabilistic framework that not only exploits DNA-binding domains and specificities, but also integrates open chromatin, gene expression and genomic data to accurately infer monomeric and homodimeric binding motifs. Our analysis resulted in the assignment of motifs to 200 TFs with no SELEX-derived motifs, roughly a 50% increase compared to the existing coverage. Oxford University Press 2017-06-02 2017-05-03 /pmc/articles/PMC5449588/ /pubmed/28472398 http://dx.doi.org/10.1093/nar/gkx358 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Computational Biology
Zamanighomi, Mahdi
Lin, Zhixiang
Wang, Yong
Jiang, Rui
Wong, Wing Hung
Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data
title Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data
title_full Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data
title_fullStr Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data
title_full_unstemmed Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data
title_short Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data
title_sort predicting transcription factor binding motifs from dna-binding domains, chromatin accessibility and gene expression data
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5449588/
https://www.ncbi.nlm.nih.gov/pubmed/28472398
http://dx.doi.org/10.1093/nar/gkx358
work_keys_str_mv AT zamanighomimahdi predictingtranscriptionfactorbindingmotifsfromdnabindingdomainschromatinaccessibilityandgeneexpressiondata
AT linzhixiang predictingtranscriptionfactorbindingmotifsfromdnabindingdomainschromatinaccessibilityandgeneexpressiondata
AT wangyong predictingtranscriptionfactorbindingmotifsfromdnabindingdomainschromatinaccessibilityandgeneexpressiondata
AT jiangrui predictingtranscriptionfactorbindingmotifsfromdnabindingdomainschromatinaccessibilityandgeneexpressiondata
AT wongwinghung predictingtranscriptionfactorbindingmotifsfromdnabindingdomainschromatinaccessibilityandgeneexpressiondata