Cargando…

Base-resolution methylation patterns accurately predict transcription factor bindings in vivo

Detecting in vivo transcription factor (TF) binding is important for understanding gene regulatory circuitries. ChIP-seq is a powerful technique to empirically define TF binding in vivo. However, the multitude of distinct TFs makes genome-wide profiling for them all labor-intensive and costly. Algor...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Tianlei, Li, Ben, Zhao, Meng, Szulwach, Keith E., Street, R. Craig, Lin, Li, Yao, Bing, Zhang, Feiran, Jin, Peng, Wu, Hao, Qin, Zhaohui S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4357735/
https://www.ncbi.nlm.nih.gov/pubmed/25722376
http://dx.doi.org/10.1093/nar/gkv151
_version_ 1782361191191937024
author Xu, Tianlei
Li, Ben
Zhao, Meng
Szulwach, Keith E.
Street, R. Craig
Lin, Li
Yao, Bing
Zhang, Feiran
Jin, Peng
Wu, Hao
Qin, Zhaohui S.
author_facet Xu, Tianlei
Li, Ben
Zhao, Meng
Szulwach, Keith E.
Street, R. Craig
Lin, Li
Yao, Bing
Zhang, Feiran
Jin, Peng
Wu, Hao
Qin, Zhaohui S.
author_sort Xu, Tianlei
collection PubMed
description Detecting in vivo transcription factor (TF) binding is important for understanding gene regulatory circuitries. ChIP-seq is a powerful technique to empirically define TF binding in vivo. However, the multitude of distinct TFs makes genome-wide profiling for them all labor-intensive and costly. Algorithms for in silico prediction of TF binding have been developed, based mostly on histone modification or DNase I hypersensitivity data in conjunction with DNA motif and other genomic features. However, technical limitations of these methods prevent them from being applied broadly, especially in clinical settings. We conducted a comprehensive survey involving multiple cell lines, TFs, and methylation types and found that there are intimate relationships between TF binding and methylation level changes around the binding sites. Exploiting the connection between DNA methylation and TF binding, we proposed a novel supervised learning approach to predict TF–DNA interaction using data from base-resolution whole-genome methylation sequencing experiments. We devised beta-binomial models to characterize methylation data around TF binding sites and the background. Along with other static genomic features, we adopted a random forest framework to predict TF–DNA interaction. After conducting comprehensive tests, we saw that the proposed method accurately predicts TF binding and performs favorably versus competing methods.
format Online
Article
Text
id pubmed-4357735
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-43577352015-03-20 Base-resolution methylation patterns accurately predict transcription factor bindings in vivo Xu, Tianlei Li, Ben Zhao, Meng Szulwach, Keith E. Street, R. Craig Lin, Li Yao, Bing Zhang, Feiran Jin, Peng Wu, Hao Qin, Zhaohui S. Nucleic Acids Res Genomics Detecting in vivo transcription factor (TF) binding is important for understanding gene regulatory circuitries. ChIP-seq is a powerful technique to empirically define TF binding in vivo. However, the multitude of distinct TFs makes genome-wide profiling for them all labor-intensive and costly. Algorithms for in silico prediction of TF binding have been developed, based mostly on histone modification or DNase I hypersensitivity data in conjunction with DNA motif and other genomic features. However, technical limitations of these methods prevent them from being applied broadly, especially in clinical settings. We conducted a comprehensive survey involving multiple cell lines, TFs, and methylation types and found that there are intimate relationships between TF binding and methylation level changes around the binding sites. Exploiting the connection between DNA methylation and TF binding, we proposed a novel supervised learning approach to predict TF–DNA interaction using data from base-resolution whole-genome methylation sequencing experiments. We devised beta-binomial models to characterize methylation data around TF binding sites and the background. Along with other static genomic features, we adopted a random forest framework to predict TF–DNA interaction. After conducting comprehensive tests, we saw that the proposed method accurately predicts TF binding and performs favorably versus competing methods. Oxford University Press 2015-03-11 2015-02-26 /pmc/articles/PMC4357735/ /pubmed/25722376 http://dx.doi.org/10.1093/nar/gkv151 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genomics
Xu, Tianlei
Li, Ben
Zhao, Meng
Szulwach, Keith E.
Street, R. Craig
Lin, Li
Yao, Bing
Zhang, Feiran
Jin, Peng
Wu, Hao
Qin, Zhaohui S.
Base-resolution methylation patterns accurately predict transcription factor bindings in vivo
title Base-resolution methylation patterns accurately predict transcription factor bindings in vivo
title_full Base-resolution methylation patterns accurately predict transcription factor bindings in vivo
title_fullStr Base-resolution methylation patterns accurately predict transcription factor bindings in vivo
title_full_unstemmed Base-resolution methylation patterns accurately predict transcription factor bindings in vivo
title_short Base-resolution methylation patterns accurately predict transcription factor bindings in vivo
title_sort base-resolution methylation patterns accurately predict transcription factor bindings in vivo
topic Genomics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4357735/
https://www.ncbi.nlm.nih.gov/pubmed/25722376
http://dx.doi.org/10.1093/nar/gkv151
work_keys_str_mv AT xutianlei baseresolutionmethylationpatternsaccuratelypredicttranscriptionfactorbindingsinvivo
AT liben baseresolutionmethylationpatternsaccuratelypredicttranscriptionfactorbindingsinvivo
AT zhaomeng baseresolutionmethylationpatternsaccuratelypredicttranscriptionfactorbindingsinvivo
AT szulwachkeithe baseresolutionmethylationpatternsaccuratelypredicttranscriptionfactorbindingsinvivo
AT streetrcraig baseresolutionmethylationpatternsaccuratelypredicttranscriptionfactorbindingsinvivo
AT linli baseresolutionmethylationpatternsaccuratelypredicttranscriptionfactorbindingsinvivo
AT yaobing baseresolutionmethylationpatternsaccuratelypredicttranscriptionfactorbindingsinvivo
AT zhangfeiran baseresolutionmethylationpatternsaccuratelypredicttranscriptionfactorbindingsinvivo
AT jinpeng baseresolutionmethylationpatternsaccuratelypredicttranscriptionfactorbindingsinvivo
AT wuhao baseresolutionmethylationpatternsaccuratelypredicttranscriptionfactorbindingsinvivo
AT qinzhaohuis baseresolutionmethylationpatternsaccuratelypredicttranscriptionfactorbindingsinvivo