Cargando…

Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework

The identification of transcription factor binding sites and cis-regulatory motifs is a frontier whereupon the rules governing protein–DNA binding are being revealed. Here, we developed a new method (DEep Sequence and Shape mOtif or DESSO) for cis-regulatory motif prediction using deep neural networ...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Jinyu, Ma, Anjun, Hoppe, Adam D, Wang, Cankun, Li, Yang, Zhang, Chi, Wang, Yan, Liu, Bingqiang, Ma, Qin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6735894/
https://www.ncbi.nlm.nih.gov/pubmed/31372637
http://dx.doi.org/10.1093/nar/gkz672
_version_ 1783450430595923968
author Yang, Jinyu
Ma, Anjun
Hoppe, Adam D
Wang, Cankun
Li, Yang
Zhang, Chi
Wang, Yan
Liu, Bingqiang
Ma, Qin
author_facet Yang, Jinyu
Ma, Anjun
Hoppe, Adam D
Wang, Cankun
Li, Yang
Zhang, Chi
Wang, Yan
Liu, Bingqiang
Ma, Qin
author_sort Yang, Jinyu
collection PubMed
description The identification of transcription factor binding sites and cis-regulatory motifs is a frontier whereupon the rules governing protein–DNA binding are being revealed. Here, we developed a new method (DEep Sequence and Shape mOtif or DESSO) for cis-regulatory motif prediction using deep neural networks and the binomial distribution model. DESSO outperformed existing tools, including DeepBind, in predicting motifs in 690 human ENCODE ChIP-sequencing datasets. Furthermore, the deep-learning framework of DESSO expanded motif discovery beyond the state-of-the-art by allowing the identification of known and new protein–protein–DNA tethering interactions in human transcription factors (TFs). Specifically, 61 putative tethering interactions were identified among the 100 TFs expressed in the K562 cell line. In this work, the power of DESSO was further expanded by integrating the detection of DNA shape features. We found that shape information has strong predictive power for TF–DNA binding and provides new putative shape motif information for human TFs. Thus, DESSO improves in the identification and structural analysis of TF binding sites, by integrating the complexities of DNA binding into a deep-learning framework.
format Online
Article
Text
id pubmed-6735894
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-67358942019-09-16 Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework Yang, Jinyu Ma, Anjun Hoppe, Adam D Wang, Cankun Li, Yang Zhang, Chi Wang, Yan Liu, Bingqiang Ma, Qin Nucleic Acids Res Computational Biology The identification of transcription factor binding sites and cis-regulatory motifs is a frontier whereupon the rules governing protein–DNA binding are being revealed. Here, we developed a new method (DEep Sequence and Shape mOtif or DESSO) for cis-regulatory motif prediction using deep neural networks and the binomial distribution model. DESSO outperformed existing tools, including DeepBind, in predicting motifs in 690 human ENCODE ChIP-sequencing datasets. Furthermore, the deep-learning framework of DESSO expanded motif discovery beyond the state-of-the-art by allowing the identification of known and new protein–protein–DNA tethering interactions in human transcription factors (TFs). Specifically, 61 putative tethering interactions were identified among the 100 TFs expressed in the K562 cell line. In this work, the power of DESSO was further expanded by integrating the detection of DNA shape features. We found that shape information has strong predictive power for TF–DNA binding and provides new putative shape motif information for human TFs. Thus, DESSO improves in the identification and structural analysis of TF binding sites, by integrating the complexities of DNA binding into a deep-learning framework. Oxford University Press 2019-09-05 2019-08-02 /pmc/articles/PMC6735894/ /pubmed/31372637 http://dx.doi.org/10.1093/nar/gkz672 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Computational Biology
Yang, Jinyu
Ma, Anjun
Hoppe, Adam D
Wang, Cankun
Li, Yang
Zhang, Chi
Wang, Yan
Liu, Bingqiang
Ma, Qin
Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework
title Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework
title_full Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework
title_fullStr Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework
title_full_unstemmed Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework
title_short Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework
title_sort prediction of regulatory motifs from human chip-sequencing data using a deep learning framework
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6735894/
https://www.ncbi.nlm.nih.gov/pubmed/31372637
http://dx.doi.org/10.1093/nar/gkz672
work_keys_str_mv AT yangjinyu predictionofregulatorymotifsfromhumanchipsequencingdatausingadeeplearningframework
AT maanjun predictionofregulatorymotifsfromhumanchipsequencingdatausingadeeplearningframework
AT hoppeadamd predictionofregulatorymotifsfromhumanchipsequencingdatausingadeeplearningframework
AT wangcankun predictionofregulatorymotifsfromhumanchipsequencingdatausingadeeplearningframework
AT liyang predictionofregulatorymotifsfromhumanchipsequencingdatausingadeeplearningframework
AT zhangchi predictionofregulatorymotifsfromhumanchipsequencingdatausingadeeplearningframework
AT wangyan predictionofregulatorymotifsfromhumanchipsequencingdatausingadeeplearningframework
AT liubingqiang predictionofregulatorymotifsfromhumanchipsequencingdatausingadeeplearningframework
AT maqin predictionofregulatorymotifsfromhumanchipsequencingdatausingadeeplearningframework