Cargando…
Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework
The identification of transcription factor binding sites and cis-regulatory motifs is a frontier whereupon the rules governing protein–DNA binding are being revealed. Here, we developed a new method (DEep Sequence and Shape mOtif or DESSO) for cis-regulatory motif prediction using deep neural networ...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6735894/ https://www.ncbi.nlm.nih.gov/pubmed/31372637 http://dx.doi.org/10.1093/nar/gkz672 |
_version_ | 1783450430595923968 |
---|---|
author | Yang, Jinyu Ma, Anjun Hoppe, Adam D Wang, Cankun Li, Yang Zhang, Chi Wang, Yan Liu, Bingqiang Ma, Qin |
author_facet | Yang, Jinyu Ma, Anjun Hoppe, Adam D Wang, Cankun Li, Yang Zhang, Chi Wang, Yan Liu, Bingqiang Ma, Qin |
author_sort | Yang, Jinyu |
collection | PubMed |
description | The identification of transcription factor binding sites and cis-regulatory motifs is a frontier whereupon the rules governing protein–DNA binding are being revealed. Here, we developed a new method (DEep Sequence and Shape mOtif or DESSO) for cis-regulatory motif prediction using deep neural networks and the binomial distribution model. DESSO outperformed existing tools, including DeepBind, in predicting motifs in 690 human ENCODE ChIP-sequencing datasets. Furthermore, the deep-learning framework of DESSO expanded motif discovery beyond the state-of-the-art by allowing the identification of known and new protein–protein–DNA tethering interactions in human transcription factors (TFs). Specifically, 61 putative tethering interactions were identified among the 100 TFs expressed in the K562 cell line. In this work, the power of DESSO was further expanded by integrating the detection of DNA shape features. We found that shape information has strong predictive power for TF–DNA binding and provides new putative shape motif information for human TFs. Thus, DESSO improves in the identification and structural analysis of TF binding sites, by integrating the complexities of DNA binding into a deep-learning framework. |
format | Online Article Text |
id | pubmed-6735894 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-67358942019-09-16 Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework Yang, Jinyu Ma, Anjun Hoppe, Adam D Wang, Cankun Li, Yang Zhang, Chi Wang, Yan Liu, Bingqiang Ma, Qin Nucleic Acids Res Computational Biology The identification of transcription factor binding sites and cis-regulatory motifs is a frontier whereupon the rules governing protein–DNA binding are being revealed. Here, we developed a new method (DEep Sequence and Shape mOtif or DESSO) for cis-regulatory motif prediction using deep neural networks and the binomial distribution model. DESSO outperformed existing tools, including DeepBind, in predicting motifs in 690 human ENCODE ChIP-sequencing datasets. Furthermore, the deep-learning framework of DESSO expanded motif discovery beyond the state-of-the-art by allowing the identification of known and new protein–protein–DNA tethering interactions in human transcription factors (TFs). Specifically, 61 putative tethering interactions were identified among the 100 TFs expressed in the K562 cell line. In this work, the power of DESSO was further expanded by integrating the detection of DNA shape features. We found that shape information has strong predictive power for TF–DNA binding and provides new putative shape motif information for human TFs. Thus, DESSO improves in the identification and structural analysis of TF binding sites, by integrating the complexities of DNA binding into a deep-learning framework. Oxford University Press 2019-09-05 2019-08-02 /pmc/articles/PMC6735894/ /pubmed/31372637 http://dx.doi.org/10.1093/nar/gkz672 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Computational Biology Yang, Jinyu Ma, Anjun Hoppe, Adam D Wang, Cankun Li, Yang Zhang, Chi Wang, Yan Liu, Bingqiang Ma, Qin Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework |
title | Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework |
title_full | Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework |
title_fullStr | Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework |
title_full_unstemmed | Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework |
title_short | Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework |
title_sort | prediction of regulatory motifs from human chip-sequencing data using a deep learning framework |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6735894/ https://www.ncbi.nlm.nih.gov/pubmed/31372637 http://dx.doi.org/10.1093/nar/gkz672 |
work_keys_str_mv | AT yangjinyu predictionofregulatorymotifsfromhumanchipsequencingdatausingadeeplearningframework AT maanjun predictionofregulatorymotifsfromhumanchipsequencingdatausingadeeplearningframework AT hoppeadamd predictionofregulatorymotifsfromhumanchipsequencingdatausingadeeplearningframework AT wangcankun predictionofregulatorymotifsfromhumanchipsequencingdatausingadeeplearningframework AT liyang predictionofregulatorymotifsfromhumanchipsequencingdatausingadeeplearningframework AT zhangchi predictionofregulatorymotifsfromhumanchipsequencingdatausingadeeplearningframework AT wangyan predictionofregulatorymotifsfromhumanchipsequencingdatausingadeeplearningframework AT liubingqiang predictionofregulatorymotifsfromhumanchipsequencingdatausingadeeplearningframework AT maqin predictionofregulatorymotifsfromhumanchipsequencingdatausingadeeplearningframework |