Cargando…

A deep learning framework for modeling structural features of RNA-binding protein targets

RNA-binding proteins (RBPs) play important roles in the post-transcriptional control of RNAs. Identifying RBP binding sites and characterizing RBP binding preferences are key steps toward understanding the basic mechanisms of the post-transcriptional gene regulation. Though numerous computational me...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Sai, Zhou, Jingtian, Hu, Hailin, Gong, Haipeng, Chen, Ligong, Cheng, Chao, Zeng, Jianyang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4770198/
https://www.ncbi.nlm.nih.gov/pubmed/26467480
http://dx.doi.org/10.1093/nar/gkv1025
_version_ 1782418216824340480
author Zhang, Sai
Zhou, Jingtian
Hu, Hailin
Gong, Haipeng
Chen, Ligong
Cheng, Chao
Zeng, Jianyang
author_facet Zhang, Sai
Zhou, Jingtian
Hu, Hailin
Gong, Haipeng
Chen, Ligong
Cheng, Chao
Zeng, Jianyang
author_sort Zhang, Sai
collection PubMed
description RNA-binding proteins (RBPs) play important roles in the post-transcriptional control of RNAs. Identifying RBP binding sites and characterizing RBP binding preferences are key steps toward understanding the basic mechanisms of the post-transcriptional gene regulation. Though numerous computational methods have been developed for modeling RBP binding preferences, discovering a complete structural representation of the RBP targets by integrating their available structural features in all three dimensions is still a challenging task. In this paper, we develop a general and flexible deep learning framework for modeling structural binding preferences and predicting binding sites of RBPs, which takes (predicted) RNA tertiary structural information into account for the first time. Our framework constructs a unified representation that characterizes the structural specificities of RBP targets in all three dimensions, which can be further used to predict novel candidate binding sites and discover potential binding motifs. Through testing on the real CLIP-seq datasets, we have demonstrated that our deep learning framework can automatically extract effective hidden structural features from the encoded raw sequence and structural profiles, and predict accurate RBP binding sites. In addition, we have conducted the first study to show that integrating the additional RNA tertiary structural features can improve the model performance in predicting RBP binding sites, especially for the polypyrimidine tract-binding protein (PTB), which also provides a new evidence to support the view that RBPs may own specific tertiary structural binding preferences. In particular, the tests on the internal ribosome entry site (IRES) segments yield satisfiable results with experimental support from the literature and further demonstrate the necessity of incorporating RNA tertiary structural information into the prediction model. The source code of our approach can be found in https://github.com/thucombio/deepnet-rbp.
format Online
Article
Text
id pubmed-4770198
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-47701982016-02-29 A deep learning framework for modeling structural features of RNA-binding protein targets Zhang, Sai Zhou, Jingtian Hu, Hailin Gong, Haipeng Chen, Ligong Cheng, Chao Zeng, Jianyang Nucleic Acids Res Methods Online RNA-binding proteins (RBPs) play important roles in the post-transcriptional control of RNAs. Identifying RBP binding sites and characterizing RBP binding preferences are key steps toward understanding the basic mechanisms of the post-transcriptional gene regulation. Though numerous computational methods have been developed for modeling RBP binding preferences, discovering a complete structural representation of the RBP targets by integrating their available structural features in all three dimensions is still a challenging task. In this paper, we develop a general and flexible deep learning framework for modeling structural binding preferences and predicting binding sites of RBPs, which takes (predicted) RNA tertiary structural information into account for the first time. Our framework constructs a unified representation that characterizes the structural specificities of RBP targets in all three dimensions, which can be further used to predict novel candidate binding sites and discover potential binding motifs. Through testing on the real CLIP-seq datasets, we have demonstrated that our deep learning framework can automatically extract effective hidden structural features from the encoded raw sequence and structural profiles, and predict accurate RBP binding sites. In addition, we have conducted the first study to show that integrating the additional RNA tertiary structural features can improve the model performance in predicting RBP binding sites, especially for the polypyrimidine tract-binding protein (PTB), which also provides a new evidence to support the view that RBPs may own specific tertiary structural binding preferences. In particular, the tests on the internal ribosome entry site (IRES) segments yield satisfiable results with experimental support from the literature and further demonstrate the necessity of incorporating RNA tertiary structural information into the prediction model. The source code of our approach can be found in https://github.com/thucombio/deepnet-rbp. Oxford University Press 2016-02-29 2015-10-13 /pmc/articles/PMC4770198/ /pubmed/26467480 http://dx.doi.org/10.1093/nar/gkv1025 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Zhang, Sai
Zhou, Jingtian
Hu, Hailin
Gong, Haipeng
Chen, Ligong
Cheng, Chao
Zeng, Jianyang
A deep learning framework for modeling structural features of RNA-binding protein targets
title A deep learning framework for modeling structural features of RNA-binding protein targets
title_full A deep learning framework for modeling structural features of RNA-binding protein targets
title_fullStr A deep learning framework for modeling structural features of RNA-binding protein targets
title_full_unstemmed A deep learning framework for modeling structural features of RNA-binding protein targets
title_short A deep learning framework for modeling structural features of RNA-binding protein targets
title_sort deep learning framework for modeling structural features of rna-binding protein targets
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4770198/
https://www.ncbi.nlm.nih.gov/pubmed/26467480
http://dx.doi.org/10.1093/nar/gkv1025
work_keys_str_mv AT zhangsai adeeplearningframeworkformodelingstructuralfeaturesofrnabindingproteintargets
AT zhoujingtian adeeplearningframeworkformodelingstructuralfeaturesofrnabindingproteintargets
AT huhailin adeeplearningframeworkformodelingstructuralfeaturesofrnabindingproteintargets
AT gonghaipeng adeeplearningframeworkformodelingstructuralfeaturesofrnabindingproteintargets
AT chenligong adeeplearningframeworkformodelingstructuralfeaturesofrnabindingproteintargets
AT chengchao adeeplearningframeworkformodelingstructuralfeaturesofrnabindingproteintargets
AT zengjianyang adeeplearningframeworkformodelingstructuralfeaturesofrnabindingproteintargets
AT zhangsai deeplearningframeworkformodelingstructuralfeaturesofrnabindingproteintargets
AT zhoujingtian deeplearningframeworkformodelingstructuralfeaturesofrnabindingproteintargets
AT huhailin deeplearningframeworkformodelingstructuralfeaturesofrnabindingproteintargets
AT gonghaipeng deeplearningframeworkformodelingstructuralfeaturesofrnabindingproteintargets
AT chenligong deeplearningframeworkformodelingstructuralfeaturesofrnabindingproteintargets
AT chengchao deeplearningframeworkformodelingstructuralfeaturesofrnabindingproteintargets
AT zengjianyang deeplearningframeworkformodelingstructuralfeaturesofrnabindingproteintargets