Cargando…

RPITER: A Hierarchical Deep Learning Framework for ncRNA–Protein Interaction Prediction

Non-coding RNAs (ncRNAs) play crucial roles in multiple fundamental biological processes, such as post-transcriptional gene regulation, and are implicated in many complex human diseases. Mostly ncRNAs function by interacting with corresponding RNA-binding proteins. The research on ncRNA–protein inte...

Descripción completa

Detalles Bibliográficos
Autores principales: Peng, Cheng, Han, Siyu, Zhang, Hui, Li, Ying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6429152/
https://www.ncbi.nlm.nih.gov/pubmed/30832218
http://dx.doi.org/10.3390/ijms20051070
_version_ 1783405531223818240
author Peng, Cheng
Han, Siyu
Zhang, Hui
Li, Ying
author_facet Peng, Cheng
Han, Siyu
Zhang, Hui
Li, Ying
author_sort Peng, Cheng
collection PubMed
description Non-coding RNAs (ncRNAs) play crucial roles in multiple fundamental biological processes, such as post-transcriptional gene regulation, and are implicated in many complex human diseases. Mostly ncRNAs function by interacting with corresponding RNA-binding proteins. The research on ncRNA–protein interaction is the key to understanding the function of ncRNA. However, the biological experiment techniques for identifying RNA–protein interactions (RPIs) are currently still expensive and time-consuming. Due to the complex molecular mechanism of ncRNA–protein interaction and the lack of conservation for ncRNA, especially for long ncRNA (lncRNA), the prediction of ncRNA–protein interaction is still a challenge. Deep learning-based models have become the state-of-the-art in a range of biological sequence analysis problems due to their strong power of feature learning. In this study, we proposed a hierarchical deep learning framework RPITER to predict RNA–protein interaction. For sequence coding, we improved the conjoint triad feature (CTF) coding method by complementing more primary sequence information and adding sequence structure information. For model design, RPITER employed two basic neural network architectures of convolution neural network (CNN) and stacked auto-encoder (SAE). Comprehensive experiments were performed on five benchmark datasets from PDB and NPInter databases to analyze and compare the performances of different sequence coding methods and prediction models. We found that CNN and SAE deep learning architectures have powerful fitting abilities for the k-mer features of RNA and protein sequence. The improved CTF coding method showed performance gain compared with the original CTF method. Moreover, our designed RPITER performed well in predicting RNA–protein interaction (RPI) and could outperform most of the previous methods. On five widely used RPI datasets, RPI369, RPI488, RPI1807, RPI2241 and NPInter, RPITER obtained [Formula: see text] of 0.821, 0.911, 0.990, 0.957 and 0.985, respectively. The proposed RPITER could be a complementary method for predicting RPI and constructing RPI network, which would help push forward the related biological research on ncRNAs and lncRNAs.
format Online
Article
Text
id pubmed-6429152
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-64291522019-04-10 RPITER: A Hierarchical Deep Learning Framework for ncRNA–Protein Interaction Prediction Peng, Cheng Han, Siyu Zhang, Hui Li, Ying Int J Mol Sci Article Non-coding RNAs (ncRNAs) play crucial roles in multiple fundamental biological processes, such as post-transcriptional gene regulation, and are implicated in many complex human diseases. Mostly ncRNAs function by interacting with corresponding RNA-binding proteins. The research on ncRNA–protein interaction is the key to understanding the function of ncRNA. However, the biological experiment techniques for identifying RNA–protein interactions (RPIs) are currently still expensive and time-consuming. Due to the complex molecular mechanism of ncRNA–protein interaction and the lack of conservation for ncRNA, especially for long ncRNA (lncRNA), the prediction of ncRNA–protein interaction is still a challenge. Deep learning-based models have become the state-of-the-art in a range of biological sequence analysis problems due to their strong power of feature learning. In this study, we proposed a hierarchical deep learning framework RPITER to predict RNA–protein interaction. For sequence coding, we improved the conjoint triad feature (CTF) coding method by complementing more primary sequence information and adding sequence structure information. For model design, RPITER employed two basic neural network architectures of convolution neural network (CNN) and stacked auto-encoder (SAE). Comprehensive experiments were performed on five benchmark datasets from PDB and NPInter databases to analyze and compare the performances of different sequence coding methods and prediction models. We found that CNN and SAE deep learning architectures have powerful fitting abilities for the k-mer features of RNA and protein sequence. The improved CTF coding method showed performance gain compared with the original CTF method. Moreover, our designed RPITER performed well in predicting RNA–protein interaction (RPI) and could outperform most of the previous methods. On five widely used RPI datasets, RPI369, RPI488, RPI1807, RPI2241 and NPInter, RPITER obtained [Formula: see text] of 0.821, 0.911, 0.990, 0.957 and 0.985, respectively. The proposed RPITER could be a complementary method for predicting RPI and constructing RPI network, which would help push forward the related biological research on ncRNAs and lncRNAs. MDPI 2019-03-01 /pmc/articles/PMC6429152/ /pubmed/30832218 http://dx.doi.org/10.3390/ijms20051070 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Peng, Cheng
Han, Siyu
Zhang, Hui
Li, Ying
RPITER: A Hierarchical Deep Learning Framework for ncRNA–Protein Interaction Prediction
title RPITER: A Hierarchical Deep Learning Framework for ncRNA–Protein Interaction Prediction
title_full RPITER: A Hierarchical Deep Learning Framework for ncRNA–Protein Interaction Prediction
title_fullStr RPITER: A Hierarchical Deep Learning Framework for ncRNA–Protein Interaction Prediction
title_full_unstemmed RPITER: A Hierarchical Deep Learning Framework for ncRNA–Protein Interaction Prediction
title_short RPITER: A Hierarchical Deep Learning Framework for ncRNA–Protein Interaction Prediction
title_sort rpiter: a hierarchical deep learning framework for ncrna–protein interaction prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6429152/
https://www.ncbi.nlm.nih.gov/pubmed/30832218
http://dx.doi.org/10.3390/ijms20051070
work_keys_str_mv AT pengcheng rpiterahierarchicaldeeplearningframeworkforncrnaproteininteractionprediction
AT hansiyu rpiterahierarchicaldeeplearningframeworkforncrnaproteininteractionprediction
AT zhanghui rpiterahierarchicaldeeplearningframeworkforncrnaproteininteractionprediction
AT liying rpiterahierarchicaldeeplearningframeworkforncrnaproteininteractionprediction