Cargando…

RPiRLS: Quantitative Predictions of RNA Interacting with Any Protein of Known Sequence

RNA-protein interactions (RPIs) have critical roles in numerous fundamental biological processes, such as post-transcriptional gene regulation, viral assembly, cellular defence and protein synthesis. As the number of available RNA-protein binding experimental data has increased rapidly due to high-t...

Descripción completa

Detalles Bibliográficos
Autores principales: Shen, Wen-Jun, Cui, Wenjuan, Chen, Danze, Zhang, Jieming, Xu, Jianzhen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6017498/
https://www.ncbi.nlm.nih.gov/pubmed/29495575
http://dx.doi.org/10.3390/molecules23030540
_version_ 1783334763591892992
author Shen, Wen-Jun
Cui, Wenjuan
Chen, Danze
Zhang, Jieming
Xu, Jianzhen
author_facet Shen, Wen-Jun
Cui, Wenjuan
Chen, Danze
Zhang, Jieming
Xu, Jianzhen
author_sort Shen, Wen-Jun
collection PubMed
description RNA-protein interactions (RPIs) have critical roles in numerous fundamental biological processes, such as post-transcriptional gene regulation, viral assembly, cellular defence and protein synthesis. As the number of available RNA-protein binding experimental data has increased rapidly due to high-throughput sequencing methods, it is now possible to measure and understand RNA-protein interactions by computational methods. In this study, we integrate a sequence-based derived kernel with regularized least squares to perform prediction. The derived kernel exploits the contextual information around an amino acid or a nucleic acid as well as the repetitive conserved motif information. We propose a novel machine learning method, called RPiRLS to predict the interaction between any RNA and protein of known sequences. For the RPiRLS classifier, each protein sequence comprises up to 20 diverse amino acids but for the RPiRLS-7G classifier, each protein sequence is represented by using 7-letter reduced alphabets based on their physiochemical properties. We evaluated both methods on a number of benchmark data sets and compared their performances with two newly developed and state-of-the-art methods, RPI-Pred and IPMiner. On the non-redundant benchmark test sets extracted from the PRIDB, the RPiRLS method outperformed RPI-Pred and IPMiner in terms of accuracy, specificity and sensitivity. Further, RPiRLS achieved an accuracy of 92% on the prediction of lncRNA-protein interactions. The proposed method can also be extended to construct RNA-protein interaction networks. The RPiRLS web server is freely available at http://bmc.med.stu.edu.cn/RPiRLS.
format Online
Article
Text
id pubmed-6017498
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-60174982018-11-13 RPiRLS: Quantitative Predictions of RNA Interacting with Any Protein of Known Sequence Shen, Wen-Jun Cui, Wenjuan Chen, Danze Zhang, Jieming Xu, Jianzhen Molecules Article RNA-protein interactions (RPIs) have critical roles in numerous fundamental biological processes, such as post-transcriptional gene regulation, viral assembly, cellular defence and protein synthesis. As the number of available RNA-protein binding experimental data has increased rapidly due to high-throughput sequencing methods, it is now possible to measure and understand RNA-protein interactions by computational methods. In this study, we integrate a sequence-based derived kernel with regularized least squares to perform prediction. The derived kernel exploits the contextual information around an amino acid or a nucleic acid as well as the repetitive conserved motif information. We propose a novel machine learning method, called RPiRLS to predict the interaction between any RNA and protein of known sequences. For the RPiRLS classifier, each protein sequence comprises up to 20 diverse amino acids but for the RPiRLS-7G classifier, each protein sequence is represented by using 7-letter reduced alphabets based on their physiochemical properties. We evaluated both methods on a number of benchmark data sets and compared their performances with two newly developed and state-of-the-art methods, RPI-Pred and IPMiner. On the non-redundant benchmark test sets extracted from the PRIDB, the RPiRLS method outperformed RPI-Pred and IPMiner in terms of accuracy, specificity and sensitivity. Further, RPiRLS achieved an accuracy of 92% on the prediction of lncRNA-protein interactions. The proposed method can also be extended to construct RNA-protein interaction networks. The RPiRLS web server is freely available at http://bmc.med.stu.edu.cn/RPiRLS. MDPI 2018-02-28 /pmc/articles/PMC6017498/ /pubmed/29495575 http://dx.doi.org/10.3390/molecules23030540 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Shen, Wen-Jun
Cui, Wenjuan
Chen, Danze
Zhang, Jieming
Xu, Jianzhen
RPiRLS: Quantitative Predictions of RNA Interacting with Any Protein of Known Sequence
title RPiRLS: Quantitative Predictions of RNA Interacting with Any Protein of Known Sequence
title_full RPiRLS: Quantitative Predictions of RNA Interacting with Any Protein of Known Sequence
title_fullStr RPiRLS: Quantitative Predictions of RNA Interacting with Any Protein of Known Sequence
title_full_unstemmed RPiRLS: Quantitative Predictions of RNA Interacting with Any Protein of Known Sequence
title_short RPiRLS: Quantitative Predictions of RNA Interacting with Any Protein of Known Sequence
title_sort rpirls: quantitative predictions of rna interacting with any protein of known sequence
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6017498/
https://www.ncbi.nlm.nih.gov/pubmed/29495575
http://dx.doi.org/10.3390/molecules23030540
work_keys_str_mv AT shenwenjun rpirlsquantitativepredictionsofrnainteractingwithanyproteinofknownsequence
AT cuiwenjuan rpirlsquantitativepredictionsofrnainteractingwithanyproteinofknownsequence
AT chendanze rpirlsquantitativepredictionsofrnainteractingwithanyproteinofknownsequence
AT zhangjieming rpirlsquantitativepredictionsofrnainteractingwithanyproteinofknownsequence
AT xujianzhen rpirlsquantitativepredictionsofrnainteractingwithanyproteinofknownsequence