Cargando…
RPiRLS: Quantitative Predictions of RNA Interacting with Any Protein of Known Sequence
RNA-protein interactions (RPIs) have critical roles in numerous fundamental biological processes, such as post-transcriptional gene regulation, viral assembly, cellular defence and protein synthesis. As the number of available RNA-protein binding experimental data has increased rapidly due to high-t...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6017498/ https://www.ncbi.nlm.nih.gov/pubmed/29495575 http://dx.doi.org/10.3390/molecules23030540 |
_version_ | 1783334763591892992 |
---|---|
author | Shen, Wen-Jun Cui, Wenjuan Chen, Danze Zhang, Jieming Xu, Jianzhen |
author_facet | Shen, Wen-Jun Cui, Wenjuan Chen, Danze Zhang, Jieming Xu, Jianzhen |
author_sort | Shen, Wen-Jun |
collection | PubMed |
description | RNA-protein interactions (RPIs) have critical roles in numerous fundamental biological processes, such as post-transcriptional gene regulation, viral assembly, cellular defence and protein synthesis. As the number of available RNA-protein binding experimental data has increased rapidly due to high-throughput sequencing methods, it is now possible to measure and understand RNA-protein interactions by computational methods. In this study, we integrate a sequence-based derived kernel with regularized least squares to perform prediction. The derived kernel exploits the contextual information around an amino acid or a nucleic acid as well as the repetitive conserved motif information. We propose a novel machine learning method, called RPiRLS to predict the interaction between any RNA and protein of known sequences. For the RPiRLS classifier, each protein sequence comprises up to 20 diverse amino acids but for the RPiRLS-7G classifier, each protein sequence is represented by using 7-letter reduced alphabets based on their physiochemical properties. We evaluated both methods on a number of benchmark data sets and compared their performances with two newly developed and state-of-the-art methods, RPI-Pred and IPMiner. On the non-redundant benchmark test sets extracted from the PRIDB, the RPiRLS method outperformed RPI-Pred and IPMiner in terms of accuracy, specificity and sensitivity. Further, RPiRLS achieved an accuracy of 92% on the prediction of lncRNA-protein interactions. The proposed method can also be extended to construct RNA-protein interaction networks. The RPiRLS web server is freely available at http://bmc.med.stu.edu.cn/RPiRLS. |
format | Online Article Text |
id | pubmed-6017498 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-60174982018-11-13 RPiRLS: Quantitative Predictions of RNA Interacting with Any Protein of Known Sequence Shen, Wen-Jun Cui, Wenjuan Chen, Danze Zhang, Jieming Xu, Jianzhen Molecules Article RNA-protein interactions (RPIs) have critical roles in numerous fundamental biological processes, such as post-transcriptional gene regulation, viral assembly, cellular defence and protein synthesis. As the number of available RNA-protein binding experimental data has increased rapidly due to high-throughput sequencing methods, it is now possible to measure and understand RNA-protein interactions by computational methods. In this study, we integrate a sequence-based derived kernel with regularized least squares to perform prediction. The derived kernel exploits the contextual information around an amino acid or a nucleic acid as well as the repetitive conserved motif information. We propose a novel machine learning method, called RPiRLS to predict the interaction between any RNA and protein of known sequences. For the RPiRLS classifier, each protein sequence comprises up to 20 diverse amino acids but for the RPiRLS-7G classifier, each protein sequence is represented by using 7-letter reduced alphabets based on their physiochemical properties. We evaluated both methods on a number of benchmark data sets and compared their performances with two newly developed and state-of-the-art methods, RPI-Pred and IPMiner. On the non-redundant benchmark test sets extracted from the PRIDB, the RPiRLS method outperformed RPI-Pred and IPMiner in terms of accuracy, specificity and sensitivity. Further, RPiRLS achieved an accuracy of 92% on the prediction of lncRNA-protein interactions. The proposed method can also be extended to construct RNA-protein interaction networks. The RPiRLS web server is freely available at http://bmc.med.stu.edu.cn/RPiRLS. MDPI 2018-02-28 /pmc/articles/PMC6017498/ /pubmed/29495575 http://dx.doi.org/10.3390/molecules23030540 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Shen, Wen-Jun Cui, Wenjuan Chen, Danze Zhang, Jieming Xu, Jianzhen RPiRLS: Quantitative Predictions of RNA Interacting with Any Protein of Known Sequence |
title | RPiRLS: Quantitative Predictions of RNA Interacting with Any Protein of Known Sequence |
title_full | RPiRLS: Quantitative Predictions of RNA Interacting with Any Protein of Known Sequence |
title_fullStr | RPiRLS: Quantitative Predictions of RNA Interacting with Any Protein of Known Sequence |
title_full_unstemmed | RPiRLS: Quantitative Predictions of RNA Interacting with Any Protein of Known Sequence |
title_short | RPiRLS: Quantitative Predictions of RNA Interacting with Any Protein of Known Sequence |
title_sort | rpirls: quantitative predictions of rna interacting with any protein of known sequence |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6017498/ https://www.ncbi.nlm.nih.gov/pubmed/29495575 http://dx.doi.org/10.3390/molecules23030540 |
work_keys_str_mv | AT shenwenjun rpirlsquantitativepredictionsofrnainteractingwithanyproteinofknownsequence AT cuiwenjuan rpirlsquantitativepredictionsofrnainteractingwithanyproteinofknownsequence AT chendanze rpirlsquantitativepredictionsofrnainteractingwithanyproteinofknownsequence AT zhangjieming rpirlsquantitativepredictionsofrnainteractingwithanyproteinofknownsequence AT xujianzhen rpirlsquantitativepredictionsofrnainteractingwithanyproteinofknownsequence |