Cargando…

Construction of Complex Features for Computational Predicting ncRNA-Protein Interaction

Non-coding RNA (ncRNA) plays important roles in many critical regulation processes. Many ncRNAs perform their regulatory functions by the form of RNA-protein complexes. Therefore, identifying the interaction between ncRNA and protein is fundamental to understand functions of ncRNA. Under pressures f...

Descripción completa

Detalles Bibliográficos
Autores principales: Dai, Qiguo, Guo, Maozu, Duan, Xiaodong, Teng, Zhixia, Fu, Yueyue
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6367266/
https://www.ncbi.nlm.nih.gov/pubmed/30774646
http://dx.doi.org/10.3389/fgene.2019.00018
_version_ 1783393748241088512
author Dai, Qiguo
Guo, Maozu
Duan, Xiaodong
Teng, Zhixia
Fu, Yueyue
author_facet Dai, Qiguo
Guo, Maozu
Duan, Xiaodong
Teng, Zhixia
Fu, Yueyue
author_sort Dai, Qiguo
collection PubMed
description Non-coding RNA (ncRNA) plays important roles in many critical regulation processes. Many ncRNAs perform their regulatory functions by the form of RNA-protein complexes. Therefore, identifying the interaction between ncRNA and protein is fundamental to understand functions of ncRNA. Under pressures from expensive cost of experimental techniques, developing an accuracy computational predictive model has become an indispensable way to identify ncRNA-protein interaction. A powerful predicting model of ncRNA-protein interaction needs a good feature set of characterizing the interaction. In this paper, a novel method is put forward to generate complex features for characterizing ncRNA-protein interaction (named CFRP). To obtain a comprehensive description of ncRNA-protein interaction, complex features are generated by non-linear transformations from the traditional k-mer features of ncRNA and protein sequences. To further reduce the dimensions of complex features, a group of discriminative features are selected by random forest. To validate the performances of the proposed method, a series of experiments are carried on several widely-used public datasets. Compared with the traditional k-mer features, the CFRP complex features can boost the performances of ncRNA-protein interaction prediction model. Meanwhile, the CFRP-based prediction model is compared with several state-of-the-art methods, and the results show that the proposed method achieves better performances than the others in term of the evaluation metrics. In conclusion, the complex features generated by CFRP are beneficial for building a powerful predicting model of ncRNA-protein interaction.
format Online
Article
Text
id pubmed-6367266
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-63672662019-02-15 Construction of Complex Features for Computational Predicting ncRNA-Protein Interaction Dai, Qiguo Guo, Maozu Duan, Xiaodong Teng, Zhixia Fu, Yueyue Front Genet Genetics Non-coding RNA (ncRNA) plays important roles in many critical regulation processes. Many ncRNAs perform their regulatory functions by the form of RNA-protein complexes. Therefore, identifying the interaction between ncRNA and protein is fundamental to understand functions of ncRNA. Under pressures from expensive cost of experimental techniques, developing an accuracy computational predictive model has become an indispensable way to identify ncRNA-protein interaction. A powerful predicting model of ncRNA-protein interaction needs a good feature set of characterizing the interaction. In this paper, a novel method is put forward to generate complex features for characterizing ncRNA-protein interaction (named CFRP). To obtain a comprehensive description of ncRNA-protein interaction, complex features are generated by non-linear transformations from the traditional k-mer features of ncRNA and protein sequences. To further reduce the dimensions of complex features, a group of discriminative features are selected by random forest. To validate the performances of the proposed method, a series of experiments are carried on several widely-used public datasets. Compared with the traditional k-mer features, the CFRP complex features can boost the performances of ncRNA-protein interaction prediction model. Meanwhile, the CFRP-based prediction model is compared with several state-of-the-art methods, and the results show that the proposed method achieves better performances than the others in term of the evaluation metrics. In conclusion, the complex features generated by CFRP are beneficial for building a powerful predicting model of ncRNA-protein interaction. Frontiers Media S.A. 2019-02-01 /pmc/articles/PMC6367266/ /pubmed/30774646 http://dx.doi.org/10.3389/fgene.2019.00018 Text en Copyright © 2019 Dai, Guo, Duan, Teng and Fu. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Dai, Qiguo
Guo, Maozu
Duan, Xiaodong
Teng, Zhixia
Fu, Yueyue
Construction of Complex Features for Computational Predicting ncRNA-Protein Interaction
title Construction of Complex Features for Computational Predicting ncRNA-Protein Interaction
title_full Construction of Complex Features for Computational Predicting ncRNA-Protein Interaction
title_fullStr Construction of Complex Features for Computational Predicting ncRNA-Protein Interaction
title_full_unstemmed Construction of Complex Features for Computational Predicting ncRNA-Protein Interaction
title_short Construction of Complex Features for Computational Predicting ncRNA-Protein Interaction
title_sort construction of complex features for computational predicting ncrna-protein interaction
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6367266/
https://www.ncbi.nlm.nih.gov/pubmed/30774646
http://dx.doi.org/10.3389/fgene.2019.00018
work_keys_str_mv AT daiqiguo constructionofcomplexfeaturesforcomputationalpredictingncrnaproteininteraction
AT guomaozu constructionofcomplexfeaturesforcomputationalpredictingncrnaproteininteraction
AT duanxiaodong constructionofcomplexfeaturesforcomputationalpredictingncrnaproteininteraction
AT tengzhixia constructionofcomplexfeaturesforcomputationalpredictingncrnaproteininteraction
AT fuyueyue constructionofcomplexfeaturesforcomputationalpredictingncrnaproteininteraction