Cargando…

Prediction of Self-Interacting Proteins from Protein Sequence Information Based on Random Projection Model and Fast Fourier Transform

It is significant for biological cells to predict self-interacting proteins (SIPs) in the field of bioinformatics. SIPs mean that two or more identical proteins can interact with each other by one gene expression. This plays a major role in the evolution of protein‒protein interactions (PPIs) and ce...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Zhan-Heng, You, Zhu-Hong, Li, Li-Ping, Wang, Yan-Bin, Wong, Leon, Yi, Hai-Cheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6412412/
https://www.ncbi.nlm.nih.gov/pubmed/30795499
http://dx.doi.org/10.3390/ijms20040930
_version_ 1783402599037272064
author Chen, Zhan-Heng
You, Zhu-Hong
Li, Li-Ping
Wang, Yan-Bin
Wong, Leon
Yi, Hai-Cheng
author_facet Chen, Zhan-Heng
You, Zhu-Hong
Li, Li-Ping
Wang, Yan-Bin
Wong, Leon
Yi, Hai-Cheng
author_sort Chen, Zhan-Heng
collection PubMed
description It is significant for biological cells to predict self-interacting proteins (SIPs) in the field of bioinformatics. SIPs mean that two or more identical proteins can interact with each other by one gene expression. This plays a major role in the evolution of protein‒protein interactions (PPIs) and cellular functions. Owing to the limitation of the experimental identification of self-interacting proteins, it is more and more significant to develop a useful biological tool for the prediction of SIPs from protein sequence information. Therefore, we propose a novel prediction model called RP-FFT that merges the Random Projection (RP) model and Fast Fourier Transform (FFT) for detecting SIPs. First, each protein sequence was transformed into a Position Specific Scoring Matrix (PSSM) using the Position Specific Iterated BLAST (PSI-BLAST). Second, the features of protein sequences were extracted by the FFT method on PSSM. Lastly, we evaluated the performance of RP-FFT and compared the RP classifier with the state-of-the-art support vector machine (SVM) classifier and other existing methods on the human and yeast datasets; after the five-fold cross-validation, the RP-FFT model can obtain high average accuracies of 96.28% and 91.87% on the human and yeast datasets, respectively. The experimental results demonstrated that our RP-FFT prediction model is reasonable and robust.
format Online
Article
Text
id pubmed-6412412
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-64124122019-04-05 Prediction of Self-Interacting Proteins from Protein Sequence Information Based on Random Projection Model and Fast Fourier Transform Chen, Zhan-Heng You, Zhu-Hong Li, Li-Ping Wang, Yan-Bin Wong, Leon Yi, Hai-Cheng Int J Mol Sci Article It is significant for biological cells to predict self-interacting proteins (SIPs) in the field of bioinformatics. SIPs mean that two or more identical proteins can interact with each other by one gene expression. This plays a major role in the evolution of protein‒protein interactions (PPIs) and cellular functions. Owing to the limitation of the experimental identification of self-interacting proteins, it is more and more significant to develop a useful biological tool for the prediction of SIPs from protein sequence information. Therefore, we propose a novel prediction model called RP-FFT that merges the Random Projection (RP) model and Fast Fourier Transform (FFT) for detecting SIPs. First, each protein sequence was transformed into a Position Specific Scoring Matrix (PSSM) using the Position Specific Iterated BLAST (PSI-BLAST). Second, the features of protein sequences were extracted by the FFT method on PSSM. Lastly, we evaluated the performance of RP-FFT and compared the RP classifier with the state-of-the-art support vector machine (SVM) classifier and other existing methods on the human and yeast datasets; after the five-fold cross-validation, the RP-FFT model can obtain high average accuracies of 96.28% and 91.87% on the human and yeast datasets, respectively. The experimental results demonstrated that our RP-FFT prediction model is reasonable and robust. MDPI 2019-02-21 /pmc/articles/PMC6412412/ /pubmed/30795499 http://dx.doi.org/10.3390/ijms20040930 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Chen, Zhan-Heng
You, Zhu-Hong
Li, Li-Ping
Wang, Yan-Bin
Wong, Leon
Yi, Hai-Cheng
Prediction of Self-Interacting Proteins from Protein Sequence Information Based on Random Projection Model and Fast Fourier Transform
title Prediction of Self-Interacting Proteins from Protein Sequence Information Based on Random Projection Model and Fast Fourier Transform
title_full Prediction of Self-Interacting Proteins from Protein Sequence Information Based on Random Projection Model and Fast Fourier Transform
title_fullStr Prediction of Self-Interacting Proteins from Protein Sequence Information Based on Random Projection Model and Fast Fourier Transform
title_full_unstemmed Prediction of Self-Interacting Proteins from Protein Sequence Information Based on Random Projection Model and Fast Fourier Transform
title_short Prediction of Self-Interacting Proteins from Protein Sequence Information Based on Random Projection Model and Fast Fourier Transform
title_sort prediction of self-interacting proteins from protein sequence information based on random projection model and fast fourier transform
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6412412/
https://www.ncbi.nlm.nih.gov/pubmed/30795499
http://dx.doi.org/10.3390/ijms20040930
work_keys_str_mv AT chenzhanheng predictionofselfinteractingproteinsfromproteinsequenceinformationbasedonrandomprojectionmodelandfastfouriertransform
AT youzhuhong predictionofselfinteractingproteinsfromproteinsequenceinformationbasedonrandomprojectionmodelandfastfouriertransform
AT liliping predictionofselfinteractingproteinsfromproteinsequenceinformationbasedonrandomprojectionmodelandfastfouriertransform
AT wangyanbin predictionofselfinteractingproteinsfromproteinsequenceinformationbasedonrandomprojectionmodelandfastfouriertransform
AT wongleon predictionofselfinteractingproteinsfromproteinsequenceinformationbasedonrandomprojectionmodelandfastfouriertransform
AT yihaicheng predictionofselfinteractingproteinsfromproteinsequenceinformationbasedonrandomprojectionmodelandfastfouriertransform