Cargando…

Predicting Protein–Protein Interactions Based on Ensemble Learning-Based Model from Protein Sequence

SIMPLE SUMMARY: Due to most traditional high-throughput experiments are tedious and laborious in identifying potential protein–protein interaction. To better improve accuracy prediction in protein–protein interactions. We proposed a novel computational method that can identify unknown protein–protei...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhan, Xinke, Xiao, Mang, You, Zhuhong, Yan, Chenggang, Guo, Jianxin, Wang, Liping, Sun, Yaoqi, Shang, Bingwan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9311754/
https://www.ncbi.nlm.nih.gov/pubmed/36101379
http://dx.doi.org/10.3390/biology11070995
_version_ 1784753670904086528
author Zhan, Xinke
Xiao, Mang
You, Zhuhong
Yan, Chenggang
Guo, Jianxin
Wang, Liping
Sun, Yaoqi
Shang, Bingwan
author_facet Zhan, Xinke
Xiao, Mang
You, Zhuhong
Yan, Chenggang
Guo, Jianxin
Wang, Liping
Sun, Yaoqi
Shang, Bingwan
author_sort Zhan, Xinke
collection PubMed
description SIMPLE SUMMARY: Due to most traditional high-throughput experiments are tedious and laborious in identifying potential protein–protein interaction. To better improve accuracy prediction in protein–protein interactions. We proposed a novel computational method that can identify unknown protein–protein interaction efficiently and hope this method can provide a helpful idea and tool for proteomics research. ABSTRACT: Protein–protein interactions (PPIs) play an essential role in many biological cellular functions. However, it is still tedious and time-consuming to identify protein–protein interactions through traditional experimental methods. For this reason, it is imperative and necessary to develop a computational method for predicting PPIs efficiently. This paper explores a novel computational method for detecting PPIs from protein sequence, the approach which mainly adopts the feature extraction method: Locality Preserving Projections (LPP) and classifier: Rotation Forest (RF). Specifically, we first employ the Position Specific Scoring Matrix (PSSM), which can remain evolutionary information of biological for representing protein sequence efficiently. Then, the LPP descriptor is applied to extract feature vectors from PSSM. The feature vectors are fed into the RF to obtain the final results. The proposed method is applied to two datasets: Yeast and H. pylori, and obtained an average accuracy of 92.81% and 92.56%, respectively. We also compare it with K nearest neighbors (KNN) and support vector machine (SVM) to better evaluate the performance of the proposed method. In summary, all experimental results indicate that the proposed approach is stable and robust for predicting PPIs and promising to be a useful tool for proteomics research.
format Online
Article
Text
id pubmed-9311754
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-93117542022-07-26 Predicting Protein–Protein Interactions Based on Ensemble Learning-Based Model from Protein Sequence Zhan, Xinke Xiao, Mang You, Zhuhong Yan, Chenggang Guo, Jianxin Wang, Liping Sun, Yaoqi Shang, Bingwan Biology (Basel) Article SIMPLE SUMMARY: Due to most traditional high-throughput experiments are tedious and laborious in identifying potential protein–protein interaction. To better improve accuracy prediction in protein–protein interactions. We proposed a novel computational method that can identify unknown protein–protein interaction efficiently and hope this method can provide a helpful idea and tool for proteomics research. ABSTRACT: Protein–protein interactions (PPIs) play an essential role in many biological cellular functions. However, it is still tedious and time-consuming to identify protein–protein interactions through traditional experimental methods. For this reason, it is imperative and necessary to develop a computational method for predicting PPIs efficiently. This paper explores a novel computational method for detecting PPIs from protein sequence, the approach which mainly adopts the feature extraction method: Locality Preserving Projections (LPP) and classifier: Rotation Forest (RF). Specifically, we first employ the Position Specific Scoring Matrix (PSSM), which can remain evolutionary information of biological for representing protein sequence efficiently. Then, the LPP descriptor is applied to extract feature vectors from PSSM. The feature vectors are fed into the RF to obtain the final results. The proposed method is applied to two datasets: Yeast and H. pylori, and obtained an average accuracy of 92.81% and 92.56%, respectively. We also compare it with K nearest neighbors (KNN) and support vector machine (SVM) to better evaluate the performance of the proposed method. In summary, all experimental results indicate that the proposed approach is stable and robust for predicting PPIs and promising to be a useful tool for proteomics research. MDPI 2022-06-30 /pmc/articles/PMC9311754/ /pubmed/36101379 http://dx.doi.org/10.3390/biology11070995 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhan, Xinke
Xiao, Mang
You, Zhuhong
Yan, Chenggang
Guo, Jianxin
Wang, Liping
Sun, Yaoqi
Shang, Bingwan
Predicting Protein–Protein Interactions Based on Ensemble Learning-Based Model from Protein Sequence
title Predicting Protein–Protein Interactions Based on Ensemble Learning-Based Model from Protein Sequence
title_full Predicting Protein–Protein Interactions Based on Ensemble Learning-Based Model from Protein Sequence
title_fullStr Predicting Protein–Protein Interactions Based on Ensemble Learning-Based Model from Protein Sequence
title_full_unstemmed Predicting Protein–Protein Interactions Based on Ensemble Learning-Based Model from Protein Sequence
title_short Predicting Protein–Protein Interactions Based on Ensemble Learning-Based Model from Protein Sequence
title_sort predicting protein–protein interactions based on ensemble learning-based model from protein sequence
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9311754/
https://www.ncbi.nlm.nih.gov/pubmed/36101379
http://dx.doi.org/10.3390/biology11070995
work_keys_str_mv AT zhanxinke predictingproteinproteininteractionsbasedonensemblelearningbasedmodelfromproteinsequence
AT xiaomang predictingproteinproteininteractionsbasedonensemblelearningbasedmodelfromproteinsequence
AT youzhuhong predictingproteinproteininteractionsbasedonensemblelearningbasedmodelfromproteinsequence
AT yanchenggang predictingproteinproteininteractionsbasedonensemblelearningbasedmodelfromproteinsequence
AT guojianxin predictingproteinproteininteractionsbasedonensemblelearningbasedmodelfromproteinsequence
AT wangliping predictingproteinproteininteractionsbasedonensemblelearningbasedmodelfromproteinsequence
AT sunyaoqi predictingproteinproteininteractionsbasedonensemblelearningbasedmodelfromproteinsequence
AT shangbingwan predictingproteinproteininteractionsbasedonensemblelearningbasedmodelfromproteinsequence