Cargando…

Robust and accurate prediction of protein–protein interactions by exploiting evolutionary information

Various biochemical functions of organisms are performed by protein–protein interactions (PPIs). Therefore, recognition of protein–protein interactions is very important for understanding most life activities, such as DNA replication and transcription, protein synthesis and secretion, signal transdu...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Yang, Wang, Zheng, Li, Li-Ping, You, Zhu-Hong, Huang, Wen-Zhun, Zhan, Xin-Ke, Wang, Yan-Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8376940/
https://www.ncbi.nlm.nih.gov/pubmed/34413375
http://dx.doi.org/10.1038/s41598-021-96265-z
_version_ 1783740554283057152
author Li, Yang
Wang, Zheng
Li, Li-Ping
You, Zhu-Hong
Huang, Wen-Zhun
Zhan, Xin-Ke
Wang, Yan-Bin
author_facet Li, Yang
Wang, Zheng
Li, Li-Ping
You, Zhu-Hong
Huang, Wen-Zhun
Zhan, Xin-Ke
Wang, Yan-Bin
author_sort Li, Yang
collection PubMed
description Various biochemical functions of organisms are performed by protein–protein interactions (PPIs). Therefore, recognition of protein–protein interactions is very important for understanding most life activities, such as DNA replication and transcription, protein synthesis and secretion, signal transduction and metabolism. Although high-throughput technology makes it possible to generate large-scale PPIs data, it requires expensive cost of both time and labor, and leave a risk of high false positive rate. In order to formulate a more ingenious solution, biology community is looking for computational methods to quickly and efficiently discover massive protein interaction data. In this paper, we propose a computational method for predicting PPIs based on a fresh idea of combining orthogonal locality preserving projections (OLPP) and rotation forest (RoF) models, using protein sequence information. Specifically, the protein sequence is first converted into position-specific scoring matrices (PSSMs) containing protein evolutionary information by using the Position-Specific Iterated Basic Local Alignment Search Tool (PSI-BLAST). Then we characterize a protein as a fixed length feature vector by applying OLPP to PSSMs. Finally, we train an RoF classifier for the purpose of identifying non-interacting and interacting protein pairs. The proposed method yielded a significantly better results than existing methods, with 90.07% and 96.09% prediction accuracy on Yeast and Human datasets. Our experiment show the proposed method can serve as a useful tool to accelerate the process of solving key problems in proteomics.
format Online
Article
Text
id pubmed-8376940
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-83769402021-08-20 Robust and accurate prediction of protein–protein interactions by exploiting evolutionary information Li, Yang Wang, Zheng Li, Li-Ping You, Zhu-Hong Huang, Wen-Zhun Zhan, Xin-Ke Wang, Yan-Bin Sci Rep Article Various biochemical functions of organisms are performed by protein–protein interactions (PPIs). Therefore, recognition of protein–protein interactions is very important for understanding most life activities, such as DNA replication and transcription, protein synthesis and secretion, signal transduction and metabolism. Although high-throughput technology makes it possible to generate large-scale PPIs data, it requires expensive cost of both time and labor, and leave a risk of high false positive rate. In order to formulate a more ingenious solution, biology community is looking for computational methods to quickly and efficiently discover massive protein interaction data. In this paper, we propose a computational method for predicting PPIs based on a fresh idea of combining orthogonal locality preserving projections (OLPP) and rotation forest (RoF) models, using protein sequence information. Specifically, the protein sequence is first converted into position-specific scoring matrices (PSSMs) containing protein evolutionary information by using the Position-Specific Iterated Basic Local Alignment Search Tool (PSI-BLAST). Then we characterize a protein as a fixed length feature vector by applying OLPP to PSSMs. Finally, we train an RoF classifier for the purpose of identifying non-interacting and interacting protein pairs. The proposed method yielded a significantly better results than existing methods, with 90.07% and 96.09% prediction accuracy on Yeast and Human datasets. Our experiment show the proposed method can serve as a useful tool to accelerate the process of solving key problems in proteomics. Nature Publishing Group UK 2021-08-19 /pmc/articles/PMC8376940/ /pubmed/34413375 http://dx.doi.org/10.1038/s41598-021-96265-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Li, Yang
Wang, Zheng
Li, Li-Ping
You, Zhu-Hong
Huang, Wen-Zhun
Zhan, Xin-Ke
Wang, Yan-Bin
Robust and accurate prediction of protein–protein interactions by exploiting evolutionary information
title Robust and accurate prediction of protein–protein interactions by exploiting evolutionary information
title_full Robust and accurate prediction of protein–protein interactions by exploiting evolutionary information
title_fullStr Robust and accurate prediction of protein–protein interactions by exploiting evolutionary information
title_full_unstemmed Robust and accurate prediction of protein–protein interactions by exploiting evolutionary information
title_short Robust and accurate prediction of protein–protein interactions by exploiting evolutionary information
title_sort robust and accurate prediction of protein–protein interactions by exploiting evolutionary information
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8376940/
https://www.ncbi.nlm.nih.gov/pubmed/34413375
http://dx.doi.org/10.1038/s41598-021-96265-z
work_keys_str_mv AT liyang robustandaccuratepredictionofproteinproteininteractionsbyexploitingevolutionaryinformation
AT wangzheng robustandaccuratepredictionofproteinproteininteractionsbyexploitingevolutionaryinformation
AT liliping robustandaccuratepredictionofproteinproteininteractionsbyexploitingevolutionaryinformation
AT youzhuhong robustandaccuratepredictionofproteinproteininteractionsbyexploitingevolutionaryinformation
AT huangwenzhun robustandaccuratepredictionofproteinproteininteractionsbyexploitingevolutionaryinformation
AT zhanxinke robustandaccuratepredictionofproteinproteininteractionsbyexploitingevolutionaryinformation
AT wangyanbin robustandaccuratepredictionofproteinproteininteractionsbyexploitingevolutionaryinformation