Cargando…

Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions

The interaction among proteins is essential in all life activities, and it is the basis of all the metabolic activities of the cells. By studying the protein-protein interactions (PPIs), people can better interpret the function of protein, decoding the phenomenon of life, especially in the design of...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Lei, You, Zhu-Hong, Yan, Xin, Xia, Shi-Xiong, Liu, Feng, Li, Li-Ping, Zhang, Wei, Zhou, Yong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6110764/
https://www.ncbi.nlm.nih.gov/pubmed/30150728
http://dx.doi.org/10.1038/s41598-018-30694-1
_version_ 1783350531017670656
author Wang, Lei
You, Zhu-Hong
Yan, Xin
Xia, Shi-Xiong
Liu, Feng
Li, Li-Ping
Zhang, Wei
Zhou, Yong
author_facet Wang, Lei
You, Zhu-Hong
Yan, Xin
Xia, Shi-Xiong
Liu, Feng
Li, Li-Ping
Zhang, Wei
Zhou, Yong
author_sort Wang, Lei
collection PubMed
description The interaction among proteins is essential in all life activities, and it is the basis of all the metabolic activities of the cells. By studying the protein-protein interactions (PPIs), people can better interpret the function of protein, decoding the phenomenon of life, especially in the design of new drugs with great practical value. Although many high-throughput techniques have been devised for large-scale detection of PPIs, these methods are still expensive and time-consuming. For this reason, there is a much-needed to develop computational methods for predicting PPIs at the entire proteome scale. In this article, we propose a new approach to predict PPIs using Rotation Forest (RF) classifier combine with matrix-based protein sequence. We apply the Position-Specific Scoring Matrix (PSSM), which contains biological evolution information, to represent protein sequences and extract the features through the two-dimensional Principal Component Analysis (2DPCA) algorithm. The descriptors are then sending to the rotation forest classifier for classification. We obtained 97.43% prediction accuracy with 94.92% sensitivity at the precision of 99.93% when the proposed method was applied to the PPIs data of yeast. To evaluate the performance of the proposed method, we compared it with other methods in the same dataset, and validate it on an independent datasets. The results obtained show that the proposed method is an appropriate and promising method for predicting PPIs.
format Online
Article
Text
id pubmed-6110764
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-61107642018-08-30 Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions Wang, Lei You, Zhu-Hong Yan, Xin Xia, Shi-Xiong Liu, Feng Li, Li-Ping Zhang, Wei Zhou, Yong Sci Rep Article The interaction among proteins is essential in all life activities, and it is the basis of all the metabolic activities of the cells. By studying the protein-protein interactions (PPIs), people can better interpret the function of protein, decoding the phenomenon of life, especially in the design of new drugs with great practical value. Although many high-throughput techniques have been devised for large-scale detection of PPIs, these methods are still expensive and time-consuming. For this reason, there is a much-needed to develop computational methods for predicting PPIs at the entire proteome scale. In this article, we propose a new approach to predict PPIs using Rotation Forest (RF) classifier combine with matrix-based protein sequence. We apply the Position-Specific Scoring Matrix (PSSM), which contains biological evolution information, to represent protein sequences and extract the features through the two-dimensional Principal Component Analysis (2DPCA) algorithm. The descriptors are then sending to the rotation forest classifier for classification. We obtained 97.43% prediction accuracy with 94.92% sensitivity at the precision of 99.93% when the proposed method was applied to the PPIs data of yeast. To evaluate the performance of the proposed method, we compared it with other methods in the same dataset, and validate it on an independent datasets. The results obtained show that the proposed method is an appropriate and promising method for predicting PPIs. Nature Publishing Group UK 2018-08-27 /pmc/articles/PMC6110764/ /pubmed/30150728 http://dx.doi.org/10.1038/s41598-018-30694-1 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Wang, Lei
You, Zhu-Hong
Yan, Xin
Xia, Shi-Xiong
Liu, Feng
Li, Li-Ping
Zhang, Wei
Zhou, Yong
Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions
title Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions
title_full Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions
title_fullStr Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions
title_full_unstemmed Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions
title_short Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions
title_sort using two-dimensional principal component analysis and rotation forest for prediction of protein-protein interactions
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6110764/
https://www.ncbi.nlm.nih.gov/pubmed/30150728
http://dx.doi.org/10.1038/s41598-018-30694-1
work_keys_str_mv AT wanglei usingtwodimensionalprincipalcomponentanalysisandrotationforestforpredictionofproteinproteininteractions
AT youzhuhong usingtwodimensionalprincipalcomponentanalysisandrotationforestforpredictionofproteinproteininteractions
AT yanxin usingtwodimensionalprincipalcomponentanalysisandrotationforestforpredictionofproteinproteininteractions
AT xiashixiong usingtwodimensionalprincipalcomponentanalysisandrotationforestforpredictionofproteinproteininteractions
AT liufeng usingtwodimensionalprincipalcomponentanalysisandrotationforestforpredictionofproteinproteininteractions
AT liliping usingtwodimensionalprincipalcomponentanalysisandrotationforestforpredictionofproteinproteininteractions
AT zhangwei usingtwodimensionalprincipalcomponentanalysisandrotationforestforpredictionofproteinproteininteractions
AT zhouyong usingtwodimensionalprincipalcomponentanalysisandrotationforestforpredictionofproteinproteininteractions