Cargando…
Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions
The interaction among proteins is essential in all life activities, and it is the basis of all the metabolic activities of the cells. By studying the protein-protein interactions (PPIs), people can better interpret the function of protein, decoding the phenomenon of life, especially in the design of...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6110764/ https://www.ncbi.nlm.nih.gov/pubmed/30150728 http://dx.doi.org/10.1038/s41598-018-30694-1 |
_version_ | 1783350531017670656 |
---|---|
author | Wang, Lei You, Zhu-Hong Yan, Xin Xia, Shi-Xiong Liu, Feng Li, Li-Ping Zhang, Wei Zhou, Yong |
author_facet | Wang, Lei You, Zhu-Hong Yan, Xin Xia, Shi-Xiong Liu, Feng Li, Li-Ping Zhang, Wei Zhou, Yong |
author_sort | Wang, Lei |
collection | PubMed |
description | The interaction among proteins is essential in all life activities, and it is the basis of all the metabolic activities of the cells. By studying the protein-protein interactions (PPIs), people can better interpret the function of protein, decoding the phenomenon of life, especially in the design of new drugs with great practical value. Although many high-throughput techniques have been devised for large-scale detection of PPIs, these methods are still expensive and time-consuming. For this reason, there is a much-needed to develop computational methods for predicting PPIs at the entire proteome scale. In this article, we propose a new approach to predict PPIs using Rotation Forest (RF) classifier combine with matrix-based protein sequence. We apply the Position-Specific Scoring Matrix (PSSM), which contains biological evolution information, to represent protein sequences and extract the features through the two-dimensional Principal Component Analysis (2DPCA) algorithm. The descriptors are then sending to the rotation forest classifier for classification. We obtained 97.43% prediction accuracy with 94.92% sensitivity at the precision of 99.93% when the proposed method was applied to the PPIs data of yeast. To evaluate the performance of the proposed method, we compared it with other methods in the same dataset, and validate it on an independent datasets. The results obtained show that the proposed method is an appropriate and promising method for predicting PPIs. |
format | Online Article Text |
id | pubmed-6110764 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-61107642018-08-30 Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions Wang, Lei You, Zhu-Hong Yan, Xin Xia, Shi-Xiong Liu, Feng Li, Li-Ping Zhang, Wei Zhou, Yong Sci Rep Article The interaction among proteins is essential in all life activities, and it is the basis of all the metabolic activities of the cells. By studying the protein-protein interactions (PPIs), people can better interpret the function of protein, decoding the phenomenon of life, especially in the design of new drugs with great practical value. Although many high-throughput techniques have been devised for large-scale detection of PPIs, these methods are still expensive and time-consuming. For this reason, there is a much-needed to develop computational methods for predicting PPIs at the entire proteome scale. In this article, we propose a new approach to predict PPIs using Rotation Forest (RF) classifier combine with matrix-based protein sequence. We apply the Position-Specific Scoring Matrix (PSSM), which contains biological evolution information, to represent protein sequences and extract the features through the two-dimensional Principal Component Analysis (2DPCA) algorithm. The descriptors are then sending to the rotation forest classifier for classification. We obtained 97.43% prediction accuracy with 94.92% sensitivity at the precision of 99.93% when the proposed method was applied to the PPIs data of yeast. To evaluate the performance of the proposed method, we compared it with other methods in the same dataset, and validate it on an independent datasets. The results obtained show that the proposed method is an appropriate and promising method for predicting PPIs. Nature Publishing Group UK 2018-08-27 /pmc/articles/PMC6110764/ /pubmed/30150728 http://dx.doi.org/10.1038/s41598-018-30694-1 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Wang, Lei You, Zhu-Hong Yan, Xin Xia, Shi-Xiong Liu, Feng Li, Li-Ping Zhang, Wei Zhou, Yong Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions |
title | Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions |
title_full | Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions |
title_fullStr | Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions |
title_full_unstemmed | Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions |
title_short | Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions |
title_sort | using two-dimensional principal component analysis and rotation forest for prediction of protein-protein interactions |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6110764/ https://www.ncbi.nlm.nih.gov/pubmed/30150728 http://dx.doi.org/10.1038/s41598-018-30694-1 |
work_keys_str_mv | AT wanglei usingtwodimensionalprincipalcomponentanalysisandrotationforestforpredictionofproteinproteininteractions AT youzhuhong usingtwodimensionalprincipalcomponentanalysisandrotationforestforpredictionofproteinproteininteractions AT yanxin usingtwodimensionalprincipalcomponentanalysisandrotationforestforpredictionofproteinproteininteractions AT xiashixiong usingtwodimensionalprincipalcomponentanalysisandrotationforestforpredictionofproteinproteininteractions AT liufeng usingtwodimensionalprincipalcomponentanalysisandrotationforestforpredictionofproteinproteininteractions AT liliping usingtwodimensionalprincipalcomponentanalysisandrotationforestforpredictionofproteinproteininteractions AT zhangwei usingtwodimensionalprincipalcomponentanalysisandrotationforestforpredictionofproteinproteininteractions AT zhouyong usingtwodimensionalprincipalcomponentanalysisandrotationforestforpredictionofproteinproteininteractions |