Cargando…

Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix

Apoptosis proteins are strongly related to many diseases and play an indispensable role in maintaining the dynamic balance between cell death and division in vivo. Obtaining localization information on apoptosis proteins is necessary in understanding their function. To date, few researchers have foc...

Descripción completa

Detalles Bibliográficos
Autores principales: Ruan, Xiaoli, Zhou, Dongming, Nie, Rencan, Guo, Yanbu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7201498/
https://www.ncbi.nlm.nih.gov/pubmed/32420339
http://dx.doi.org/10.1155/2020/4071508
_version_ 1783529545565995008
author Ruan, Xiaoli
Zhou, Dongming
Nie, Rencan
Guo, Yanbu
author_facet Ruan, Xiaoli
Zhou, Dongming
Nie, Rencan
Guo, Yanbu
author_sort Ruan, Xiaoli
collection PubMed
description Apoptosis proteins are strongly related to many diseases and play an indispensable role in maintaining the dynamic balance between cell death and division in vivo. Obtaining localization information on apoptosis proteins is necessary in understanding their function. To date, few researchers have focused on the problem of apoptosis data imbalance before classification, while this data imbalance is prone to misclassification. Therefore, in this work, we introduce a method to resolve this problem and to enhance prediction accuracy. Firstly, the features of the protein sequence are captured by combining Improving Pseudo-Position-Specific Scoring Matrix (IM-Psepssm) with the Bidirectional Correlation Coefficient (Bid-CC) algorithm from position-specific scoring matrix. Secondly, different features of fusion and resampling strategies are used to reduce the impact of imbalance on apoptosis protein datasets. Finally, the eigenvector adopts the Support Vector Machine (SVM) to the training classification model, and the prediction accuracy is evaluated by jackknife cross-validation tests. The experimental results indicate that, under the same feature vector, adopting resampling methods remarkably boosts many significant indicators in the unsampling method for predicting the localization of apoptosis proteins in the ZD98, ZW225, and CL317 databases. Additionally, we also present new user-friendly local software for readers to apply; the codes and software can be freely accessed at https://github.com/ruanxiaoli/Im-Psepssm.
format Online
Article
Text
id pubmed-7201498
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-72014982020-05-15 Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix Ruan, Xiaoli Zhou, Dongming Nie, Rencan Guo, Yanbu Biomed Res Int Research Article Apoptosis proteins are strongly related to many diseases and play an indispensable role in maintaining the dynamic balance between cell death and division in vivo. Obtaining localization information on apoptosis proteins is necessary in understanding their function. To date, few researchers have focused on the problem of apoptosis data imbalance before classification, while this data imbalance is prone to misclassification. Therefore, in this work, we introduce a method to resolve this problem and to enhance prediction accuracy. Firstly, the features of the protein sequence are captured by combining Improving Pseudo-Position-Specific Scoring Matrix (IM-Psepssm) with the Bidirectional Correlation Coefficient (Bid-CC) algorithm from position-specific scoring matrix. Secondly, different features of fusion and resampling strategies are used to reduce the impact of imbalance on apoptosis protein datasets. Finally, the eigenvector adopts the Support Vector Machine (SVM) to the training classification model, and the prediction accuracy is evaluated by jackknife cross-validation tests. The experimental results indicate that, under the same feature vector, adopting resampling methods remarkably boosts many significant indicators in the unsampling method for predicting the localization of apoptosis proteins in the ZD98, ZW225, and CL317 databases. Additionally, we also present new user-friendly local software for readers to apply; the codes and software can be freely accessed at https://github.com/ruanxiaoli/Im-Psepssm. Hindawi 2020-01-14 /pmc/articles/PMC7201498/ /pubmed/32420339 http://dx.doi.org/10.1155/2020/4071508 Text en Copyright © 2020 Xiaoli Ruan et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Ruan, Xiaoli
Zhou, Dongming
Nie, Rencan
Guo, Yanbu
Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix
title Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix
title_full Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix
title_fullStr Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix
title_full_unstemmed Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix
title_short Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix
title_sort predictions of apoptosis proteins by integrating different features based on improving pseudo-position-specific scoring matrix
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7201498/
https://www.ncbi.nlm.nih.gov/pubmed/32420339
http://dx.doi.org/10.1155/2020/4071508
work_keys_str_mv AT ruanxiaoli predictionsofapoptosisproteinsbyintegratingdifferentfeaturesbasedonimprovingpseudopositionspecificscoringmatrix
AT zhoudongming predictionsofapoptosisproteinsbyintegratingdifferentfeaturesbasedonimprovingpseudopositionspecificscoringmatrix
AT nierencan predictionsofapoptosisproteinsbyintegratingdifferentfeaturesbasedonimprovingpseudopositionspecificscoringmatrix
AT guoyanbu predictionsofapoptosisproteinsbyintegratingdifferentfeaturesbasedonimprovingpseudopositionspecificscoringmatrix