Cargando…
Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix
Apoptosis proteins are strongly related to many diseases and play an indispensable role in maintaining the dynamic balance between cell death and division in vivo. Obtaining localization information on apoptosis proteins is necessary in understanding their function. To date, few researchers have foc...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7201498/ https://www.ncbi.nlm.nih.gov/pubmed/32420339 http://dx.doi.org/10.1155/2020/4071508 |
_version_ | 1783529545565995008 |
---|---|
author | Ruan, Xiaoli Zhou, Dongming Nie, Rencan Guo, Yanbu |
author_facet | Ruan, Xiaoli Zhou, Dongming Nie, Rencan Guo, Yanbu |
author_sort | Ruan, Xiaoli |
collection | PubMed |
description | Apoptosis proteins are strongly related to many diseases and play an indispensable role in maintaining the dynamic balance between cell death and division in vivo. Obtaining localization information on apoptosis proteins is necessary in understanding their function. To date, few researchers have focused on the problem of apoptosis data imbalance before classification, while this data imbalance is prone to misclassification. Therefore, in this work, we introduce a method to resolve this problem and to enhance prediction accuracy. Firstly, the features of the protein sequence are captured by combining Improving Pseudo-Position-Specific Scoring Matrix (IM-Psepssm) with the Bidirectional Correlation Coefficient (Bid-CC) algorithm from position-specific scoring matrix. Secondly, different features of fusion and resampling strategies are used to reduce the impact of imbalance on apoptosis protein datasets. Finally, the eigenvector adopts the Support Vector Machine (SVM) to the training classification model, and the prediction accuracy is evaluated by jackknife cross-validation tests. The experimental results indicate that, under the same feature vector, adopting resampling methods remarkably boosts many significant indicators in the unsampling method for predicting the localization of apoptosis proteins in the ZD98, ZW225, and CL317 databases. Additionally, we also present new user-friendly local software for readers to apply; the codes and software can be freely accessed at https://github.com/ruanxiaoli/Im-Psepssm. |
format | Online Article Text |
id | pubmed-7201498 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-72014982020-05-15 Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix Ruan, Xiaoli Zhou, Dongming Nie, Rencan Guo, Yanbu Biomed Res Int Research Article Apoptosis proteins are strongly related to many diseases and play an indispensable role in maintaining the dynamic balance between cell death and division in vivo. Obtaining localization information on apoptosis proteins is necessary in understanding their function. To date, few researchers have focused on the problem of apoptosis data imbalance before classification, while this data imbalance is prone to misclassification. Therefore, in this work, we introduce a method to resolve this problem and to enhance prediction accuracy. Firstly, the features of the protein sequence are captured by combining Improving Pseudo-Position-Specific Scoring Matrix (IM-Psepssm) with the Bidirectional Correlation Coefficient (Bid-CC) algorithm from position-specific scoring matrix. Secondly, different features of fusion and resampling strategies are used to reduce the impact of imbalance on apoptosis protein datasets. Finally, the eigenvector adopts the Support Vector Machine (SVM) to the training classification model, and the prediction accuracy is evaluated by jackknife cross-validation tests. The experimental results indicate that, under the same feature vector, adopting resampling methods remarkably boosts many significant indicators in the unsampling method for predicting the localization of apoptosis proteins in the ZD98, ZW225, and CL317 databases. Additionally, we also present new user-friendly local software for readers to apply; the codes and software can be freely accessed at https://github.com/ruanxiaoli/Im-Psepssm. Hindawi 2020-01-14 /pmc/articles/PMC7201498/ /pubmed/32420339 http://dx.doi.org/10.1155/2020/4071508 Text en Copyright © 2020 Xiaoli Ruan et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Ruan, Xiaoli Zhou, Dongming Nie, Rencan Guo, Yanbu Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix |
title | Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix |
title_full | Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix |
title_fullStr | Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix |
title_full_unstemmed | Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix |
title_short | Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix |
title_sort | predictions of apoptosis proteins by integrating different features based on improving pseudo-position-specific scoring matrix |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7201498/ https://www.ncbi.nlm.nih.gov/pubmed/32420339 http://dx.doi.org/10.1155/2020/4071508 |
work_keys_str_mv | AT ruanxiaoli predictionsofapoptosisproteinsbyintegratingdifferentfeaturesbasedonimprovingpseudopositionspecificscoringmatrix AT zhoudongming predictionsofapoptosisproteinsbyintegratingdifferentfeaturesbasedonimprovingpseudopositionspecificscoringmatrix AT nierencan predictionsofapoptosisproteinsbyintegratingdifferentfeaturesbasedonimprovingpseudopositionspecificscoringmatrix AT guoyanbu predictionsofapoptosisproteinsbyintegratingdifferentfeaturesbasedonimprovingpseudopositionspecificscoringmatrix |