Cargando…

Predicting Apoptosis Protein Subcellular Locations based on the Protein Overlapping Property Matrix and Tri-Gram Encoding

To reveal the working pattern of programmed cell death, knowledge of the subcellular location of apoptosis proteins is essential. Besides the costly and time-consuming method of experimental determination, research into computational locating schemes, focusing mainly on the innovation of representat...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Yang, Zheng, Huiwen, Wang, Chunhua, Xiao, Wanyue, Liu, Taigang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6539631/
https://www.ncbi.nlm.nih.gov/pubmed/31083553
http://dx.doi.org/10.3390/ijms20092344
_version_ 1783422435980214272
author Yang, Yang
Zheng, Huiwen
Wang, Chunhua
Xiao, Wanyue
Liu, Taigang
author_facet Yang, Yang
Zheng, Huiwen
Wang, Chunhua
Xiao, Wanyue
Liu, Taigang
author_sort Yang, Yang
collection PubMed
description To reveal the working pattern of programmed cell death, knowledge of the subcellular location of apoptosis proteins is essential. Besides the costly and time-consuming method of experimental determination, research into computational locating schemes, focusing mainly on the innovation of representation techniques on protein sequences and the selection of classification algorithms, has become popular in recent decades. In this study, a novel tri-gram encoding model is proposed, which is based on using the protein overlapping property matrix (POPM) for predicting apoptosis protein subcellular location. Next, a 1000-dimensional feature vector is built to represent a protein. Finally, with the help of support vector machine-recursive feature elimination (SVM-RFE), we select the optimal features and put them into a support vector machine (SVM) classifier for predictions. The results of jackknife tests on two benchmark datasets demonstrate that our proposed method can achieve satisfactory prediction performance level with less computing capacity required and could work as a promising tool to predict the subcellular locations of apoptosis proteins.
format Online
Article
Text
id pubmed-6539631
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-65396312019-06-04 Predicting Apoptosis Protein Subcellular Locations based on the Protein Overlapping Property Matrix and Tri-Gram Encoding Yang, Yang Zheng, Huiwen Wang, Chunhua Xiao, Wanyue Liu, Taigang Int J Mol Sci Communication To reveal the working pattern of programmed cell death, knowledge of the subcellular location of apoptosis proteins is essential. Besides the costly and time-consuming method of experimental determination, research into computational locating schemes, focusing mainly on the innovation of representation techniques on protein sequences and the selection of classification algorithms, has become popular in recent decades. In this study, a novel tri-gram encoding model is proposed, which is based on using the protein overlapping property matrix (POPM) for predicting apoptosis protein subcellular location. Next, a 1000-dimensional feature vector is built to represent a protein. Finally, with the help of support vector machine-recursive feature elimination (SVM-RFE), we select the optimal features and put them into a support vector machine (SVM) classifier for predictions. The results of jackknife tests on two benchmark datasets demonstrate that our proposed method can achieve satisfactory prediction performance level with less computing capacity required and could work as a promising tool to predict the subcellular locations of apoptosis proteins. MDPI 2019-05-11 /pmc/articles/PMC6539631/ /pubmed/31083553 http://dx.doi.org/10.3390/ijms20092344 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Communication
Yang, Yang
Zheng, Huiwen
Wang, Chunhua
Xiao, Wanyue
Liu, Taigang
Predicting Apoptosis Protein Subcellular Locations based on the Protein Overlapping Property Matrix and Tri-Gram Encoding
title Predicting Apoptosis Protein Subcellular Locations based on the Protein Overlapping Property Matrix and Tri-Gram Encoding
title_full Predicting Apoptosis Protein Subcellular Locations based on the Protein Overlapping Property Matrix and Tri-Gram Encoding
title_fullStr Predicting Apoptosis Protein Subcellular Locations based on the Protein Overlapping Property Matrix and Tri-Gram Encoding
title_full_unstemmed Predicting Apoptosis Protein Subcellular Locations based on the Protein Overlapping Property Matrix and Tri-Gram Encoding
title_short Predicting Apoptosis Protein Subcellular Locations based on the Protein Overlapping Property Matrix and Tri-Gram Encoding
title_sort predicting apoptosis protein subcellular locations based on the protein overlapping property matrix and tri-gram encoding
topic Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6539631/
https://www.ncbi.nlm.nih.gov/pubmed/31083553
http://dx.doi.org/10.3390/ijms20092344
work_keys_str_mv AT yangyang predictingapoptosisproteinsubcellularlocationsbasedontheproteinoverlappingpropertymatrixandtrigramencoding
AT zhenghuiwen predictingapoptosisproteinsubcellularlocationsbasedontheproteinoverlappingpropertymatrixandtrigramencoding
AT wangchunhua predictingapoptosisproteinsubcellularlocationsbasedontheproteinoverlappingpropertymatrixandtrigramencoding
AT xiaowanyue predictingapoptosisproteinsubcellularlocationsbasedontheproteinoverlappingpropertymatrixandtrigramencoding
AT liutaigang predictingapoptosisproteinsubcellularlocationsbasedontheproteinoverlappingpropertymatrixandtrigramencoding