Cargando…
Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach
The prior knowledge of protein structural class may offer useful clues on understanding its functionality as well as its tertiary structure. Though various significant efforts have been made to find a fast and effective computational approach to address this problem, it is still a challenging topic...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4730262/ https://www.ncbi.nlm.nih.gov/pubmed/26712737 http://dx.doi.org/10.3390/ijms17010015 |
_version_ | 1782412362039427072 |
---|---|
author | Liu, Taigang Qin, Yufang Wang, Yongjie Wang, Chunhua |
author_facet | Liu, Taigang Qin, Yufang Wang, Yongjie Wang, Chunhua |
author_sort | Liu, Taigang |
collection | PubMed |
description | The prior knowledge of protein structural class may offer useful clues on understanding its functionality as well as its tertiary structure. Though various significant efforts have been made to find a fast and effective computational approach to address this problem, it is still a challenging topic in the field of bioinformatics. The position-specific score matrix (PSSM) profile has been shown to provide a useful source of information for improving the prediction performance of protein structural class. However, this information has not been adequately explored. To this end, in this study, we present a feature extraction technique which is based on gapped-dipeptides composition computed directly from PSSM. Then, a careful feature selection technique is performed based on support vector machine-recursive feature elimination (SVM-RFE). These optimal features are selected to construct a final predictor. The results of jackknife tests on four working datasets show that our method obtains satisfactory prediction accuracies by extracting features solely based on PSSM and could serve as a very promising tool to predict protein structural class. |
format | Online Article Text |
id | pubmed-4730262 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-47302622016-02-11 Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach Liu, Taigang Qin, Yufang Wang, Yongjie Wang, Chunhua Int J Mol Sci Article The prior knowledge of protein structural class may offer useful clues on understanding its functionality as well as its tertiary structure. Though various significant efforts have been made to find a fast and effective computational approach to address this problem, it is still a challenging topic in the field of bioinformatics. The position-specific score matrix (PSSM) profile has been shown to provide a useful source of information for improving the prediction performance of protein structural class. However, this information has not been adequately explored. To this end, in this study, we present a feature extraction technique which is based on gapped-dipeptides composition computed directly from PSSM. Then, a careful feature selection technique is performed based on support vector machine-recursive feature elimination (SVM-RFE). These optimal features are selected to construct a final predictor. The results of jackknife tests on four working datasets show that our method obtains satisfactory prediction accuracies by extracting features solely based on PSSM and could serve as a very promising tool to predict protein structural class. MDPI 2015-12-24 /pmc/articles/PMC4730262/ /pubmed/26712737 http://dx.doi.org/10.3390/ijms17010015 Text en © 2015 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons by Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Liu, Taigang Qin, Yufang Wang, Yongjie Wang, Chunhua Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach |
title | Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach |
title_full | Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach |
title_fullStr | Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach |
title_full_unstemmed | Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach |
title_short | Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach |
title_sort | prediction of protein structural class based on gapped-dipeptides and a recursive feature selection approach |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4730262/ https://www.ncbi.nlm.nih.gov/pubmed/26712737 http://dx.doi.org/10.3390/ijms17010015 |
work_keys_str_mv | AT liutaigang predictionofproteinstructuralclassbasedongappeddipeptidesandarecursivefeatureselectionapproach AT qinyufang predictionofproteinstructuralclassbasedongappeddipeptidesandarecursivefeatureselectionapproach AT wangyongjie predictionofproteinstructuralclassbasedongappeddipeptidesandarecursivefeatureselectionapproach AT wangchunhua predictionofproteinstructuralclassbasedongappeddipeptidesandarecursivefeatureselectionapproach |