Cargando…

Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach

The prior knowledge of protein structural class may offer useful clues on understanding its functionality as well as its tertiary structure. Though various significant efforts have been made to find a fast and effective computational approach to address this problem, it is still a challenging topic...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Taigang, Qin, Yufang, Wang, Yongjie, Wang, Chunhua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4730262/
https://www.ncbi.nlm.nih.gov/pubmed/26712737
http://dx.doi.org/10.3390/ijms17010015
_version_ 1782412362039427072
author Liu, Taigang
Qin, Yufang
Wang, Yongjie
Wang, Chunhua
author_facet Liu, Taigang
Qin, Yufang
Wang, Yongjie
Wang, Chunhua
author_sort Liu, Taigang
collection PubMed
description The prior knowledge of protein structural class may offer useful clues on understanding its functionality as well as its tertiary structure. Though various significant efforts have been made to find a fast and effective computational approach to address this problem, it is still a challenging topic in the field of bioinformatics. The position-specific score matrix (PSSM) profile has been shown to provide a useful source of information for improving the prediction performance of protein structural class. However, this information has not been adequately explored. To this end, in this study, we present a feature extraction technique which is based on gapped-dipeptides composition computed directly from PSSM. Then, a careful feature selection technique is performed based on support vector machine-recursive feature elimination (SVM-RFE). These optimal features are selected to construct a final predictor. The results of jackknife tests on four working datasets show that our method obtains satisfactory prediction accuracies by extracting features solely based on PSSM and could serve as a very promising tool to predict protein structural class.
format Online
Article
Text
id pubmed-4730262
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-47302622016-02-11 Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach Liu, Taigang Qin, Yufang Wang, Yongjie Wang, Chunhua Int J Mol Sci Article The prior knowledge of protein structural class may offer useful clues on understanding its functionality as well as its tertiary structure. Though various significant efforts have been made to find a fast and effective computational approach to address this problem, it is still a challenging topic in the field of bioinformatics. The position-specific score matrix (PSSM) profile has been shown to provide a useful source of information for improving the prediction performance of protein structural class. However, this information has not been adequately explored. To this end, in this study, we present a feature extraction technique which is based on gapped-dipeptides composition computed directly from PSSM. Then, a careful feature selection technique is performed based on support vector machine-recursive feature elimination (SVM-RFE). These optimal features are selected to construct a final predictor. The results of jackknife tests on four working datasets show that our method obtains satisfactory prediction accuracies by extracting features solely based on PSSM and could serve as a very promising tool to predict protein structural class. MDPI 2015-12-24 /pmc/articles/PMC4730262/ /pubmed/26712737 http://dx.doi.org/10.3390/ijms17010015 Text en © 2015 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons by Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Liu, Taigang
Qin, Yufang
Wang, Yongjie
Wang, Chunhua
Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach
title Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach
title_full Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach
title_fullStr Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach
title_full_unstemmed Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach
title_short Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach
title_sort prediction of protein structural class based on gapped-dipeptides and a recursive feature selection approach
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4730262/
https://www.ncbi.nlm.nih.gov/pubmed/26712737
http://dx.doi.org/10.3390/ijms17010015
work_keys_str_mv AT liutaigang predictionofproteinstructuralclassbasedongappeddipeptidesandarecursivefeatureselectionapproach
AT qinyufang predictionofproteinstructuralclassbasedongappeddipeptidesandarecursivefeatureselectionapproach
AT wangyongjie predictionofproteinstructuralclassbasedongappeddipeptidesandarecursivefeatureselectionapproach
AT wangchunhua predictionofproteinstructuralclassbasedongappeddipeptidesandarecursivefeatureselectionapproach