Cargando…

Real value prediction of protein solvent accessibility using enhanced PSSM features

BACKGROUND: Prediction of protein solvent accessibility, also called accessible surface area (ASA) prediction, is an important step for tertiary structure prediction directly from one-dimensional sequences. Traditionally, predicting solvent accessibility is regarded as either a two- (exposed or buri...

Descripción completa

Detalles Bibliográficos
Autores principales: Chang, Darby Tien-Hao, Huang, Hsuan-Yu, Syu, Yu-Tang, Wu, Chih-Peng
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2638152/
https://www.ncbi.nlm.nih.gov/pubmed/19091011
http://dx.doi.org/10.1186/1471-2105-9-S12-S12
_version_ 1782164396782387200
author Chang, Darby Tien-Hao
Huang, Hsuan-Yu
Syu, Yu-Tang
Wu, Chih-Peng
author_facet Chang, Darby Tien-Hao
Huang, Hsuan-Yu
Syu, Yu-Tang
Wu, Chih-Peng
author_sort Chang, Darby Tien-Hao
collection PubMed
description BACKGROUND: Prediction of protein solvent accessibility, also called accessible surface area (ASA) prediction, is an important step for tertiary structure prediction directly from one-dimensional sequences. Traditionally, predicting solvent accessibility is regarded as either a two- (exposed or buried) or three-state (exposed, intermediate or buried) classification problem. However, the states of solvent accessibility are not well-defined in real protein structures. Thus, a number of methods have been developed to directly predict the real value ASA based on evolutionary information such as position specific scoring matrix (PSSM). RESULTS: This study enhances the PSSM-based features for real value ASA prediction by considering the physicochemical properties and solvent propensities of amino acid types. We propose a systematic method for identifying residue groups with respect to protein solvent accessibility. The amino acid columns in the PSSM profile that belong to a certain residue group are merged to generate novel features. Finally, support vector regression (SVR) is adopted to construct a real value ASA predictor. Experimental results demonstrate that the features produced by the proposed selection process are informative for ASA prediction. CONCLUSION: Experimental results based on a widely used benchmark reveal that the proposed method performs best among several of existing packages for performing ASA prediction. Furthermore, the feature selection mechanism incorporated in this study can be applied to other regression problems using the PSSM. The program and data are available from the authors upon request.
format Text
id pubmed-2638152
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26381522009-02-24 Real value prediction of protein solvent accessibility using enhanced PSSM features Chang, Darby Tien-Hao Huang, Hsuan-Yu Syu, Yu-Tang Wu, Chih-Peng BMC Bioinformatics Research BACKGROUND: Prediction of protein solvent accessibility, also called accessible surface area (ASA) prediction, is an important step for tertiary structure prediction directly from one-dimensional sequences. Traditionally, predicting solvent accessibility is regarded as either a two- (exposed or buried) or three-state (exposed, intermediate or buried) classification problem. However, the states of solvent accessibility are not well-defined in real protein structures. Thus, a number of methods have been developed to directly predict the real value ASA based on evolutionary information such as position specific scoring matrix (PSSM). RESULTS: This study enhances the PSSM-based features for real value ASA prediction by considering the physicochemical properties and solvent propensities of amino acid types. We propose a systematic method for identifying residue groups with respect to protein solvent accessibility. The amino acid columns in the PSSM profile that belong to a certain residue group are merged to generate novel features. Finally, support vector regression (SVR) is adopted to construct a real value ASA predictor. Experimental results demonstrate that the features produced by the proposed selection process are informative for ASA prediction. CONCLUSION: Experimental results based on a widely used benchmark reveal that the proposed method performs best among several of existing packages for performing ASA prediction. Furthermore, the feature selection mechanism incorporated in this study can be applied to other regression problems using the PSSM. The program and data are available from the authors upon request. BioMed Central 2008-12-12 /pmc/articles/PMC2638152/ /pubmed/19091011 http://dx.doi.org/10.1186/1471-2105-9-S12-S12 Text en Copyright © 2008 Chang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Chang, Darby Tien-Hao
Huang, Hsuan-Yu
Syu, Yu-Tang
Wu, Chih-Peng
Real value prediction of protein solvent accessibility using enhanced PSSM features
title Real value prediction of protein solvent accessibility using enhanced PSSM features
title_full Real value prediction of protein solvent accessibility using enhanced PSSM features
title_fullStr Real value prediction of protein solvent accessibility using enhanced PSSM features
title_full_unstemmed Real value prediction of protein solvent accessibility using enhanced PSSM features
title_short Real value prediction of protein solvent accessibility using enhanced PSSM features
title_sort real value prediction of protein solvent accessibility using enhanced pssm features
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2638152/
https://www.ncbi.nlm.nih.gov/pubmed/19091011
http://dx.doi.org/10.1186/1471-2105-9-S12-S12
work_keys_str_mv AT changdarbytienhao realvaluepredictionofproteinsolventaccessibilityusingenhancedpssmfeatures
AT huanghsuanyu realvaluepredictionofproteinsolventaccessibilityusingenhancedpssmfeatures
AT syuyutang realvaluepredictionofproteinsolventaccessibilityusingenhancedpssmfeatures
AT wuchihpeng realvaluepredictionofproteinsolventaccessibilityusingenhancedpssmfeatures