Cargando…

Prediction of relative solvent accessibility by support vector regression and best-first method

Since, it is believed that the native structure of most proteins is defined by their sequences, utilizing data mining methods to extract hidden knowledge from protein sequences, are unavoidable. A major difficulty in mining bioinformatics data is due to the size of the datasets which contain frequen...

Descripción completa

Detalles Bibliográficos
Autores principales: Meshkin, Alireza, Ghafuri, Hossein
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Leibniz Research Centre for Working Environment and Human Factors 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5698889/
https://www.ncbi.nlm.nih.gov/pubmed/29255385
_version_ 1783280846208237568
author Meshkin, Alireza
Ghafuri, Hossein
author_facet Meshkin, Alireza
Ghafuri, Hossein
author_sort Meshkin, Alireza
collection PubMed
description Since, it is believed that the native structure of most proteins is defined by their sequences, utilizing data mining methods to extract hidden knowledge from protein sequences, are unavoidable. A major difficulty in mining bioinformatics data is due to the size of the datasets which contain frequently large numbers of variables. In this study, a two-step procedure for prediction of relative solvent accessibility of proteins is presented. In a first “feature selection” step, a small subset of evolutionary information is identified on the basis of selected physicochemical properties. In the second step, support vector regression is used to real value prediction of protein solvent accessibility with these custom selected features of evolutionary information. The experiment results show that the proposed method is an improvement in average prediction accuracy and training time.
format Online
Article
Text
id pubmed-5698889
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Leibniz Research Centre for Working Environment and Human Factors
record_format MEDLINE/PubMed
spelling pubmed-56988892017-12-18 Prediction of relative solvent accessibility by support vector regression and best-first method Meshkin, Alireza Ghafuri, Hossein EXCLI J Original Article Since, it is believed that the native structure of most proteins is defined by their sequences, utilizing data mining methods to extract hidden knowledge from protein sequences, are unavoidable. A major difficulty in mining bioinformatics data is due to the size of the datasets which contain frequently large numbers of variables. In this study, a two-step procedure for prediction of relative solvent accessibility of proteins is presented. In a first “feature selection” step, a small subset of evolutionary information is identified on the basis of selected physicochemical properties. In the second step, support vector regression is used to real value prediction of protein solvent accessibility with these custom selected features of evolutionary information. The experiment results show that the proposed method is an improvement in average prediction accuracy and training time. Leibniz Research Centre for Working Environment and Human Factors 2010-02-08 /pmc/articles/PMC5698889/ /pubmed/29255385 Text en Copyright © 2010 Meshkin et al. http://www.excli.de/documents/assignment_of_rights.pdf This is an Open Access article distributed under the following Assignment of Rights http://www.excli.de/documents/assignment_of_rights.pdf. You are free to copy, distribute and transmit the work, provided the original author and source are credited.
spellingShingle Original Article
Meshkin, Alireza
Ghafuri, Hossein
Prediction of relative solvent accessibility by support vector regression and best-first method
title Prediction of relative solvent accessibility by support vector regression and best-first method
title_full Prediction of relative solvent accessibility by support vector regression and best-first method
title_fullStr Prediction of relative solvent accessibility by support vector regression and best-first method
title_full_unstemmed Prediction of relative solvent accessibility by support vector regression and best-first method
title_short Prediction of relative solvent accessibility by support vector regression and best-first method
title_sort prediction of relative solvent accessibility by support vector regression and best-first method
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5698889/
https://www.ncbi.nlm.nih.gov/pubmed/29255385
work_keys_str_mv AT meshkinalireza predictionofrelativesolventaccessibilitybysupportvectorregressionandbestfirstmethod
AT ghafurihossein predictionofrelativesolventaccessibilitybysupportvectorregressionandbestfirstmethod