Cargando…

svmPRAT: SVM-based Protein Residue Annotation Toolkit

BACKGROUND: Over the last decade several prediction methods have been developed for determining the structural and functional properties of individual protein residues using sequence and sequence-derived information. Most of these methods are based on support vector machines as they provide accurate...

Descripción completa

Detalles Bibliográficos
Autores principales: Rangwala, Huzefa, Kauffman, Christopher, Karypis, George
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2805646/
https://www.ncbi.nlm.nih.gov/pubmed/20028521
http://dx.doi.org/10.1186/1471-2105-10-439
_version_ 1782176210568085504
author Rangwala, Huzefa
Kauffman, Christopher
Karypis, George
author_facet Rangwala, Huzefa
Kauffman, Christopher
Karypis, George
author_sort Rangwala, Huzefa
collection PubMed
description BACKGROUND: Over the last decade several prediction methods have been developed for determining the structural and functional properties of individual protein residues using sequence and sequence-derived information. Most of these methods are based on support vector machines as they provide accurate and generalizable prediction models. RESULTS: We present a general purpose protein residue annotation toolkit (svmPRAT) to allow biologists to formulate residue-wise prediction problems. svmPRAT formulates the annotation problem as a classification or regression problem using support vector machines. One of the key features of svmPRAT is its ease of use in incorporating any user-provided information in the form of feature matrices. For every residue svmPRAT captures local information around the reside to create fixed length feature vectors. svmPRAT implements accurate and fast kernel functions, and also introduces a flexible window-based encoding scheme that accurately captures signals and pattern for training effective predictive models. CONCLUSIONS: In this work we evaluate svmPRAT on several classification and regression problems including disorder prediction, residue-wise contact order estimation, DNA-binding site prediction, and local structure alphabet prediction. svmPRAT has also been used for the development of state-of-the-art transmembrane helix prediction method called TOPTMH, and secondary structure prediction method called YASSPP. This toolkit developed provides practitioners an efficient and easy-to-use tool for a wide variety of annotation problems. Availability: http://www.cs.gmu.edu/~mlbio/svmprat
format Text
id pubmed-2805646
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28056462010-01-13 svmPRAT: SVM-based Protein Residue Annotation Toolkit Rangwala, Huzefa Kauffman, Christopher Karypis, George BMC Bioinformatics Software BACKGROUND: Over the last decade several prediction methods have been developed for determining the structural and functional properties of individual protein residues using sequence and sequence-derived information. Most of these methods are based on support vector machines as they provide accurate and generalizable prediction models. RESULTS: We present a general purpose protein residue annotation toolkit (svmPRAT) to allow biologists to formulate residue-wise prediction problems. svmPRAT formulates the annotation problem as a classification or regression problem using support vector machines. One of the key features of svmPRAT is its ease of use in incorporating any user-provided information in the form of feature matrices. For every residue svmPRAT captures local information around the reside to create fixed length feature vectors. svmPRAT implements accurate and fast kernel functions, and also introduces a flexible window-based encoding scheme that accurately captures signals and pattern for training effective predictive models. CONCLUSIONS: In this work we evaluate svmPRAT on several classification and regression problems including disorder prediction, residue-wise contact order estimation, DNA-binding site prediction, and local structure alphabet prediction. svmPRAT has also been used for the development of state-of-the-art transmembrane helix prediction method called TOPTMH, and secondary structure prediction method called YASSPP. This toolkit developed provides practitioners an efficient and easy-to-use tool for a wide variety of annotation problems. Availability: http://www.cs.gmu.edu/~mlbio/svmprat BioMed Central 2009-12-22 /pmc/articles/PMC2805646/ /pubmed/20028521 http://dx.doi.org/10.1186/1471-2105-10-439 Text en Copyright ©2009 Rangwala et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Rangwala, Huzefa
Kauffman, Christopher
Karypis, George
svmPRAT: SVM-based Protein Residue Annotation Toolkit
title svmPRAT: SVM-based Protein Residue Annotation Toolkit
title_full svmPRAT: SVM-based Protein Residue Annotation Toolkit
title_fullStr svmPRAT: SVM-based Protein Residue Annotation Toolkit
title_full_unstemmed svmPRAT: SVM-based Protein Residue Annotation Toolkit
title_short svmPRAT: SVM-based Protein Residue Annotation Toolkit
title_sort svmprat: svm-based protein residue annotation toolkit
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2805646/
https://www.ncbi.nlm.nih.gov/pubmed/20028521
http://dx.doi.org/10.1186/1471-2105-10-439
work_keys_str_mv AT rangwalahuzefa svmpratsvmbasedproteinresidueannotationtoolkit
AT kauffmanchristopher svmpratsvmbasedproteinresidueannotationtoolkit
AT karypisgeorge svmpratsvmbasedproteinresidueannotationtoolkit