Cargando…

Prediction of the functional class of metal-binding proteins from sequence derived physicochemical properties by support vector machine approach

Metal-binding proteins play important roles in structural stability, signaling, regulation, transport, immune response, metabolism control, and metal homeostasis. Because of their functional and sequence diversity, it is desirable to explore additional methods for predicting metal-binding proteins i...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, HH, Han, LY, Zhang, HL, Zheng, CJ, Xie, B, Cao, ZW, Chen, YZ
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1764469/
https://www.ncbi.nlm.nih.gov/pubmed/17254297
http://dx.doi.org/10.1186/1471-2105-7-S5-S13
_version_ 1782131617305722880
author Lin, HH
Han, LY
Zhang, HL
Zheng, CJ
Xie, B
Cao, ZW
Chen, YZ
author_facet Lin, HH
Han, LY
Zhang, HL
Zheng, CJ
Xie, B
Cao, ZW
Chen, YZ
author_sort Lin, HH
collection PubMed
description Metal-binding proteins play important roles in structural stability, signaling, regulation, transport, immune response, metabolism control, and metal homeostasis. Because of their functional and sequence diversity, it is desirable to explore additional methods for predicting metal-binding proteins irrespective of sequence similarity. This work explores support vector machines (SVM) as such a method. SVM prediction systems were developed by using 53,333 metal-binding and 147,347 non-metal-binding proteins, and evaluated by an independent set of 31,448 metal-binding and 79,051 non-metal-binding proteins. The computed prediction accuracy is 86.3%, 81.6%, 83.5%, 94.0%, 81.2%, 85.4%, 77.6%, 90.4%, 90.9%, 74.9% and 78.1% for calcium-binding, cobalt-binding, copper-binding, iron-binding, magnesium-binding, manganese-binding, nickel-binding, potassium-binding, sodium-binding, zinc-binding, and all metal-binding proteins respectively. The accuracy for the non-member proteins of each class is 88.2%, 99.9%, 98.1%, 91.4%, 87.9%, 94.5%, 99.2%, 99.9%, 99.9%, 98.0%, and 88.0% respectively. Comparable accuracies were obtained by using a different SVM kernel function. Our method predicts 67% of the 87 metal-binding proteins non-homologous to any protein in the Swissprot database and 85.3% of the 333 proteins of known metal-binding domains as metal-binding. These suggest the usefulness of SVM for facilitating the prediction of metal-binding proteins. Our software can be accessed at the SVMProt server .
format Text
id pubmed-1764469
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-17644692007-01-09 Prediction of the functional class of metal-binding proteins from sequence derived physicochemical properties by support vector machine approach Lin, HH Han, LY Zhang, HL Zheng, CJ Xie, B Cao, ZW Chen, YZ BMC Bioinformatics Proceedings Metal-binding proteins play important roles in structural stability, signaling, regulation, transport, immune response, metabolism control, and metal homeostasis. Because of their functional and sequence diversity, it is desirable to explore additional methods for predicting metal-binding proteins irrespective of sequence similarity. This work explores support vector machines (SVM) as such a method. SVM prediction systems were developed by using 53,333 metal-binding and 147,347 non-metal-binding proteins, and evaluated by an independent set of 31,448 metal-binding and 79,051 non-metal-binding proteins. The computed prediction accuracy is 86.3%, 81.6%, 83.5%, 94.0%, 81.2%, 85.4%, 77.6%, 90.4%, 90.9%, 74.9% and 78.1% for calcium-binding, cobalt-binding, copper-binding, iron-binding, magnesium-binding, manganese-binding, nickel-binding, potassium-binding, sodium-binding, zinc-binding, and all metal-binding proteins respectively. The accuracy for the non-member proteins of each class is 88.2%, 99.9%, 98.1%, 91.4%, 87.9%, 94.5%, 99.2%, 99.9%, 99.9%, 98.0%, and 88.0% respectively. Comparable accuracies were obtained by using a different SVM kernel function. Our method predicts 67% of the 87 metal-binding proteins non-homologous to any protein in the Swissprot database and 85.3% of the 333 proteins of known metal-binding domains as metal-binding. These suggest the usefulness of SVM for facilitating the prediction of metal-binding proteins. Our software can be accessed at the SVMProt server . BioMed Central 2006-12-18 /pmc/articles/PMC1764469/ /pubmed/17254297 http://dx.doi.org/10.1186/1471-2105-7-S5-S13 Text en Copyright © 2006 Lin et al; licensee BioMed Central Ltd http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Lin, HH
Han, LY
Zhang, HL
Zheng, CJ
Xie, B
Cao, ZW
Chen, YZ
Prediction of the functional class of metal-binding proteins from sequence derived physicochemical properties by support vector machine approach
title Prediction of the functional class of metal-binding proteins from sequence derived physicochemical properties by support vector machine approach
title_full Prediction of the functional class of metal-binding proteins from sequence derived physicochemical properties by support vector machine approach
title_fullStr Prediction of the functional class of metal-binding proteins from sequence derived physicochemical properties by support vector machine approach
title_full_unstemmed Prediction of the functional class of metal-binding proteins from sequence derived physicochemical properties by support vector machine approach
title_short Prediction of the functional class of metal-binding proteins from sequence derived physicochemical properties by support vector machine approach
title_sort prediction of the functional class of metal-binding proteins from sequence derived physicochemical properties by support vector machine approach
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1764469/
https://www.ncbi.nlm.nih.gov/pubmed/17254297
http://dx.doi.org/10.1186/1471-2105-7-S5-S13
work_keys_str_mv AT linhh predictionofthefunctionalclassofmetalbindingproteinsfromsequencederivedphysicochemicalpropertiesbysupportvectormachineapproach
AT hanly predictionofthefunctionalclassofmetalbindingproteinsfromsequencederivedphysicochemicalpropertiesbysupportvectormachineapproach
AT zhanghl predictionofthefunctionalclassofmetalbindingproteinsfromsequencederivedphysicochemicalpropertiesbysupportvectormachineapproach
AT zhengcj predictionofthefunctionalclassofmetalbindingproteinsfromsequencederivedphysicochemicalpropertiesbysupportvectormachineapproach
AT xieb predictionofthefunctionalclassofmetalbindingproteinsfromsequencederivedphysicochemicalpropertiesbysupportvectormachineapproach
AT caozw predictionofthefunctionalclassofmetalbindingproteinsfromsequencederivedphysicochemicalpropertiesbysupportvectormachineapproach
AT chenyz predictionofthefunctionalclassofmetalbindingproteinsfromsequencederivedphysicochemicalpropertiesbysupportvectormachineapproach