Cargando…
Prediction of protein binding sites in protein structures using hidden Markov support vector machine
BACKGROUND: Predicting the binding sites between two interacting proteins provides important clues to the function of a protein. Recent research on protein binding site prediction has been mainly based on widely known machine learning techniques, such as artificial neural networks, support vector ma...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2785799/ https://www.ncbi.nlm.nih.gov/pubmed/19925685 http://dx.doi.org/10.1186/1471-2105-10-381 |
_version_ | 1782174837251244032 |
---|---|
author | Liu, Bin Wang, Xiaolong Lin, Lei Tang, Buzhou Dong, Qiwen Wang, Xuan |
author_facet | Liu, Bin Wang, Xiaolong Lin, Lei Tang, Buzhou Dong, Qiwen Wang, Xuan |
author_sort | Liu, Bin |
collection | PubMed |
description | BACKGROUND: Predicting the binding sites between two interacting proteins provides important clues to the function of a protein. Recent research on protein binding site prediction has been mainly based on widely known machine learning techniques, such as artificial neural networks, support vector machines, conditional random field, etc. However, the prediction performance is still too low to be used in practice. It is necessary to explore new algorithms, theories and features to further improve the performance. RESULTS: In this study, we introduce a novel machine learning model hidden Markov support vector machine for protein binding site prediction. The model treats the protein binding site prediction as a sequential labelling task based on the maximum margin criterion. Common features derived from protein sequences and structures, including protein sequence profile and residue accessible surface area, are used to train hidden Markov support vector machine. When tested on six data sets, the method based on hidden Markov support vector machine shows better performance than some state-of-the-art methods, including artificial neural networks, support vector machines and conditional random field. Furthermore, its running time is several orders of magnitude shorter than that of the compared methods. CONCLUSION: The improved prediction performance and computational efficiency of the method based on hidden Markov support vector machine can be attributed to the following three factors. Firstly, the relation between labels of neighbouring residues is useful for protein binding site prediction. Secondly, the kernel trick is very advantageous to this field. Thirdly, the complexity of the training step for hidden Markov support vector machine is linear with the number of training samples by using the cutting-plane algorithm. |
format | Text |
id | pubmed-2785799 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-27857992009-12-01 Prediction of protein binding sites in protein structures using hidden Markov support vector machine Liu, Bin Wang, Xiaolong Lin, Lei Tang, Buzhou Dong, Qiwen Wang, Xuan BMC Bioinformatics Research article BACKGROUND: Predicting the binding sites between two interacting proteins provides important clues to the function of a protein. Recent research on protein binding site prediction has been mainly based on widely known machine learning techniques, such as artificial neural networks, support vector machines, conditional random field, etc. However, the prediction performance is still too low to be used in practice. It is necessary to explore new algorithms, theories and features to further improve the performance. RESULTS: In this study, we introduce a novel machine learning model hidden Markov support vector machine for protein binding site prediction. The model treats the protein binding site prediction as a sequential labelling task based on the maximum margin criterion. Common features derived from protein sequences and structures, including protein sequence profile and residue accessible surface area, are used to train hidden Markov support vector machine. When tested on six data sets, the method based on hidden Markov support vector machine shows better performance than some state-of-the-art methods, including artificial neural networks, support vector machines and conditional random field. Furthermore, its running time is several orders of magnitude shorter than that of the compared methods. CONCLUSION: The improved prediction performance and computational efficiency of the method based on hidden Markov support vector machine can be attributed to the following three factors. Firstly, the relation between labels of neighbouring residues is useful for protein binding site prediction. Secondly, the kernel trick is very advantageous to this field. Thirdly, the complexity of the training step for hidden Markov support vector machine is linear with the number of training samples by using the cutting-plane algorithm. BioMed Central 2009-11-20 /pmc/articles/PMC2785799/ /pubmed/19925685 http://dx.doi.org/10.1186/1471-2105-10-381 Text en Copyright ©2009 Liu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research article Liu, Bin Wang, Xiaolong Lin, Lei Tang, Buzhou Dong, Qiwen Wang, Xuan Prediction of protein binding sites in protein structures using hidden Markov support vector machine |
title | Prediction of protein binding sites in protein structures using hidden Markov support vector machine |
title_full | Prediction of protein binding sites in protein structures using hidden Markov support vector machine |
title_fullStr | Prediction of protein binding sites in protein structures using hidden Markov support vector machine |
title_full_unstemmed | Prediction of protein binding sites in protein structures using hidden Markov support vector machine |
title_short | Prediction of protein binding sites in protein structures using hidden Markov support vector machine |
title_sort | prediction of protein binding sites in protein structures using hidden markov support vector machine |
topic | Research article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2785799/ https://www.ncbi.nlm.nih.gov/pubmed/19925685 http://dx.doi.org/10.1186/1471-2105-10-381 |
work_keys_str_mv | AT liubin predictionofproteinbindingsitesinproteinstructuresusinghiddenmarkovsupportvectormachine AT wangxiaolong predictionofproteinbindingsitesinproteinstructuresusinghiddenmarkovsupportvectormachine AT linlei predictionofproteinbindingsitesinproteinstructuresusinghiddenmarkovsupportvectormachine AT tangbuzhou predictionofproteinbindingsitesinproteinstructuresusinghiddenmarkovsupportvectormachine AT dongqiwen predictionofproteinbindingsitesinproteinstructuresusinghiddenmarkovsupportvectormachine AT wangxuan predictionofproteinbindingsitesinproteinstructuresusinghiddenmarkovsupportvectormachine |