Cargando…

Prediction of DNA-binding protein based on statistical and geometric features and support vector machines

BACKGROUND: Previous studies on protein-DNA interaction mostly focused on the bound structure of DNA-binding proteins but few paid enough attention to the unbound structures. As more new proteins are discovered, it is useful and imperative to develop algorithms for the functional prediction of unbou...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Weiqiang, Yan, Hong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3289070/
https://www.ncbi.nlm.nih.gov/pubmed/22166014
http://dx.doi.org/10.1186/1477-5956-9-S1-S1
_version_ 1782224844312543232
author Zhou, Weiqiang
Yan, Hong
author_facet Zhou, Weiqiang
Yan, Hong
author_sort Zhou, Weiqiang
collection PubMed
description BACKGROUND: Previous studies on protein-DNA interaction mostly focused on the bound structure of DNA-binding proteins but few paid enough attention to the unbound structures. As more new proteins are discovered, it is useful and imperative to develop algorithms for the functional prediction of unbound proteins. In our work, we apply an alpha shape model to represent the surface structure of the protein-DNA complex and extract useful statistical and geometric features, and use structural alignment and support vector machines for the prediction of unbound DNA-binding proteins. RESULTS: The performance of our method is evaluated by discriminating a set of 104 DNA-binding proteins from 401 non-DNA-binding proteins. In the same test, the proposed method outperforms the other method using conditional probability. The results achieved by our proposed method for; precision, 83.33%; accuracy, 86.53%; and MCC, 0.5368 demonstrate its good performance. CONCLUSIONS: In this study we develop an effective method for the prediction of protein-DNA interactions based on statistical and geometric features and support vector machines. Our results show that interface surface features play an important role in protein-DNA interaction. Our technique is able to predict unbound DNA-binding protein and discriminatory DNA-binding proteins from proteins that bind with other molecules.
format Online
Article
Text
id pubmed-3289070
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32890702012-02-29 Prediction of DNA-binding protein based on statistical and geometric features and support vector machines Zhou, Weiqiang Yan, Hong Proteome Sci Proceedings BACKGROUND: Previous studies on protein-DNA interaction mostly focused on the bound structure of DNA-binding proteins but few paid enough attention to the unbound structures. As more new proteins are discovered, it is useful and imperative to develop algorithms for the functional prediction of unbound proteins. In our work, we apply an alpha shape model to represent the surface structure of the protein-DNA complex and extract useful statistical and geometric features, and use structural alignment and support vector machines for the prediction of unbound DNA-binding proteins. RESULTS: The performance of our method is evaluated by discriminating a set of 104 DNA-binding proteins from 401 non-DNA-binding proteins. In the same test, the proposed method outperforms the other method using conditional probability. The results achieved by our proposed method for; precision, 83.33%; accuracy, 86.53%; and MCC, 0.5368 demonstrate its good performance. CONCLUSIONS: In this study we develop an effective method for the prediction of protein-DNA interactions based on statistical and geometric features and support vector machines. Our results show that interface surface features play an important role in protein-DNA interaction. Our technique is able to predict unbound DNA-binding protein and discriminatory DNA-binding proteins from proteins that bind with other molecules. BioMed Central 2011-10-14 /pmc/articles/PMC3289070/ /pubmed/22166014 http://dx.doi.org/10.1186/1477-5956-9-S1-S1 Text en Copyright ©2011 Zhou and Yan; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Zhou, Weiqiang
Yan, Hong
Prediction of DNA-binding protein based on statistical and geometric features and support vector machines
title Prediction of DNA-binding protein based on statistical and geometric features and support vector machines
title_full Prediction of DNA-binding protein based on statistical and geometric features and support vector machines
title_fullStr Prediction of DNA-binding protein based on statistical and geometric features and support vector machines
title_full_unstemmed Prediction of DNA-binding protein based on statistical and geometric features and support vector machines
title_short Prediction of DNA-binding protein based on statistical and geometric features and support vector machines
title_sort prediction of dna-binding protein based on statistical and geometric features and support vector machines
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3289070/
https://www.ncbi.nlm.nih.gov/pubmed/22166014
http://dx.doi.org/10.1186/1477-5956-9-S1-S1
work_keys_str_mv AT zhouweiqiang predictionofdnabindingproteinbasedonstatisticalandgeometricfeaturesandsupportvectormachines
AT yanhong predictionofdnabindingproteinbasedonstatisticalandgeometricfeaturesandsupportvectormachines