Cargando…
Prediction of DNA-binding protein based on statistical and geometric features and support vector machines
BACKGROUND: Previous studies on protein-DNA interaction mostly focused on the bound structure of DNA-binding proteins but few paid enough attention to the unbound structures. As more new proteins are discovered, it is useful and imperative to develop algorithms for the functional prediction of unbou...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3289070/ https://www.ncbi.nlm.nih.gov/pubmed/22166014 http://dx.doi.org/10.1186/1477-5956-9-S1-S1 |
_version_ | 1782224844312543232 |
---|---|
author | Zhou, Weiqiang Yan, Hong |
author_facet | Zhou, Weiqiang Yan, Hong |
author_sort | Zhou, Weiqiang |
collection | PubMed |
description | BACKGROUND: Previous studies on protein-DNA interaction mostly focused on the bound structure of DNA-binding proteins but few paid enough attention to the unbound structures. As more new proteins are discovered, it is useful and imperative to develop algorithms for the functional prediction of unbound proteins. In our work, we apply an alpha shape model to represent the surface structure of the protein-DNA complex and extract useful statistical and geometric features, and use structural alignment and support vector machines for the prediction of unbound DNA-binding proteins. RESULTS: The performance of our method is evaluated by discriminating a set of 104 DNA-binding proteins from 401 non-DNA-binding proteins. In the same test, the proposed method outperforms the other method using conditional probability. The results achieved by our proposed method for; precision, 83.33%; accuracy, 86.53%; and MCC, 0.5368 demonstrate its good performance. CONCLUSIONS: In this study we develop an effective method for the prediction of protein-DNA interactions based on statistical and geometric features and support vector machines. Our results show that interface surface features play an important role in protein-DNA interaction. Our technique is able to predict unbound DNA-binding protein and discriminatory DNA-binding proteins from proteins that bind with other molecules. |
format | Online Article Text |
id | pubmed-3289070 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32890702012-02-29 Prediction of DNA-binding protein based on statistical and geometric features and support vector machines Zhou, Weiqiang Yan, Hong Proteome Sci Proceedings BACKGROUND: Previous studies on protein-DNA interaction mostly focused on the bound structure of DNA-binding proteins but few paid enough attention to the unbound structures. As more new proteins are discovered, it is useful and imperative to develop algorithms for the functional prediction of unbound proteins. In our work, we apply an alpha shape model to represent the surface structure of the protein-DNA complex and extract useful statistical and geometric features, and use structural alignment and support vector machines for the prediction of unbound DNA-binding proteins. RESULTS: The performance of our method is evaluated by discriminating a set of 104 DNA-binding proteins from 401 non-DNA-binding proteins. In the same test, the proposed method outperforms the other method using conditional probability. The results achieved by our proposed method for; precision, 83.33%; accuracy, 86.53%; and MCC, 0.5368 demonstrate its good performance. CONCLUSIONS: In this study we develop an effective method for the prediction of protein-DNA interactions based on statistical and geometric features and support vector machines. Our results show that interface surface features play an important role in protein-DNA interaction. Our technique is able to predict unbound DNA-binding protein and discriminatory DNA-binding proteins from proteins that bind with other molecules. BioMed Central 2011-10-14 /pmc/articles/PMC3289070/ /pubmed/22166014 http://dx.doi.org/10.1186/1477-5956-9-S1-S1 Text en Copyright ©2011 Zhou and Yan; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Zhou, Weiqiang Yan, Hong Prediction of DNA-binding protein based on statistical and geometric features and support vector machines |
title | Prediction of DNA-binding protein based on statistical and geometric features and support vector machines |
title_full | Prediction of DNA-binding protein based on statistical and geometric features and support vector machines |
title_fullStr | Prediction of DNA-binding protein based on statistical and geometric features and support vector machines |
title_full_unstemmed | Prediction of DNA-binding protein based on statistical and geometric features and support vector machines |
title_short | Prediction of DNA-binding protein based on statistical and geometric features and support vector machines |
title_sort | prediction of dna-binding protein based on statistical and geometric features and support vector machines |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3289070/ https://www.ncbi.nlm.nih.gov/pubmed/22166014 http://dx.doi.org/10.1186/1477-5956-9-S1-S1 |
work_keys_str_mv | AT zhouweiqiang predictionofdnabindingproteinbasedonstatisticalandgeometricfeaturesandsupportvectormachines AT yanhong predictionofdnabindingproteinbasedonstatisticalandgeometricfeaturesandsupportvectormachines |