Cargando…
Identification of single-stranded and double-stranded dna binding proteins based on protein structure
BACKGROUND: Protein-DNA interactions are essential for many biological processes. However, the structural mechanisms underlying these interactions are not fully understood. DNA binding proteins can be classified into double-stranded DNA binding proteins (DSBs) and single-stranded DNA binding protein...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4243121/ https://www.ncbi.nlm.nih.gov/pubmed/25474071 http://dx.doi.org/10.1186/1471-2105-15-S12-S4 |
_version_ | 1782346063924953088 |
---|---|
author | Wang, Wei Liu, Juan Zhou, Xionghui |
author_facet | Wang, Wei Liu, Juan Zhou, Xionghui |
author_sort | Wang, Wei |
collection | PubMed |
description | BACKGROUND: Protein-DNA interactions are essential for many biological processes. However, the structural mechanisms underlying these interactions are not fully understood. DNA binding proteins can be classified into double-stranded DNA binding proteins (DSBs) and single-stranded DNA binding proteins (SSBs), and they take part in different biological functions. DSBs usually act as transcriptional factors to regulate the genes' expressions, while SSBs usually play roles in DNA replication, recombination, and repair, etc. Understanding the binding specificity of a DNA binding protein is helpful for the research of protein functions. RESULTS: In this paper, we investigated the differences between DSBs and SSBs on surface tunnels as well as the OB-fold domain information. We detected the largest clefts on the protein surfaces, to obtain several features to be used for distinguishing the potential interfaces between SSBs and DSBs, and compared its structure with each of the six OB-fold protein templates, and use the maximal alignment score TM-score as the OB-fold feature of the protein, based on which, we constructed the support vector machine (SVM) classification model to automatically distinguish these two kinds of proteins, with prediction accuracy of 87%,83% and 83% for HOLO-set, APO-set and Mixed-set respectively. CONCLUSIONS: We found that they have different ranges of tunnel lengths and tunnel curvatures; moreover, the alignment results with OB-fold templates have also found to be the discriminative feature of SSBs and DSBs. Experimental results on 10-fold cross validation indicate that the new feature set are effective to describe DNA binding proteins. The evaluation results on both bound (DNA-bound) and non-bound (DNA-free) proteins have shown the satisfactory performance of our method. |
format | Online Article Text |
id | pubmed-4243121 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42431212014-11-26 Identification of single-stranded and double-stranded dna binding proteins based on protein structure Wang, Wei Liu, Juan Zhou, Xionghui BMC Bioinformatics Research BACKGROUND: Protein-DNA interactions are essential for many biological processes. However, the structural mechanisms underlying these interactions are not fully understood. DNA binding proteins can be classified into double-stranded DNA binding proteins (DSBs) and single-stranded DNA binding proteins (SSBs), and they take part in different biological functions. DSBs usually act as transcriptional factors to regulate the genes' expressions, while SSBs usually play roles in DNA replication, recombination, and repair, etc. Understanding the binding specificity of a DNA binding protein is helpful for the research of protein functions. RESULTS: In this paper, we investigated the differences between DSBs and SSBs on surface tunnels as well as the OB-fold domain information. We detected the largest clefts on the protein surfaces, to obtain several features to be used for distinguishing the potential interfaces between SSBs and DSBs, and compared its structure with each of the six OB-fold protein templates, and use the maximal alignment score TM-score as the OB-fold feature of the protein, based on which, we constructed the support vector machine (SVM) classification model to automatically distinguish these two kinds of proteins, with prediction accuracy of 87%,83% and 83% for HOLO-set, APO-set and Mixed-set respectively. CONCLUSIONS: We found that they have different ranges of tunnel lengths and tunnel curvatures; moreover, the alignment results with OB-fold templates have also found to be the discriminative feature of SSBs and DSBs. Experimental results on 10-fold cross validation indicate that the new feature set are effective to describe DNA binding proteins. The evaluation results on both bound (DNA-bound) and non-bound (DNA-free) proteins have shown the satisfactory performance of our method. BioMed Central 2014-11-06 /pmc/articles/PMC4243121/ /pubmed/25474071 http://dx.doi.org/10.1186/1471-2105-15-S12-S4 Text en Copyright © 2014 Wang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Wang, Wei Liu, Juan Zhou, Xionghui Identification of single-stranded and double-stranded dna binding proteins based on protein structure |
title | Identification of single-stranded and double-stranded dna binding proteins based on protein structure |
title_full | Identification of single-stranded and double-stranded dna binding proteins based on protein structure |
title_fullStr | Identification of single-stranded and double-stranded dna binding proteins based on protein structure |
title_full_unstemmed | Identification of single-stranded and double-stranded dna binding proteins based on protein structure |
title_short | Identification of single-stranded and double-stranded dna binding proteins based on protein structure |
title_sort | identification of single-stranded and double-stranded dna binding proteins based on protein structure |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4243121/ https://www.ncbi.nlm.nih.gov/pubmed/25474071 http://dx.doi.org/10.1186/1471-2105-15-S12-S4 |
work_keys_str_mv | AT wangwei identificationofsinglestrandedanddoublestrandeddnabindingproteinsbasedonproteinstructure AT liujuan identificationofsinglestrandedanddoublestrandeddnabindingproteinsbasedonproteinstructure AT zhouxionghui identificationofsinglestrandedanddoublestrandeddnabindingproteinsbasedonproteinstructure |