Identification of single-stranded and double-stranded DNA binding proteins based on protein structure

BACKGROUND: Protein-DNA interactions are essential for many biological processes. However, the structural mechanisms underlying these interactions are not fully understood. DNA binding proteins can be classified into double-stranded DNA binding proteins (DSBs) and single-stranded DNA binding protein...

Bibliographic Details
Main authors: Wang, Wei, Liu, Juan, Zhou, Xionghui
Format: Online Article Text
Language: English
Published: BioMed Central 2014
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4243121/
https://www.ncbi.nlm.nih.gov/pubmed/25474071
http://dx.doi.org/10.1186/1471-2105-15-S12-S4
_version_ 1782346063924953088
author Wang, Wei
Liu, Juan
Zhou, Xionghui
author_facet Wang, Wei
Liu, Juan
Zhou, Xionghui
author_sort Wang, Wei
collection PubMed
description BACKGROUND: Protein-DNA interactions are essential for many biological processes, but the structural mechanisms underlying them are not fully understood. DNA-binding proteins can be classified into double-stranded DNA-binding proteins (DSBs) and single-stranded DNA-binding proteins (SSBs), which take part in different biological functions. DSBs usually act as transcription factors that regulate gene expression, while SSBs usually play roles in DNA replication, recombination, and repair. Understanding the binding specificity of a DNA-binding protein is helpful for research on protein function. RESULTS: In this paper, we investigated the differences between DSBs and SSBs in surface tunnels and in OB-fold domain information. We detected the largest clefts on the protein surfaces to obtain several features for distinguishing the potential interfaces of SSBs and DSBs. We also compared each protein's structure with each of six OB-fold protein templates and used the maximal alignment score (TM-score) as the OB-fold feature of the protein. Based on these features, we constructed a support vector machine (SVM) classification model to automatically distinguish the two kinds of proteins, with prediction accuracies of 87%, 83%, and 83% for the HOLO-set, APO-set, and Mixed-set, respectively. CONCLUSIONS: We found that SSBs and DSBs have different ranges of tunnel lengths and tunnel curvatures; moreover, the alignment results with OB-fold templates were also found to be a discriminative feature of SSBs and DSBs. Experimental results from 10-fold cross-validation indicate that the new feature set is effective for describing DNA-binding proteins. The evaluation results on both bound (DNA-bound) and non-bound (DNA-free) proteins show satisfactory performance of our method.
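The pipeline outlined in the abstract (tunnel features plus a maximal TM-score over the OB-fold templates, evaluated by 10-fold cross-validation) might be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the three-value feature layout, and the toy numbers are all hypothetical, and a simple nearest-centroid classifier stands in for the SVM used in the paper so that the example needs only the Python standard library.

```python
from typing import Callable, Dict, List, Sequence

Vector = List[float]
Predictor = Callable[[Sequence[float]], str]


def ob_fold_feature(tm_scores: Dict[str, float]) -> float:
    """Return the maximal TM-score across the OB-fold templates."""
    if not tm_scores:
        raise ValueError("at least one template score is required")
    return max(tm_scores.values())


def build_features(tunnel_length: float, tunnel_curvature: float,
                   tm_scores: Dict[str, float]) -> Vector:
    """Assemble a hypothetical feature vector: [length, curvature, OB-fold score]."""
    return [tunnel_length, tunnel_curvature, ob_fold_feature(tm_scores)]


def k_fold_indices(n: int, k: int = 10) -> List[List[int]]:
    """Split the indices 0..n-1 into k folds by round-robin assignment."""
    if not 2 <= k <= n:
        raise ValueError("k must satisfy 2 <= k <= n")
    return [list(range(i, n, k)) for i in range(k)]


def fit_nearest_centroid(X: Sequence[Vector], y: Sequence[str]) -> Predictor:
    """Stand-in classifier (NOT an SVM): predict the label of the closest class mean."""
    sums: Dict[str, Vector] = {}
    counts: Dict[str, int] = {}
    for x, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(x))
        for j, value in enumerate(x):
            acc[j] += value
        counts[label] = counts.get(label, 0) + 1
    centroids = {lab: [v / counts[lab] for v in acc] for lab, acc in sums.items()}

    def predict(x: Sequence[float]) -> str:
        return min(centroids,
                   key=lambda lab: sum((a - b) ** 2 for a, b in zip(x, centroids[lab])))

    return predict


def cross_val_accuracy(X: Sequence[Vector], y: Sequence[str],
                       fit: Callable[[Sequence[Vector], Sequence[str]], Predictor],
                       k: int = 10) -> float:
    """Train on k-1 folds, test on the held-out fold, and pool the accuracy."""
    correct = 0
    for test_idx in k_fold_indices(len(X), k):
        held_out = set(test_idx)
        train_X = [x for i, x in enumerate(X) if i not in held_out]
        train_y = [t for i, t in enumerate(y) if i not in held_out]
        predict = fit(train_X, train_y)
        correct += sum(predict(X[i]) == y[i] for i in test_idx)
    return correct / len(X)


# Toy, made-up data: ten "SSB" and ten "DSB" feature vectors.
toy_X = [build_features(10 + 0.1 * i, 0.5, {"t1": 0.4, "t2": 0.7}) for i in range(10)]
toy_X += [build_features(30 + 0.1 * i, 0.1, {"t1": 0.3, "t2": 0.2}) for i in range(10)]
toy_y = ["SSB"] * 10 + ["DSB"] * 10
toy_accuracy = cross_val_accuracy(toy_X, toy_y, fit_nearest_centroid, k=10)
```

To follow the abstract more closely, the `fit` argument could be replaced by a wrapper around an SVM implementation; the cross-validation harness itself does not depend on which classifier is used.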
format Online
Article
Text
id pubmed-4243121
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4243121 2014-11-26 Identification of single-stranded and double-stranded DNA binding proteins based on protein structure Wang, Wei Liu, Juan Zhou, Xionghui BMC Bioinformatics Research BioMed Central 2014-11-06 /pmc/articles/PMC4243121/ /pubmed/25474071 http://dx.doi.org/10.1186/1471-2105-15-S12-S4 Text en Copyright © 2014 Wang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Wang, Wei
Liu, Juan
Zhou, Xionghui
Identification of single-stranded and double-stranded DNA binding proteins based on protein structure
title Identification of single-stranded and double-stranded DNA binding proteins based on protein structure
title_full Identification of single-stranded and double-stranded DNA binding proteins based on protein structure
title_fullStr Identification of single-stranded and double-stranded DNA binding proteins based on protein structure
title_full_unstemmed Identification of single-stranded and double-stranded DNA binding proteins based on protein structure
title_short Identification of single-stranded and double-stranded DNA binding proteins based on protein structure
title_sort identification of single-stranded and double-stranded dna binding proteins based on protein structure
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4243121/
https://www.ncbi.nlm.nih.gov/pubmed/25474071
http://dx.doi.org/10.1186/1471-2105-15-S12-S4
work_keys_str_mv AT wangwei identificationofsinglestrandedanddoublestrandeddnabindingproteinsbasedonproteinstructure
AT liujuan identificationofsinglestrandedanddoublestrandeddnabindingproteinsbasedonproteinstructure
AT zhouxionghui identificationofsinglestrandedanddoublestrandeddnabindingproteinsbasedonproteinstructure