Cargando…

Predicting DNA-binding locations and orientation on proteins using knowledge-based learning of geometric properties

BACKGROUND: DNA-binding proteins perform their functions through specific or non-specific sequence recognition. Although many sequence- or structure-based approaches have been proposed to identify DNA-binding residues on proteins or protein-binding sites on DNA sequences with satisfied performance,...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Chien-Chih, Chen, Chien-Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3289072/
https://www.ncbi.nlm.nih.gov/pubmed/22166082
http://dx.doi.org/10.1186/1477-5956-9-S1-S11
_version_ 1782224844784402432
author Wang, Chien-Chih
Chen, Chien-Yu
author_facet Wang, Chien-Chih
Chen, Chien-Yu
author_sort Wang, Chien-Chih
collection PubMed
description BACKGROUND: DNA-binding proteins perform their functions through specific or non-specific sequence recognition. Although many sequence- or structure-based approaches have been proposed to identify DNA-binding residues on proteins or protein-binding sites on DNA sequences with satisfied performance, it remains a challenging task to unveil the exact mechanism of protein-DNA interactions without crystal complex structures. Without information from complexes, the linkages between DNA-binding proteins and their binding sites on DNA are still missing. METHODS: While it is still difficult to acquire co-crystallized structures in an efficient way, this study proposes a knowledge-based learning method to effectively predict DNA orientation and base locations around the protein’s DNA-binding sites when given a protein structure. First, the functionally important residues of a query protein are predicted by a sequential pattern mining tool. After that, surface residues falling in the predicted functional regions are determined based on the given structure. These residues are then clustered based on their spatial coordinates and the resultant clusters are ranked by a proposed DNA-binding propensity function. Clusters with high DNA-binding propensities are treated as DNA-binding units (DBUs) and each DBU is analyzed by principal component analysis (PCA) to predict potential orientation of DNA grooves. More specifically, the proposed method is developed to predict the direction of the tangent line to the helix curve of the DNA groove where a DBU is going to bind. RESULTS: This paper proposes a knowledge-based learning procedure to determine the spatial location of the DNA groove with respect to the query protein structure by considering geometric propensity between protein side chains and DNA bases. The 11 test cases used in this study reveal that the location and orientation of the DNA groove around a selected DBU can be predicted with satisfied errors. CONCLUSIONS: This study presents a method to predict the location and orientation of DNA grooves with respect to the structure of a DNA-binding protein. The test cases shown in this study reveal the possibility of imaging protein-DNA binding conformation before co-crystallized structure can be determined. How the proposed method can be incorporated with existing protein-DNA docking tools to study protein-DNA interactions deserve further studies in the near future.
format Online
Article
Text
id pubmed-3289072
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32890722012-02-29 Predicting DNA-binding locations and orientation on proteins using knowledge-based learning of geometric properties Wang, Chien-Chih Chen, Chien-Yu Proteome Sci Proceedings BACKGROUND: DNA-binding proteins perform their functions through specific or non-specific sequence recognition. Although many sequence- or structure-based approaches have been proposed to identify DNA-binding residues on proteins or protein-binding sites on DNA sequences with satisfied performance, it remains a challenging task to unveil the exact mechanism of protein-DNA interactions without crystal complex structures. Without information from complexes, the linkages between DNA-binding proteins and their binding sites on DNA are still missing. METHODS: While it is still difficult to acquire co-crystallized structures in an efficient way, this study proposes a knowledge-based learning method to effectively predict DNA orientation and base locations around the protein’s DNA-binding sites when given a protein structure. First, the functionally important residues of a query protein are predicted by a sequential pattern mining tool. After that, surface residues falling in the predicted functional regions are determined based on the given structure. These residues are then clustered based on their spatial coordinates and the resultant clusters are ranked by a proposed DNA-binding propensity function. Clusters with high DNA-binding propensities are treated as DNA-binding units (DBUs) and each DBU is analyzed by principal component analysis (PCA) to predict potential orientation of DNA grooves. More specifically, the proposed method is developed to predict the direction of the tangent line to the helix curve of the DNA groove where a DBU is going to bind. RESULTS: This paper proposes a knowledge-based learning procedure to determine the spatial location of the DNA groove with respect to the query protein structure by considering geometric propensity between protein side chains and DNA bases. The 11 test cases used in this study reveal that the location and orientation of the DNA groove around a selected DBU can be predicted with satisfied errors. CONCLUSIONS: This study presents a method to predict the location and orientation of DNA grooves with respect to the structure of a DNA-binding protein. The test cases shown in this study reveal the possibility of imaging protein-DNA binding conformation before co-crystallized structure can be determined. How the proposed method can be incorporated with existing protein-DNA docking tools to study protein-DNA interactions deserve further studies in the near future. BioMed Central 2011-10-14 /pmc/articles/PMC3289072/ /pubmed/22166082 http://dx.doi.org/10.1186/1477-5956-9-S1-S11 Text en Copyright ©2011 Wang and Chen; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Wang, Chien-Chih
Chen, Chien-Yu
Predicting DNA-binding locations and orientation on proteins using knowledge-based learning of geometric properties
title Predicting DNA-binding locations and orientation on proteins using knowledge-based learning of geometric properties
title_full Predicting DNA-binding locations and orientation on proteins using knowledge-based learning of geometric properties
title_fullStr Predicting DNA-binding locations and orientation on proteins using knowledge-based learning of geometric properties
title_full_unstemmed Predicting DNA-binding locations and orientation on proteins using knowledge-based learning of geometric properties
title_short Predicting DNA-binding locations and orientation on proteins using knowledge-based learning of geometric properties
title_sort predicting dna-binding locations and orientation on proteins using knowledge-based learning of geometric properties
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3289072/
https://www.ncbi.nlm.nih.gov/pubmed/22166082
http://dx.doi.org/10.1186/1477-5956-9-S1-S11
work_keys_str_mv AT wangchienchih predictingdnabindinglocationsandorientationonproteinsusingknowledgebasedlearningofgeometricproperties
AT chenchienyu predictingdnabindinglocationsandorientationonproteinsusingknowledgebasedlearningofgeometricproperties