Cargando…
Predicting DNA-binding locations and orientation on proteins using knowledge-based learning of geometric properties
BACKGROUND: DNA-binding proteins perform their functions through specific or non-specific sequence recognition. Although many sequence- or structure-based approaches have been proposed to identify DNA-binding residues on proteins or protein-binding sites on DNA sequences with satisfied performance,...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3289072/ https://www.ncbi.nlm.nih.gov/pubmed/22166082 http://dx.doi.org/10.1186/1477-5956-9-S1-S11 |
_version_ | 1782224844784402432 |
---|---|
author | Wang, Chien-Chih Chen, Chien-Yu |
author_facet | Wang, Chien-Chih Chen, Chien-Yu |
author_sort | Wang, Chien-Chih |
collection | PubMed |
description | BACKGROUND: DNA-binding proteins perform their functions through specific or non-specific sequence recognition. Although many sequence- or structure-based approaches have been proposed to identify DNA-binding residues on proteins or protein-binding sites on DNA sequences with satisfied performance, it remains a challenging task to unveil the exact mechanism of protein-DNA interactions without crystal complex structures. Without information from complexes, the linkages between DNA-binding proteins and their binding sites on DNA are still missing. METHODS: While it is still difficult to acquire co-crystallized structures in an efficient way, this study proposes a knowledge-based learning method to effectively predict DNA orientation and base locations around the protein’s DNA-binding sites when given a protein structure. First, the functionally important residues of a query protein are predicted by a sequential pattern mining tool. After that, surface residues falling in the predicted functional regions are determined based on the given structure. These residues are then clustered based on their spatial coordinates and the resultant clusters are ranked by a proposed DNA-binding propensity function. Clusters with high DNA-binding propensities are treated as DNA-binding units (DBUs) and each DBU is analyzed by principal component analysis (PCA) to predict potential orientation of DNA grooves. More specifically, the proposed method is developed to predict the direction of the tangent line to the helix curve of the DNA groove where a DBU is going to bind. RESULTS: This paper proposes a knowledge-based learning procedure to determine the spatial location of the DNA groove with respect to the query protein structure by considering geometric propensity between protein side chains and DNA bases. The 11 test cases used in this study reveal that the location and orientation of the DNA groove around a selected DBU can be predicted with satisfied errors. CONCLUSIONS: This study presents a method to predict the location and orientation of DNA grooves with respect to the structure of a DNA-binding protein. The test cases shown in this study reveal the possibility of imaging protein-DNA binding conformation before co-crystallized structure can be determined. How the proposed method can be incorporated with existing protein-DNA docking tools to study protein-DNA interactions deserve further studies in the near future. |
format | Online Article Text |
id | pubmed-3289072 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32890722012-02-29 Predicting DNA-binding locations and orientation on proteins using knowledge-based learning of geometric properties Wang, Chien-Chih Chen, Chien-Yu Proteome Sci Proceedings BACKGROUND: DNA-binding proteins perform their functions through specific or non-specific sequence recognition. Although many sequence- or structure-based approaches have been proposed to identify DNA-binding residues on proteins or protein-binding sites on DNA sequences with satisfied performance, it remains a challenging task to unveil the exact mechanism of protein-DNA interactions without crystal complex structures. Without information from complexes, the linkages between DNA-binding proteins and their binding sites on DNA are still missing. METHODS: While it is still difficult to acquire co-crystallized structures in an efficient way, this study proposes a knowledge-based learning method to effectively predict DNA orientation and base locations around the protein’s DNA-binding sites when given a protein structure. First, the functionally important residues of a query protein are predicted by a sequential pattern mining tool. After that, surface residues falling in the predicted functional regions are determined based on the given structure. These residues are then clustered based on their spatial coordinates and the resultant clusters are ranked by a proposed DNA-binding propensity function. Clusters with high DNA-binding propensities are treated as DNA-binding units (DBUs) and each DBU is analyzed by principal component analysis (PCA) to predict potential orientation of DNA grooves. More specifically, the proposed method is developed to predict the direction of the tangent line to the helix curve of the DNA groove where a DBU is going to bind. RESULTS: This paper proposes a knowledge-based learning procedure to determine the spatial location of the DNA groove with respect to the query protein structure by considering geometric propensity between protein side chains and DNA bases. The 11 test cases used in this study reveal that the location and orientation of the DNA groove around a selected DBU can be predicted with satisfied errors. CONCLUSIONS: This study presents a method to predict the location and orientation of DNA grooves with respect to the structure of a DNA-binding protein. The test cases shown in this study reveal the possibility of imaging protein-DNA binding conformation before co-crystallized structure can be determined. How the proposed method can be incorporated with existing protein-DNA docking tools to study protein-DNA interactions deserve further studies in the near future. BioMed Central 2011-10-14 /pmc/articles/PMC3289072/ /pubmed/22166082 http://dx.doi.org/10.1186/1477-5956-9-S1-S11 Text en Copyright ©2011 Wang and Chen; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Wang, Chien-Chih Chen, Chien-Yu Predicting DNA-binding locations and orientation on proteins using knowledge-based learning of geometric properties |
title | Predicting DNA-binding locations and orientation on proteins using knowledge-based learning of geometric properties |
title_full | Predicting DNA-binding locations and orientation on proteins using knowledge-based learning of geometric properties |
title_fullStr | Predicting DNA-binding locations and orientation on proteins using knowledge-based learning of geometric properties |
title_full_unstemmed | Predicting DNA-binding locations and orientation on proteins using knowledge-based learning of geometric properties |
title_short | Predicting DNA-binding locations and orientation on proteins using knowledge-based learning of geometric properties |
title_sort | predicting dna-binding locations and orientation on proteins using knowledge-based learning of geometric properties |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3289072/ https://www.ncbi.nlm.nih.gov/pubmed/22166082 http://dx.doi.org/10.1186/1477-5956-9-S1-S11 |
work_keys_str_mv | AT wangchienchih predictingdnabindinglocationsandorientationonproteinsusingknowledgebasedlearningofgeometricproperties AT chenchienyu predictingdnabindinglocationsandorientationonproteinsusingknowledgebasedlearningofgeometricproperties |