Cargando…

A graph kernel method for DNA-binding site prediction

BACKGROUND: Protein-DNA interactions play important roles in many biological processes. Computational methods that can accurately predict DNA-binding sites on proteins will greatly expedite research on problems involving protein-DNA interactions. RESULTS: This paper presents a method for predicting...

Descripción completa

Detalles Bibliográficos
Autores principales: Yan, Changhui, Wang, Yingfeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4290685/
https://www.ncbi.nlm.nih.gov/pubmed/25521807
http://dx.doi.org/10.1186/1752-0509-8-S4-S10
_version_ 1782352286599610368
author Yan, Changhui
Wang, Yingfeng
author_facet Yan, Changhui
Wang, Yingfeng
author_sort Yan, Changhui
collection PubMed
description BACKGROUND: Protein-DNA interactions play important roles in many biological processes. Computational methods that can accurately predict DNA-binding sites on proteins will greatly expedite research on problems involving protein-DNA interactions. RESULTS: This paper presents a method for predicting DNA-binding sites on protein structures. The method represents protein surface patches using labeled graphs and uses a graph kernel method to calculate the similarities between graphs. A new surface patch is predicted to be interface or non-interface patch based on its similarities to known DNA-binding patches and non-DNA-binding patches. The proposed method achieved high accuracy when tested on a representative set of 146 protein-DNA complexes using leave-one-out cross-validation. Then, the method was applied to identify DNA-binding sties on 13 unbound structures of DNA-binding proteins. In each of the unbound structure, the top 1 patch predicted by the proposed method precisely indicated the location of the DNA-binding site. Comparisons with other methods showed that the proposed method was competitive in predicting DNA-binding sites on unbound proteins. CONCLUSIONS: The proposed method uses graphs to encode the feature's distribution in the 3-dimensional (3D) space. Thus, compared with other vector-based methods, it has the advantage of taking into account the spatial distribution of features on the proteins. Using an efficient kernel method to compare graphs the proposed method also avoids the demanding computations required for 3D objects comparison. It provides a competitive method for predicting DNA-binding sites without requiring structure alignment.
format Online
Article
Text
id pubmed-4290685
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42906852015-01-15 A graph kernel method for DNA-binding site prediction Yan, Changhui Wang, Yingfeng BMC Syst Biol Research BACKGROUND: Protein-DNA interactions play important roles in many biological processes. Computational methods that can accurately predict DNA-binding sites on proteins will greatly expedite research on problems involving protein-DNA interactions. RESULTS: This paper presents a method for predicting DNA-binding sites on protein structures. The method represents protein surface patches using labeled graphs and uses a graph kernel method to calculate the similarities between graphs. A new surface patch is predicted to be interface or non-interface patch based on its similarities to known DNA-binding patches and non-DNA-binding patches. The proposed method achieved high accuracy when tested on a representative set of 146 protein-DNA complexes using leave-one-out cross-validation. Then, the method was applied to identify DNA-binding sties on 13 unbound structures of DNA-binding proteins. In each of the unbound structure, the top 1 patch predicted by the proposed method precisely indicated the location of the DNA-binding site. Comparisons with other methods showed that the proposed method was competitive in predicting DNA-binding sites on unbound proteins. CONCLUSIONS: The proposed method uses graphs to encode the feature's distribution in the 3-dimensional (3D) space. Thus, compared with other vector-based methods, it has the advantage of taking into account the spatial distribution of features on the proteins. Using an efficient kernel method to compare graphs the proposed method also avoids the demanding computations required for 3D objects comparison. It provides a competitive method for predicting DNA-binding sites without requiring structure alignment. BioMed Central 2014-12-08 /pmc/articles/PMC4290685/ /pubmed/25521807 http://dx.doi.org/10.1186/1752-0509-8-S4-S10 Text en Copyright © 2014 Yan Wang; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Yan, Changhui
Wang, Yingfeng
A graph kernel method for DNA-binding site prediction
title A graph kernel method for DNA-binding site prediction
title_full A graph kernel method for DNA-binding site prediction
title_fullStr A graph kernel method for DNA-binding site prediction
title_full_unstemmed A graph kernel method for DNA-binding site prediction
title_short A graph kernel method for DNA-binding site prediction
title_sort graph kernel method for dna-binding site prediction
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4290685/
https://www.ncbi.nlm.nih.gov/pubmed/25521807
http://dx.doi.org/10.1186/1752-0509-8-S4-S10
work_keys_str_mv AT yanchanghui agraphkernelmethodfordnabindingsiteprediction
AT wangyingfeng agraphkernelmethodfordnabindingsiteprediction
AT yanchanghui graphkernelmethodfordnabindingsiteprediction
AT wangyingfeng graphkernelmethodfordnabindingsiteprediction