Cargando…

Identification of binding pockets in protein structures using a knowledge-based potential derived from local structural similarities

BACKGROUND: The identification of ligand binding sites is a key task in the annotation of proteins with known structure but uncharacterized function. Here we describe a knowledge-based method exploiting the observation that unrelated binding sites share small structural motifs that bind the same che...

Descripción completa

Detalles Bibliográficos
Autores principales: Bianchi, Valerio, Gherardini, Pier Federico, Helmer-Citterich, Manuela, Ausiello, Gabriele
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3434446/
https://www.ncbi.nlm.nih.gov/pubmed/22536963
http://dx.doi.org/10.1186/1471-2105-13-S4-S17
_version_ 1782242447420555264
author Bianchi, Valerio
Gherardini, Pier Federico
Helmer-Citterich, Manuela
Ausiello, Gabriele
author_facet Bianchi, Valerio
Gherardini, Pier Federico
Helmer-Citterich, Manuela
Ausiello, Gabriele
author_sort Bianchi, Valerio
collection PubMed
description BACKGROUND: The identification of ligand binding sites is a key task in the annotation of proteins with known structure but uncharacterized function. Here we describe a knowledge-based method exploiting the observation that unrelated binding sites share small structural motifs that bind the same chemical fragments irrespective of the nature of the ligand as a whole. RESULTS: PDBinder compares a query protein against a library of binding and non-binding protein surface regions derived from the PDB. The results of the comparison are used to derive a propensity value for each residue which is correlated with the likelihood that the residue is part of a ligand binding site. The method was applied to two different problems: i) the prediction of ligand binding residues and ii) the identification of which surface cleft harbours the binding site. In both cases PDBinder performed consistently better than existing methods. PDBinder has been trained on a non-redundant set of 1356 high-quality protein-ligand complexes and tested on a set of 239 holo and apo complex pairs. We obtained an MCC of 0.313 on the holo set with a PPV of 0.413 while on the apo set we achieved an MCC of 0.271 and a PPV of 0.372. CONCLUSIONS: We show that PDBinder performs better than existing methods. The good performance on the unbound proteins is extremely important for real-world applications where the location of the binding site is unknown. Moreover, since our approach is orthogonal to those used in other programs, the PDBinder propensity value can be integrated in other algorithms further increasing the final performance.
format Online
Article
Text
id pubmed-3434446
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34344462012-09-06 Identification of binding pockets in protein structures using a knowledge-based potential derived from local structural similarities Bianchi, Valerio Gherardini, Pier Federico Helmer-Citterich, Manuela Ausiello, Gabriele BMC Bioinformatics Research BACKGROUND: The identification of ligand binding sites is a key task in the annotation of proteins with known structure but uncharacterized function. Here we describe a knowledge-based method exploiting the observation that unrelated binding sites share small structural motifs that bind the same chemical fragments irrespective of the nature of the ligand as a whole. RESULTS: PDBinder compares a query protein against a library of binding and non-binding protein surface regions derived from the PDB. The results of the comparison are used to derive a propensity value for each residue which is correlated with the likelihood that the residue is part of a ligand binding site. The method was applied to two different problems: i) the prediction of ligand binding residues and ii) the identification of which surface cleft harbours the binding site. In both cases PDBinder performed consistently better than existing methods. PDBinder has been trained on a non-redundant set of 1356 high-quality protein-ligand complexes and tested on a set of 239 holo and apo complex pairs. We obtained an MCC of 0.313 on the holo set with a PPV of 0.413 while on the apo set we achieved an MCC of 0.271 and a PPV of 0.372. CONCLUSIONS: We show that PDBinder performs better than existing methods. The good performance on the unbound proteins is extremely important for real-world applications where the location of the binding site is unknown. Moreover, since our approach is orthogonal to those used in other programs, the PDBinder propensity value can be integrated in other algorithms further increasing the final performance. BioMed Central 2012-03-28 /pmc/articles/PMC3434446/ /pubmed/22536963 http://dx.doi.org/10.1186/1471-2105-13-S4-S17 Text en Copyright ©2012 Bianchi et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Bianchi, Valerio
Gherardini, Pier Federico
Helmer-Citterich, Manuela
Ausiello, Gabriele
Identification of binding pockets in protein structures using a knowledge-based potential derived from local structural similarities
title Identification of binding pockets in protein structures using a knowledge-based potential derived from local structural similarities
title_full Identification of binding pockets in protein structures using a knowledge-based potential derived from local structural similarities
title_fullStr Identification of binding pockets in protein structures using a knowledge-based potential derived from local structural similarities
title_full_unstemmed Identification of binding pockets in protein structures using a knowledge-based potential derived from local structural similarities
title_short Identification of binding pockets in protein structures using a knowledge-based potential derived from local structural similarities
title_sort identification of binding pockets in protein structures using a knowledge-based potential derived from local structural similarities
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3434446/
https://www.ncbi.nlm.nih.gov/pubmed/22536963
http://dx.doi.org/10.1186/1471-2105-13-S4-S17
work_keys_str_mv AT bianchivalerio identificationofbindingpocketsinproteinstructuresusingaknowledgebasedpotentialderivedfromlocalstructuralsimilarities
AT gherardinipierfederico identificationofbindingpocketsinproteinstructuresusingaknowledgebasedpotentialderivedfromlocalstructuralsimilarities
AT helmercitterichmanuela identificationofbindingpocketsinproteinstructuresusingaknowledgebasedpotentialderivedfromlocalstructuralsimilarities
AT ausiellogabriele identificationofbindingpocketsinproteinstructuresusingaknowledgebasedpotentialderivedfromlocalstructuralsimilarities