Cargando…

Predicting small ligand binding sites in proteins using backbone structure

Motivation: Specific non-covalent binding of metal ions and ligands, such as nucleotides and cofactors, is essential for the function of many proteins. Computational methods are useful for predicting the location of such binding sites when experimental information is lacking. Methods that use struct...

Descripción completa

Detalles Bibliográficos
Autor principal: Bordner, Andrew J.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2639300/
https://www.ncbi.nlm.nih.gov/pubmed/18940825
http://dx.doi.org/10.1093/bioinformatics/btn543
_version_ 1782164451133227008
author Bordner, Andrew J.
author_facet Bordner, Andrew J.
author_sort Bordner, Andrew J.
collection PubMed
description Motivation: Specific non-covalent binding of metal ions and ligands, such as nucleotides and cofactors, is essential for the function of many proteins. Computational methods are useful for predicting the location of such binding sites when experimental information is lacking. Methods that use structural information, when available, are particularly promising since they can potentially identify non-contiguous binding motifs that cannot be found using only the amino acid sequence. Furthermore, a prediction method that can utilize low-resolution models is advantageous because high-resolution structures are available for only a relatively small fraction of proteins. Results: SitePredict is a machine learning-based method for predicting binding sites in protein structures for specific metal ions or small molecules. The method uses Random Forest classifiers trained on diverse residue-based site properties including spatial clustering of residue types and evolutionary conservation. SitePredict was tested by cross-validation on a set of known binding sites for six different metal ions and five different small molecules in a non-redundant set of protein–ligand complex structures. The prediction performance was good for all ligands considered, as reflected by AUC values of at least 0.8. Furthermore, a more realistic test on unbound structures showed only a slight decrease in the accuracy. The properties that contribute the most to the prediction accuracy of each ligand were also examined. Finally, examples of predicted binding sites in homology models and uncharacterized proteins are discussed. Availability: Binding site prediction results for all PDB protein structures and human protein homology models are available at http://sitepredict.org/. Contact: bordner.andrew@mayo.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-2639300
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-26393002009-02-25 Predicting small ligand binding sites in proteins using backbone structure Bordner, Andrew J. Bioinformatics Original Papers Motivation: Specific non-covalent binding of metal ions and ligands, such as nucleotides and cofactors, is essential for the function of many proteins. Computational methods are useful for predicting the location of such binding sites when experimental information is lacking. Methods that use structural information, when available, are particularly promising since they can potentially identify non-contiguous binding motifs that cannot be found using only the amino acid sequence. Furthermore, a prediction method that can utilize low-resolution models is advantageous because high-resolution structures are available for only a relatively small fraction of proteins. Results: SitePredict is a machine learning-based method for predicting binding sites in protein structures for specific metal ions or small molecules. The method uses Random Forest classifiers trained on diverse residue-based site properties including spatial clustering of residue types and evolutionary conservation. SitePredict was tested by cross-validation on a set of known binding sites for six different metal ions and five different small molecules in a non-redundant set of protein–ligand complex structures. The prediction performance was good for all ligands considered, as reflected by AUC values of at least 0.8. Furthermore, a more realistic test on unbound structures showed only a slight decrease in the accuracy. The properties that contribute the most to the prediction accuracy of each ligand were also examined. Finally, examples of predicted binding sites in homology models and uncharacterized proteins are discussed. Availability: Binding site prediction results for all PDB protein structures and human protein homology models are available at http://sitepredict.org/. Contact: bordner.andrew@mayo.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2008-12-15 2008-10-21 /pmc/articles/PMC2639300/ /pubmed/18940825 http://dx.doi.org/10.1093/bioinformatics/btn543 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Bordner, Andrew J.
Predicting small ligand binding sites in proteins using backbone structure
title Predicting small ligand binding sites in proteins using backbone structure
title_full Predicting small ligand binding sites in proteins using backbone structure
title_fullStr Predicting small ligand binding sites in proteins using backbone structure
title_full_unstemmed Predicting small ligand binding sites in proteins using backbone structure
title_short Predicting small ligand binding sites in proteins using backbone structure
title_sort predicting small ligand binding sites in proteins using backbone structure
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2639300/
https://www.ncbi.nlm.nih.gov/pubmed/18940825
http://dx.doi.org/10.1093/bioinformatics/btn543
work_keys_str_mv AT bordnerandrewj predictingsmallligandbindingsitesinproteinsusingbackbonestructure