Cargando…

Probabilistic Prediction of Contacts in Protein-Ligand Complexes

We introduce a statistical method for evaluating atomic level 3D interaction patterns of protein-ligand contacts. Such patterns can be used for fast separation of likely ligand and ligand binding site combinations out of all those that are geometrically possible. The practical purpose of this probab...

Descripción completa

Detalles Bibliográficos
Autores principales: Hakulinen, Riku, Puranen, Santeri, Lehtonen, Jukka V., Johnson, Mark S., Corander, Jukka
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3498326/
https://www.ncbi.nlm.nih.gov/pubmed/23155467
http://dx.doi.org/10.1371/journal.pone.0049216
_version_ 1782249826089435136
author Hakulinen, Riku
Puranen, Santeri
Lehtonen, Jukka V.
Johnson, Mark S.
Corander, Jukka
author_facet Hakulinen, Riku
Puranen, Santeri
Lehtonen, Jukka V.
Johnson, Mark S.
Corander, Jukka
author_sort Hakulinen, Riku
collection PubMed
description We introduce a statistical method for evaluating atomic level 3D interaction patterns of protein-ligand contacts. Such patterns can be used for fast separation of likely ligand and ligand binding site combinations out of all those that are geometrically possible. The practical purpose of this probabilistic method is for molecular docking and scoring, as an essential part of a scoring function. Probabilities of interaction patterns are calculated conditional on structural x-ray data and predefined chemical classification of molecular fragment types. Spatial coordinates of atoms are modeled using a Bayesian statistical framework with parametric 3D probability densities. The parameters are given distributions a priori, which provides the possibility to update the densities of model parameters with new structural data and use the parameter estimates to create a contact hierarchy. The contact preferences can be defined for any spatial area around a specified type of fragment. We compared calculated contact point hierarchies with the number of contact atoms found near the contact point in a reference set of x-ray data, and found that these were in general in a close agreement. Additionally, using substrate binding site in cathechol-O-methyltransferase and 27 small potential binder molecules, it was demonstrated that these probabilities together with auxiliary parameters separate well ligands from decoys (true positive rate 0.75, false positive rate 0). A particularly useful feature of the proposed Bayesian framework is that it also characterizes predictive uncertainty in terms of probabilities, which have an intuitive interpretation from the applied perspective.
format Online
Article
Text
id pubmed-3498326
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-34983262012-11-15 Probabilistic Prediction of Contacts in Protein-Ligand Complexes Hakulinen, Riku Puranen, Santeri Lehtonen, Jukka V. Johnson, Mark S. Corander, Jukka PLoS One Research Article We introduce a statistical method for evaluating atomic level 3D interaction patterns of protein-ligand contacts. Such patterns can be used for fast separation of likely ligand and ligand binding site combinations out of all those that are geometrically possible. The practical purpose of this probabilistic method is for molecular docking and scoring, as an essential part of a scoring function. Probabilities of interaction patterns are calculated conditional on structural x-ray data and predefined chemical classification of molecular fragment types. Spatial coordinates of atoms are modeled using a Bayesian statistical framework with parametric 3D probability densities. The parameters are given distributions a priori, which provides the possibility to update the densities of model parameters with new structural data and use the parameter estimates to create a contact hierarchy. The contact preferences can be defined for any spatial area around a specified type of fragment. We compared calculated contact point hierarchies with the number of contact atoms found near the contact point in a reference set of x-ray data, and found that these were in general in a close agreement. Additionally, using substrate binding site in cathechol-O-methyltransferase and 27 small potential binder molecules, it was demonstrated that these probabilities together with auxiliary parameters separate well ligands from decoys (true positive rate 0.75, false positive rate 0). A particularly useful feature of the proposed Bayesian framework is that it also characterizes predictive uncertainty in terms of probabilities, which have an intuitive interpretation from the applied perspective. Public Library of Science 2012-11-14 /pmc/articles/PMC3498326/ /pubmed/23155467 http://dx.doi.org/10.1371/journal.pone.0049216 Text en © 2012 Hakulinen et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Hakulinen, Riku
Puranen, Santeri
Lehtonen, Jukka V.
Johnson, Mark S.
Corander, Jukka
Probabilistic Prediction of Contacts in Protein-Ligand Complexes
title Probabilistic Prediction of Contacts in Protein-Ligand Complexes
title_full Probabilistic Prediction of Contacts in Protein-Ligand Complexes
title_fullStr Probabilistic Prediction of Contacts in Protein-Ligand Complexes
title_full_unstemmed Probabilistic Prediction of Contacts in Protein-Ligand Complexes
title_short Probabilistic Prediction of Contacts in Protein-Ligand Complexes
title_sort probabilistic prediction of contacts in protein-ligand complexes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3498326/
https://www.ncbi.nlm.nih.gov/pubmed/23155467
http://dx.doi.org/10.1371/journal.pone.0049216
work_keys_str_mv AT hakulinenriku probabilisticpredictionofcontactsinproteinligandcomplexes
AT puranensanteri probabilisticpredictionofcontactsinproteinligandcomplexes
AT lehtonenjukkav probabilisticpredictionofcontactsinproteinligandcomplexes
AT johnsonmarks probabilisticpredictionofcontactsinproteinligandcomplexes
AT coranderjukka probabilisticpredictionofcontactsinproteinligandcomplexes