Cargando…

Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures

Identification of small molecule ligands that bind to proteins is a critical step in drug discovery. Computational methods have been developed to accelerate the prediction of protein-ligand binding, but often depend on 3D protein structures. As only a limited number of protein 3D structures have bee...

Descripción completa

Detalles Bibliográficos
Autores principales: Greenside, Peyton, Hillenmeyer, Maureen, Kundaje, Anshul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6211855/
https://www.ncbi.nlm.nih.gov/pubmed/29218866
_version_ 1783367418162184192
author Greenside, Peyton
Hillenmeyer, Maureen
Kundaje, Anshul
author_facet Greenside, Peyton
Hillenmeyer, Maureen
Kundaje, Anshul
author_sort Greenside, Peyton
collection PubMed
description Identification of small molecule ligands that bind to proteins is a critical step in drug discovery. Computational methods have been developed to accelerate the prediction of protein-ligand binding, but often depend on 3D protein structures. As only a limited number of protein 3D structures have been resolved, the ability to predict protein-ligand interactions without relying on a 3D representation would be highly valuable. We use an interpretable confidence-rated boosting algorithm to predict protein-ligand interactions with high accuracy from ligand chemical substructures and protein 1D sequence motifs, without relying on 3D protein structures. We compare several protein motif definitions, assess generalization of our model’s predictions to unseen proteins and ligands, demonstrate recovery of well established interactions and identify globally predictive protein-ligand motif pairs. By bridging biological and chemical perspectives, we demonstrate that it is possible to predict protein-ligand interactions using only motif-based features and that interpretation of these features can reveal new insights into the molecular mechanics underlying each interaction. Our work also lays a foundation to explore more predictive feature sets and sophisticated machine learning approaches as well as other applications, such as predicting unintended interactions or the effects of mutations.
format Online
Article
Text
id pubmed-6211855
institution National Center for Biotechnology Information
language English
publishDate 2018
record_format MEDLINE/PubMed
spelling pubmed-62118552018-11-01 Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures Greenside, Peyton Hillenmeyer, Maureen Kundaje, Anshul Pac Symp Biocomput Article Identification of small molecule ligands that bind to proteins is a critical step in drug discovery. Computational methods have been developed to accelerate the prediction of protein-ligand binding, but often depend on 3D protein structures. As only a limited number of protein 3D structures have been resolved, the ability to predict protein-ligand interactions without relying on a 3D representation would be highly valuable. We use an interpretable confidence-rated boosting algorithm to predict protein-ligand interactions with high accuracy from ligand chemical substructures and protein 1D sequence motifs, without relying on 3D protein structures. We compare several protein motif definitions, assess generalization of our model’s predictions to unseen proteins and ligands, demonstrate recovery of well established interactions and identify globally predictive protein-ligand motif pairs. By bridging biological and chemical perspectives, we demonstrate that it is possible to predict protein-ligand interactions using only motif-based features and that interpretation of these features can reveal new insights into the molecular mechanics underlying each interaction. Our work also lays a foundation to explore more predictive feature sets and sophisticated machine learning approaches as well as other applications, such as predicting unintended interactions or the effects of mutations. 2018 /pmc/articles/PMC6211855/ /pubmed/29218866 Text en http://creativecommons.org/licenses/by-nc/4.0/ Open Access chapter published by World Scientific Publishing Company and distributed under the terms of the Creative Commons Attribution Non-Commercial (CC BY-NC) 4.0 License.
spellingShingle Article
Greenside, Peyton
Hillenmeyer, Maureen
Kundaje, Anshul
Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures
title Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures
title_full Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures
title_fullStr Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures
title_full_unstemmed Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures
title_short Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures
title_sort prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6211855/
https://www.ncbi.nlm.nih.gov/pubmed/29218866
work_keys_str_mv AT greensidepeyton predictionofproteinligandinteractionsfrompairedproteinsequencemotifsandligandsubstructures
AT hillenmeyermaureen predictionofproteinligandinteractionsfrompairedproteinsequencemotifsandligandsubstructures
AT kundajeanshul predictionofproteinligandinteractionsfrompairedproteinsequencemotifsandligandsubstructures