Cargando…
Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures
Identification of small molecule ligands that bind to proteins is a critical step in drug discovery. Computational methods have been developed to accelerate the prediction of protein-ligand binding, but often depend on 3D protein structures. As only a limited number of protein 3D structures have bee...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6211855/ https://www.ncbi.nlm.nih.gov/pubmed/29218866 |
_version_ | 1783367418162184192 |
---|---|
author | Greenside, Peyton Hillenmeyer, Maureen Kundaje, Anshul |
author_facet | Greenside, Peyton Hillenmeyer, Maureen Kundaje, Anshul |
author_sort | Greenside, Peyton |
collection | PubMed |
description | Identification of small molecule ligands that bind to proteins is a critical step in drug discovery. Computational methods have been developed to accelerate the prediction of protein-ligand binding, but often depend on 3D protein structures. As only a limited number of protein 3D structures have been resolved, the ability to predict protein-ligand interactions without relying on a 3D representation would be highly valuable. We use an interpretable confidence-rated boosting algorithm to predict protein-ligand interactions with high accuracy from ligand chemical substructures and protein 1D sequence motifs, without relying on 3D protein structures. We compare several protein motif definitions, assess generalization of our model’s predictions to unseen proteins and ligands, demonstrate recovery of well established interactions and identify globally predictive protein-ligand motif pairs. By bridging biological and chemical perspectives, we demonstrate that it is possible to predict protein-ligand interactions using only motif-based features and that interpretation of these features can reveal new insights into the molecular mechanics underlying each interaction. Our work also lays a foundation to explore more predictive feature sets and sophisticated machine learning approaches as well as other applications, such as predicting unintended interactions or the effects of mutations. |
format | Online Article Text |
id | pubmed-6211855 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
record_format | MEDLINE/PubMed |
spelling | pubmed-62118552018-11-01 Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures Greenside, Peyton Hillenmeyer, Maureen Kundaje, Anshul Pac Symp Biocomput Article Identification of small molecule ligands that bind to proteins is a critical step in drug discovery. Computational methods have been developed to accelerate the prediction of protein-ligand binding, but often depend on 3D protein structures. As only a limited number of protein 3D structures have been resolved, the ability to predict protein-ligand interactions without relying on a 3D representation would be highly valuable. We use an interpretable confidence-rated boosting algorithm to predict protein-ligand interactions with high accuracy from ligand chemical substructures and protein 1D sequence motifs, without relying on 3D protein structures. We compare several protein motif definitions, assess generalization of our model’s predictions to unseen proteins and ligands, demonstrate recovery of well established interactions and identify globally predictive protein-ligand motif pairs. By bridging biological and chemical perspectives, we demonstrate that it is possible to predict protein-ligand interactions using only motif-based features and that interpretation of these features can reveal new insights into the molecular mechanics underlying each interaction. Our work also lays a foundation to explore more predictive feature sets and sophisticated machine learning approaches as well as other applications, such as predicting unintended interactions or the effects of mutations. 2018 /pmc/articles/PMC6211855/ /pubmed/29218866 Text en http://creativecommons.org/licenses/by-nc/4.0/ Open Access chapter published by World Scientific Publishing Company and distributed under the terms of the Creative Commons Attribution Non-Commercial (CC BY-NC) 4.0 License. |
spellingShingle | Article Greenside, Peyton Hillenmeyer, Maureen Kundaje, Anshul Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures |
title | Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures |
title_full | Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures |
title_fullStr | Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures |
title_full_unstemmed | Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures |
title_short | Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures |
title_sort | prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6211855/ https://www.ncbi.nlm.nih.gov/pubmed/29218866 |
work_keys_str_mv | AT greensidepeyton predictionofproteinligandinteractionsfrompairedproteinsequencemotifsandligandsubstructures AT hillenmeyermaureen predictionofproteinligandinteractionsfrompairedproteinsequencemotifsandligandsubstructures AT kundajeanshul predictionofproteinligandinteractionsfrompairedproteinsequencemotifsandligandsubstructures |