Cargando…

A reinforcement learning approach for protein–ligand binding pose prediction

Protein ligand docking is an indispensable tool for computational prediction of protein functions and screening drug candidates. Despite significant progress over the past two decades, it is still a challenging problem, characterized by the still limited understanding of the energetics between prote...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Chenran, Chen, Yang, Zhang, Yuan, Li, Keqiao, Lin, Menghan, Pan, Feng, Wu, Wei, Zhang, Jinfeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9454149/
https://www.ncbi.nlm.nih.gov/pubmed/36076158
http://dx.doi.org/10.1186/s12859-022-04912-7
_version_ 1784785289405792256
author Wang, Chenran
Chen, Yang
Zhang, Yuan
Li, Keqiao
Lin, Menghan
Pan, Feng
Wu, Wei
Zhang, Jinfeng
author_facet Wang, Chenran
Chen, Yang
Zhang, Yuan
Li, Keqiao
Lin, Menghan
Pan, Feng
Wu, Wei
Zhang, Jinfeng
author_sort Wang, Chenran
collection PubMed
description Protein ligand docking is an indispensable tool for computational prediction of protein functions and screening drug candidates. Despite significant progress over the past two decades, it is still a challenging problem, characterized by the still limited understanding of the energetics between proteins and ligands, and the vast conformational space that has to be searched to find a satisfactory solution. In this project, we developed a novel reinforcement learning (RL) approach, the asynchronous advantage actor-critic model (A3C), to address the protein ligand docking problem. The overall framework consists of two models. During the search process, the agent takes an action selected by the actor model based on the current location. The critic model then evaluates this action and predict the distance between the current location and true binding site. Experimental results showed that in both single- and multi-atom cases, our model improves binding site prediction substantially compared to a naïve model. For the single-atom ligand, copper ion (Cu(2+)), the model predicted binding sites have a median root-mean-square-deviation (RMSD) of 2.39 Å to the true binding sites when starting from random starting locations. For the multi-atom ligand, sulfate ion (SO(4)(2−)), the predicted binding sites have a median RMSD of 3.82 Å to the true binding sites. The ligand-specific models built in this study can be used in solvent mapping studies and the RL framework can be readily scaled up to larger and more diverse sets of ligands.
format Online
Article
Text
id pubmed-9454149
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-94541492022-09-09 A reinforcement learning approach for protein–ligand binding pose prediction Wang, Chenran Chen, Yang Zhang, Yuan Li, Keqiao Lin, Menghan Pan, Feng Wu, Wei Zhang, Jinfeng BMC Bioinformatics Research Protein ligand docking is an indispensable tool for computational prediction of protein functions and screening drug candidates. Despite significant progress over the past two decades, it is still a challenging problem, characterized by the still limited understanding of the energetics between proteins and ligands, and the vast conformational space that has to be searched to find a satisfactory solution. In this project, we developed a novel reinforcement learning (RL) approach, the asynchronous advantage actor-critic model (A3C), to address the protein ligand docking problem. The overall framework consists of two models. During the search process, the agent takes an action selected by the actor model based on the current location. The critic model then evaluates this action and predict the distance between the current location and true binding site. Experimental results showed that in both single- and multi-atom cases, our model improves binding site prediction substantially compared to a naïve model. For the single-atom ligand, copper ion (Cu(2+)), the model predicted binding sites have a median root-mean-square-deviation (RMSD) of 2.39 Å to the true binding sites when starting from random starting locations. For the multi-atom ligand, sulfate ion (SO(4)(2−)), the predicted binding sites have a median RMSD of 3.82 Å to the true binding sites. The ligand-specific models built in this study can be used in solvent mapping studies and the RL framework can be readily scaled up to larger and more diverse sets of ligands. BioMed Central 2022-09-08 /pmc/articles/PMC9454149/ /pubmed/36076158 http://dx.doi.org/10.1186/s12859-022-04912-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Wang, Chenran
Chen, Yang
Zhang, Yuan
Li, Keqiao
Lin, Menghan
Pan, Feng
Wu, Wei
Zhang, Jinfeng
A reinforcement learning approach for protein–ligand binding pose prediction
title A reinforcement learning approach for protein–ligand binding pose prediction
title_full A reinforcement learning approach for protein–ligand binding pose prediction
title_fullStr A reinforcement learning approach for protein–ligand binding pose prediction
title_full_unstemmed A reinforcement learning approach for protein–ligand binding pose prediction
title_short A reinforcement learning approach for protein–ligand binding pose prediction
title_sort reinforcement learning approach for protein–ligand binding pose prediction
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9454149/
https://www.ncbi.nlm.nih.gov/pubmed/36076158
http://dx.doi.org/10.1186/s12859-022-04912-7
work_keys_str_mv AT wangchenran areinforcementlearningapproachforproteinligandbindingposeprediction
AT chenyang areinforcementlearningapproachforproteinligandbindingposeprediction
AT zhangyuan areinforcementlearningapproachforproteinligandbindingposeprediction
AT likeqiao areinforcementlearningapproachforproteinligandbindingposeprediction
AT linmenghan areinforcementlearningapproachforproteinligandbindingposeprediction
AT panfeng areinforcementlearningapproachforproteinligandbindingposeprediction
AT wuwei areinforcementlearningapproachforproteinligandbindingposeprediction
AT zhangjinfeng areinforcementlearningapproachforproteinligandbindingposeprediction
AT wangchenran reinforcementlearningapproachforproteinligandbindingposeprediction
AT chenyang reinforcementlearningapproachforproteinligandbindingposeprediction
AT zhangyuan reinforcementlearningapproachforproteinligandbindingposeprediction
AT likeqiao reinforcementlearningapproachforproteinligandbindingposeprediction
AT linmenghan reinforcementlearningapproachforproteinligandbindingposeprediction
AT panfeng reinforcementlearningapproachforproteinligandbindingposeprediction
AT wuwei reinforcementlearningapproachforproteinligandbindingposeprediction
AT zhangjinfeng reinforcementlearningapproachforproteinligandbindingposeprediction