Cargando…

A generalized protein–ligand scoring framework with balanced scoring, docking, ranking and screening powers

Applying machine learning algorithms to protein–ligand scoring functions has aroused widespread attention in recent years due to the high predictive accuracy and affordable computational cost. Nevertheless, most machine learning-based scoring functions are only applicable to a specific task, e.g., b...

Descripción completa

Detalles Bibliográficos
Autores principales: Shen, Chao, Zhang, Xujun, Hsieh, Chang-Yu, Deng, Yafeng, Wang, Dong, Xu, Lei, Wu, Jian, Li, Dan, Kang, Yu, Hou, Tingjun, Pan, Peichen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10395315/
https://www.ncbi.nlm.nih.gov/pubmed/37538816
http://dx.doi.org/10.1039/d3sc02044d
_version_ 1785083556559585280
author Shen, Chao
Zhang, Xujun
Hsieh, Chang-Yu
Deng, Yafeng
Wang, Dong
Xu, Lei
Wu, Jian
Li, Dan
Kang, Yu
Hou, Tingjun
Pan, Peichen
author_facet Shen, Chao
Zhang, Xujun
Hsieh, Chang-Yu
Deng, Yafeng
Wang, Dong
Xu, Lei
Wu, Jian
Li, Dan
Kang, Yu
Hou, Tingjun
Pan, Peichen
author_sort Shen, Chao
collection PubMed
description Applying machine learning algorithms to protein–ligand scoring functions has aroused widespread attention in recent years due to the high predictive accuracy and affordable computational cost. Nevertheless, most machine learning-based scoring functions are only applicable to a specific task, e.g., binding affinity prediction, binding pose prediction or virtual screening, suggesting that the development of a scoring function with balanced performance in all critical tasks remains a grand challenge. To this end, we propose a novel parameterization strategy by introducing an adjustable binding affinity term that represents the correlation between the predicted outcomes and experimental data into the training of mixture density network. The resulting residue-atom distance likelihood potential not only retains the superior docking and screening power over all the other state-of-the-art approaches, but also achieves a remarkable improvement in scoring and ranking performance. We emphatically explore the impacts of several key elements on prediction accuracy as well as the task preference, and demonstrate that the performance of scoring/ranking and docking/screening tasks of a certain model could be well balanced through an appropriate manner. Overall, our study highlights the potential utility of our innovative parameterization strategy as well as the resulting scoring framework in future structure-based drug design.
format Online
Article
Text
id pubmed-10395315
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-103953152023-08-03 A generalized protein–ligand scoring framework with balanced scoring, docking, ranking and screening powers Shen, Chao Zhang, Xujun Hsieh, Chang-Yu Deng, Yafeng Wang, Dong Xu, Lei Wu, Jian Li, Dan Kang, Yu Hou, Tingjun Pan, Peichen Chem Sci Chemistry Applying machine learning algorithms to protein–ligand scoring functions has aroused widespread attention in recent years due to the high predictive accuracy and affordable computational cost. Nevertheless, most machine learning-based scoring functions are only applicable to a specific task, e.g., binding affinity prediction, binding pose prediction or virtual screening, suggesting that the development of a scoring function with balanced performance in all critical tasks remains a grand challenge. To this end, we propose a novel parameterization strategy by introducing an adjustable binding affinity term that represents the correlation between the predicted outcomes and experimental data into the training of mixture density network. The resulting residue-atom distance likelihood potential not only retains the superior docking and screening power over all the other state-of-the-art approaches, but also achieves a remarkable improvement in scoring and ranking performance. We emphatically explore the impacts of several key elements on prediction accuracy as well as the task preference, and demonstrate that the performance of scoring/ranking and docking/screening tasks of a certain model could be well balanced through an appropriate manner. Overall, our study highlights the potential utility of our innovative parameterization strategy as well as the resulting scoring framework in future structure-based drug design. The Royal Society of Chemistry 2023-07-04 /pmc/articles/PMC10395315/ /pubmed/37538816 http://dx.doi.org/10.1039/d3sc02044d Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Chemistry
Shen, Chao
Zhang, Xujun
Hsieh, Chang-Yu
Deng, Yafeng
Wang, Dong
Xu, Lei
Wu, Jian
Li, Dan
Kang, Yu
Hou, Tingjun
Pan, Peichen
A generalized protein–ligand scoring framework with balanced scoring, docking, ranking and screening powers
title A generalized protein–ligand scoring framework with balanced scoring, docking, ranking and screening powers
title_full A generalized protein–ligand scoring framework with balanced scoring, docking, ranking and screening powers
title_fullStr A generalized protein–ligand scoring framework with balanced scoring, docking, ranking and screening powers
title_full_unstemmed A generalized protein–ligand scoring framework with balanced scoring, docking, ranking and screening powers
title_short A generalized protein–ligand scoring framework with balanced scoring, docking, ranking and screening powers
title_sort generalized protein–ligand scoring framework with balanced scoring, docking, ranking and screening powers
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10395315/
https://www.ncbi.nlm.nih.gov/pubmed/37538816
http://dx.doi.org/10.1039/d3sc02044d
work_keys_str_mv AT shenchao ageneralizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT zhangxujun ageneralizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT hsiehchangyu ageneralizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT dengyafeng ageneralizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT wangdong ageneralizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT xulei ageneralizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT wujian ageneralizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT lidan ageneralizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT kangyu ageneralizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT houtingjun ageneralizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT panpeichen ageneralizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT shenchao generalizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT zhangxujun generalizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT hsiehchangyu generalizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT dengyafeng generalizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT wangdong generalizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT xulei generalizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT wujian generalizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT lidan generalizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT kangyu generalizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT houtingjun generalizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers
AT panpeichen generalizedproteinligandscoringframeworkwithbalancedscoringdockingrankingandscreeningpowers