Cargando…

XLPFE: A Simple and Effective Machine Learning Scoring Function for Protein–Ligand Scoring and Ranking

[Image: see text] Prediction of protein–ligand binding affinities is a central issue in structure-based computer-aided drug design. In recent years, much effort has been devoted to the prediction of the binding affinity in protein–ligand complexes using machine learning (ML). Due to the remarkable a...

Descripción completa

Detalles Bibliográficos
Autores principales: Dong, Lina, Qu, Xiaoyang, Wang, Binju
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245135/
https://www.ncbi.nlm.nih.gov/pubmed/35785279
http://dx.doi.org/10.1021/acsomega.2c01723
_version_ 1784738683134410752
author Dong, Lina
Qu, Xiaoyang
Wang, Binju
author_facet Dong, Lina
Qu, Xiaoyang
Wang, Binju
author_sort Dong, Lina
collection PubMed
description [Image: see text] Prediction of protein–ligand binding affinities is a central issue in structure-based computer-aided drug design. In recent years, much effort has been devoted to the prediction of the binding affinity in protein–ligand complexes using machine learning (ML). Due to the remarkable ability of ML methods in nonlinear fitting, ML-based scoring functions (SFs) can deliver much improved performance on a selected test set, such as the comparative assessment of scoring functions (CASF), when compared to the classical SFs. However, the performance of ML-based SFs heavily relies on the overall similarity of the training set and the test set. To improve the performance and transferability of an SF, we have tried to combine various features including energy terms from X-score and AutoDock Vina, the properties of ligands, and the statistical sequence-related information from either the binding site or the full protein. In conjunction with extreme trees (ET), an ML model, we have developed XLPFE, a new SF. Compared with other tested methods such as X-score, AutoDock Vina, ΔvinaXGB, PSH-ML, or CNN-score, XLPFE achieves consistently better scoring and ranking power for various types of protein–ligand complex structures beyond the CASF, suggesting that XLPFE has superior transferability. In particular, XLPFE performs better with metalloenzymes. With its faster speed, improved accuracy, and better transferability, XLPFE could be usefully applied to a diverse range of protein–ligand complexes.
format Online
Article
Text
id pubmed-9245135
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-92451352022-07-01 XLPFE: A Simple and Effective Machine Learning Scoring Function for Protein–Ligand Scoring and Ranking Dong, Lina Qu, Xiaoyang Wang, Binju ACS Omega [Image: see text] Prediction of protein–ligand binding affinities is a central issue in structure-based computer-aided drug design. In recent years, much effort has been devoted to the prediction of the binding affinity in protein–ligand complexes using machine learning (ML). Due to the remarkable ability of ML methods in nonlinear fitting, ML-based scoring functions (SFs) can deliver much improved performance on a selected test set, such as the comparative assessment of scoring functions (CASF), when compared to the classical SFs. However, the performance of ML-based SFs heavily relies on the overall similarity of the training set and the test set. To improve the performance and transferability of an SF, we have tried to combine various features including energy terms from X-score and AutoDock Vina, the properties of ligands, and the statistical sequence-related information from either the binding site or the full protein. In conjunction with extreme trees (ET), an ML model, we have developed XLPFE, a new SF. Compared with other tested methods such as X-score, AutoDock Vina, ΔvinaXGB, PSH-ML, or CNN-score, XLPFE achieves consistently better scoring and ranking power for various types of protein–ligand complex structures beyond the CASF, suggesting that XLPFE has superior transferability. In particular, XLPFE performs better with metalloenzymes. With its faster speed, improved accuracy, and better transferability, XLPFE could be usefully applied to a diverse range of protein–ligand complexes. American Chemical Society 2022-06-13 /pmc/articles/PMC9245135/ /pubmed/35785279 http://dx.doi.org/10.1021/acsomega.2c01723 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Dong, Lina
Qu, Xiaoyang
Wang, Binju
XLPFE: A Simple and Effective Machine Learning Scoring Function for Protein–Ligand Scoring and Ranking
title XLPFE: A Simple and Effective Machine Learning Scoring Function for Protein–Ligand Scoring and Ranking
title_full XLPFE: A Simple and Effective Machine Learning Scoring Function for Protein–Ligand Scoring and Ranking
title_fullStr XLPFE: A Simple and Effective Machine Learning Scoring Function for Protein–Ligand Scoring and Ranking
title_full_unstemmed XLPFE: A Simple and Effective Machine Learning Scoring Function for Protein–Ligand Scoring and Ranking
title_short XLPFE: A Simple and Effective Machine Learning Scoring Function for Protein–Ligand Scoring and Ranking
title_sort xlpfe: a simple and effective machine learning scoring function for protein–ligand scoring and ranking
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245135/
https://www.ncbi.nlm.nih.gov/pubmed/35785279
http://dx.doi.org/10.1021/acsomega.2c01723
work_keys_str_mv AT donglina xlpfeasimpleandeffectivemachinelearningscoringfunctionforproteinligandscoringandranking
AT quxiaoyang xlpfeasimpleandeffectivemachinelearningscoringfunctionforproteinligandscoringandranking
AT wangbinju xlpfeasimpleandeffectivemachinelearningscoringfunctionforproteinligandscoringandranking