Cargando…
XLPFE: A Simple and Effective Machine Learning Scoring Function for Protein–Ligand Scoring and Ranking
[Image: see text] Prediction of protein–ligand binding affinities is a central issue in structure-based computer-aided drug design. In recent years, much effort has been devoted to the prediction of the binding affinity in protein–ligand complexes using machine learning (ML). Due to the remarkable a...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2022
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245135/ https://www.ncbi.nlm.nih.gov/pubmed/35785279 http://dx.doi.org/10.1021/acsomega.2c01723 |
_version_ | 1784738683134410752 |
---|---|
author | Dong, Lina Qu, Xiaoyang Wang, Binju |
author_facet | Dong, Lina Qu, Xiaoyang Wang, Binju |
author_sort | Dong, Lina |
collection | PubMed |
description | [Image: see text] Prediction of protein–ligand binding affinities is a central issue in structure-based computer-aided drug design. In recent years, much effort has been devoted to the prediction of the binding affinity in protein–ligand complexes using machine learning (ML). Due to the remarkable ability of ML methods in nonlinear fitting, ML-based scoring functions (SFs) can deliver much improved performance on a selected test set, such as the comparative assessment of scoring functions (CASF), when compared to the classical SFs. However, the performance of ML-based SFs heavily relies on the overall similarity of the training set and the test set. To improve the performance and transferability of an SF, we have tried to combine various features including energy terms from X-score and AutoDock Vina, the properties of ligands, and the statistical sequence-related information from either the binding site or the full protein. In conjunction with extreme trees (ET), an ML model, we have developed XLPFE, a new SF. Compared with other tested methods such as X-score, AutoDock Vina, ΔvinaXGB, PSH-ML, or CNN-score, XLPFE achieves consistently better scoring and ranking power for various types of protein–ligand complex structures beyond the CASF, suggesting that XLPFE has superior transferability. In particular, XLPFE performs better with metalloenzymes. With its faster speed, improved accuracy, and better transferability, XLPFE could be usefully applied to a diverse range of protein–ligand complexes. |
format | Online Article Text |
id | pubmed-9245135 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-92451352022-07-01 XLPFE: A Simple and Effective Machine Learning Scoring Function for Protein–Ligand Scoring and Ranking Dong, Lina Qu, Xiaoyang Wang, Binju ACS Omega [Image: see text] Prediction of protein–ligand binding affinities is a central issue in structure-based computer-aided drug design. In recent years, much effort has been devoted to the prediction of the binding affinity in protein–ligand complexes using machine learning (ML). Due to the remarkable ability of ML methods in nonlinear fitting, ML-based scoring functions (SFs) can deliver much improved performance on a selected test set, such as the comparative assessment of scoring functions (CASF), when compared to the classical SFs. However, the performance of ML-based SFs heavily relies on the overall similarity of the training set and the test set. To improve the performance and transferability of an SF, we have tried to combine various features including energy terms from X-score and AutoDock Vina, the properties of ligands, and the statistical sequence-related information from either the binding site or the full protein. In conjunction with extreme trees (ET), an ML model, we have developed XLPFE, a new SF. Compared with other tested methods such as X-score, AutoDock Vina, ΔvinaXGB, PSH-ML, or CNN-score, XLPFE achieves consistently better scoring and ranking power for various types of protein–ligand complex structures beyond the CASF, suggesting that XLPFE has superior transferability. In particular, XLPFE performs better with metalloenzymes. With its faster speed, improved accuracy, and better transferability, XLPFE could be usefully applied to a diverse range of protein–ligand complexes. American Chemical Society 2022-06-13 /pmc/articles/PMC9245135/ /pubmed/35785279 http://dx.doi.org/10.1021/acsomega.2c01723 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Dong, Lina Qu, Xiaoyang Wang, Binju XLPFE: A Simple and Effective Machine Learning Scoring Function for Protein–Ligand Scoring and Ranking |
title | XLPFE: A Simple and Effective Machine Learning Scoring
Function for Protein–Ligand Scoring and Ranking |
title_full | XLPFE: A Simple and Effective Machine Learning Scoring
Function for Protein–Ligand Scoring and Ranking |
title_fullStr | XLPFE: A Simple and Effective Machine Learning Scoring
Function for Protein–Ligand Scoring and Ranking |
title_full_unstemmed | XLPFE: A Simple and Effective Machine Learning Scoring
Function for Protein–Ligand Scoring and Ranking |
title_short | XLPFE: A Simple and Effective Machine Learning Scoring
Function for Protein–Ligand Scoring and Ranking |
title_sort | xlpfe: a simple and effective machine learning scoring
function for protein–ligand scoring and ranking |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245135/ https://www.ncbi.nlm.nih.gov/pubmed/35785279 http://dx.doi.org/10.1021/acsomega.2c01723 |
work_keys_str_mv | AT donglina xlpfeasimpleandeffectivemachinelearningscoringfunctionforproteinligandscoringandranking AT quxiaoyang xlpfeasimpleandeffectivemachinelearningscoringfunctionforproteinligandscoringandranking AT wangbinju xlpfeasimpleandeffectivemachinelearningscoringfunctionforproteinligandscoringandranking |