Cargando…
Machine-Learning- and Knowledge-Based Scoring Functions Incorporating Ligand and Protein Fingerprints
[Image: see text] We propose a novel machine-learning-based scoring function for drug discovery that incorporates ligand and protein structural information into a knowledge-based PMF score. Molecular docking, a simulation method for structure-based drug design (SBDD), is expected to reduce the enorm...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2022
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9178954/ https://www.ncbi.nlm.nih.gov/pubmed/35694525 http://dx.doi.org/10.1021/acsomega.2c02822 |
_version_ | 1784723166174642176 |
---|---|
author | Fujimoto, Kazuhiro J. Minami, Shota Yanai, Takeshi |
author_facet | Fujimoto, Kazuhiro J. Minami, Shota Yanai, Takeshi |
author_sort | Fujimoto, Kazuhiro J. |
collection | PubMed |
description | [Image: see text] We propose a novel machine-learning-based scoring function for drug discovery that incorporates ligand and protein structural information into a knowledge-based PMF score. Molecular docking, a simulation method for structure-based drug design (SBDD), is expected to reduce the enormous costs associated with conventional experimental methods in terms of rational drug discovery. Molecular docking has two main purposes: to predict ligand-binding structures for target proteins and to predict protein–ligand binding affinity. Currently available programs of molecular docking offer an accurate prediction of ligand binding structures for many systems. However, the accurate prediction of binding affinity remains challenging. In this study, we developed a new scoring function that incorporates fingerprints representing ligand and protein structures as descriptors in the PMF score. Here, regression analysis of the scoring function was performed using the following machine learning techniques: least absolute shrinkage and selection operator (LASSO) and light gradient boosting machine (LightGBM). The results on a test data set showed that the binding affinity delivered by the newly developed scoring function has a Pearson correlation coefficient of 0.79 with the experimental value, which surpasses that of the conventional scoring functions. Further analysis provided a chemical understanding of the descriptors that contributed significantly to the improvement in prediction accuracy. Our approach and findings are useful for rational drug discovery. |
format | Online Article Text |
id | pubmed-9178954 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-91789542022-06-10 Machine-Learning- and Knowledge-Based Scoring Functions Incorporating Ligand and Protein Fingerprints Fujimoto, Kazuhiro J. Minami, Shota Yanai, Takeshi ACS Omega [Image: see text] We propose a novel machine-learning-based scoring function for drug discovery that incorporates ligand and protein structural information into a knowledge-based PMF score. Molecular docking, a simulation method for structure-based drug design (SBDD), is expected to reduce the enormous costs associated with conventional experimental methods in terms of rational drug discovery. Molecular docking has two main purposes: to predict ligand-binding structures for target proteins and to predict protein–ligand binding affinity. Currently available programs of molecular docking offer an accurate prediction of ligand binding structures for many systems. However, the accurate prediction of binding affinity remains challenging. In this study, we developed a new scoring function that incorporates fingerprints representing ligand and protein structures as descriptors in the PMF score. Here, regression analysis of the scoring function was performed using the following machine learning techniques: least absolute shrinkage and selection operator (LASSO) and light gradient boosting machine (LightGBM). The results on a test data set showed that the binding affinity delivered by the newly developed scoring function has a Pearson correlation coefficient of 0.79 with the experimental value, which surpasses that of the conventional scoring functions. Further analysis provided a chemical understanding of the descriptors that contributed significantly to the improvement in prediction accuracy. Our approach and findings are useful for rational drug discovery. American Chemical Society 2022-05-25 /pmc/articles/PMC9178954/ /pubmed/35694525 http://dx.doi.org/10.1021/acsomega.2c02822 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Fujimoto, Kazuhiro J. Minami, Shota Yanai, Takeshi Machine-Learning- and Knowledge-Based Scoring Functions Incorporating Ligand and Protein Fingerprints |
title | Machine-Learning- and Knowledge-Based Scoring Functions
Incorporating Ligand and Protein Fingerprints |
title_full | Machine-Learning- and Knowledge-Based Scoring Functions
Incorporating Ligand and Protein Fingerprints |
title_fullStr | Machine-Learning- and Knowledge-Based Scoring Functions
Incorporating Ligand and Protein Fingerprints |
title_full_unstemmed | Machine-Learning- and Knowledge-Based Scoring Functions
Incorporating Ligand and Protein Fingerprints |
title_short | Machine-Learning- and Knowledge-Based Scoring Functions
Incorporating Ligand and Protein Fingerprints |
title_sort | machine-learning- and knowledge-based scoring functions
incorporating ligand and protein fingerprints |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9178954/ https://www.ncbi.nlm.nih.gov/pubmed/35694525 http://dx.doi.org/10.1021/acsomega.2c02822 |
work_keys_str_mv | AT fujimotokazuhiroj machinelearningandknowledgebasedscoringfunctionsincorporatingligandandproteinfingerprints AT minamishota machinelearningandknowledgebasedscoringfunctionsincorporatingligandandproteinfingerprints AT yanaitakeshi machinelearningandknowledgebasedscoringfunctionsincorporatingligandandproteinfingerprints |