Cargando…
Applied machine learning for predicting the lanthanide-ligand binding affinities
Binding affinities of metal–ligand complexes are central to a multitude of applications like drug design, chelation therapy, designing reagents for solvent extraction etc. While state-of-the-art molecular modelling approaches are usually employed to gather structural and chemical insights about the...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7459320/ https://www.ncbi.nlm.nih.gov/pubmed/32868845 http://dx.doi.org/10.1038/s41598-020-71255-9 |
_version_ | 1783576349967908864 |
---|---|
author | Chaube, Suryanaman Goverapet Srinivasan, Sriram Rai, Beena |
author_facet | Chaube, Suryanaman Goverapet Srinivasan, Sriram Rai, Beena |
author_sort | Chaube, Suryanaman |
collection | PubMed |
description | Binding affinities of metal–ligand complexes are central to a multitude of applications like drug design, chelation therapy, designing reagents for solvent extraction etc. While state-of-the-art molecular modelling approaches are usually employed to gather structural and chemical insights about the metal complexation with ligands, their computational cost and the limited ability to predict metal–ligand stability constants with reasonable accuracy, renders them impractical to screen large chemical spaces. In this context, leveraging vast amounts of experimental data to learn the metal-binding affinities of ligands becomes a promising alternative. Here, we develop a machine learning framework for predicting binding affinities (logK(1)) of lanthanide cations with several structurally diverse molecular ligands. Six supervised machine learning algorithms—Random Forest (RF), k-Nearest Neighbours (KNN), Support Vector Machines (SVM), Kernel Ridge Regression (KRR), Multi Layered Perceptrons (MLP) and Adaptive Boosting (AdaBoost)—were trained on a dataset comprising thousands of experimental values of logK(1) and validated in an external 10-folds cross-validation procedure. This was followed by a thorough feature engineering and feature importance analysis to identify the molecular, metallic and solvent features most relevant to binding affinity prediction, along with an evaluation of performance metrics against the dimensionality of feature space. Having demonstrated the excellent predictive ability of our framework, we utilized the best performing AdaBoost model to predict the logK(1) values of lanthanide cations with nearly 71 million compounds present in the PubChem database. Our methodology opens up an opportunity for significantly accelerating screening and design of ligands for various targeted applications, from vast chemical spaces. |
format | Online Article Text |
id | pubmed-7459320 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-74593202020-09-01 Applied machine learning for predicting the lanthanide-ligand binding affinities Chaube, Suryanaman Goverapet Srinivasan, Sriram Rai, Beena Sci Rep Article Binding affinities of metal–ligand complexes are central to a multitude of applications like drug design, chelation therapy, designing reagents for solvent extraction etc. While state-of-the-art molecular modelling approaches are usually employed to gather structural and chemical insights about the metal complexation with ligands, their computational cost and the limited ability to predict metal–ligand stability constants with reasonable accuracy, renders them impractical to screen large chemical spaces. In this context, leveraging vast amounts of experimental data to learn the metal-binding affinities of ligands becomes a promising alternative. Here, we develop a machine learning framework for predicting binding affinities (logK(1)) of lanthanide cations with several structurally diverse molecular ligands. Six supervised machine learning algorithms—Random Forest (RF), k-Nearest Neighbours (KNN), Support Vector Machines (SVM), Kernel Ridge Regression (KRR), Multi Layered Perceptrons (MLP) and Adaptive Boosting (AdaBoost)—were trained on a dataset comprising thousands of experimental values of logK(1) and validated in an external 10-folds cross-validation procedure. This was followed by a thorough feature engineering and feature importance analysis to identify the molecular, metallic and solvent features most relevant to binding affinity prediction, along with an evaluation of performance metrics against the dimensionality of feature space. Having demonstrated the excellent predictive ability of our framework, we utilized the best performing AdaBoost model to predict the logK(1) values of lanthanide cations with nearly 71 million compounds present in the PubChem database. Our methodology opens up an opportunity for significantly accelerating screening and design of ligands for various targeted applications, from vast chemical spaces. Nature Publishing Group UK 2020-08-31 /pmc/articles/PMC7459320/ /pubmed/32868845 http://dx.doi.org/10.1038/s41598-020-71255-9 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Chaube, Suryanaman Goverapet Srinivasan, Sriram Rai, Beena Applied machine learning for predicting the lanthanide-ligand binding affinities |
title | Applied machine learning for predicting the lanthanide-ligand binding affinities |
title_full | Applied machine learning for predicting the lanthanide-ligand binding affinities |
title_fullStr | Applied machine learning for predicting the lanthanide-ligand binding affinities |
title_full_unstemmed | Applied machine learning for predicting the lanthanide-ligand binding affinities |
title_short | Applied machine learning for predicting the lanthanide-ligand binding affinities |
title_sort | applied machine learning for predicting the lanthanide-ligand binding affinities |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7459320/ https://www.ncbi.nlm.nih.gov/pubmed/32868845 http://dx.doi.org/10.1038/s41598-020-71255-9 |
work_keys_str_mv | AT chaubesuryanaman appliedmachinelearningforpredictingthelanthanideligandbindingaffinities AT goverapetsrinivasansriram appliedmachinelearningforpredictingthelanthanideligandbindingaffinities AT raibeena appliedmachinelearningforpredictingthelanthanideligandbindingaffinities |