Cargando…

Applied machine learning for predicting the lanthanide-ligand binding affinities

Binding affinities of metal–ligand complexes are central to a multitude of applications like drug design, chelation therapy, designing reagents for solvent extraction etc. While state-of-the-art molecular modelling approaches are usually employed to gather structural and chemical insights about the...

Descripción completa

Detalles Bibliográficos
Autores principales: Chaube, Suryanaman, Goverapet Srinivasan, Sriram, Rai, Beena
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7459320/
https://www.ncbi.nlm.nih.gov/pubmed/32868845
http://dx.doi.org/10.1038/s41598-020-71255-9
_version_ 1783576349967908864
author Chaube, Suryanaman
Goverapet Srinivasan, Sriram
Rai, Beena
author_facet Chaube, Suryanaman
Goverapet Srinivasan, Sriram
Rai, Beena
author_sort Chaube, Suryanaman
collection PubMed
description Binding affinities of metal–ligand complexes are central to a multitude of applications like drug design, chelation therapy, designing reagents for solvent extraction etc. While state-of-the-art molecular modelling approaches are usually employed to gather structural and chemical insights about the metal complexation with ligands, their computational cost and the limited ability to predict metal–ligand stability constants with reasonable accuracy, renders them impractical to screen large chemical spaces. In this context, leveraging vast amounts of experimental data to learn the metal-binding affinities of ligands becomes a promising alternative. Here, we develop a machine learning framework for predicting binding affinities (logK(1)) of lanthanide cations with several structurally diverse molecular ligands. Six supervised machine learning algorithms—Random Forest (RF), k-Nearest Neighbours (KNN), Support Vector Machines (SVM), Kernel Ridge Regression (KRR), Multi Layered Perceptrons (MLP) and Adaptive Boosting (AdaBoost)—were trained on a dataset comprising thousands of experimental values of logK(1) and validated in an external 10-folds cross-validation procedure. This was followed by a thorough feature engineering and feature importance analysis to identify the molecular, metallic and solvent features most relevant to binding affinity prediction, along with an evaluation of performance metrics against the dimensionality of feature space. Having demonstrated the excellent predictive ability of our framework, we utilized the best performing AdaBoost model to predict the logK(1) values of lanthanide cations with nearly 71 million compounds present in the PubChem database. Our methodology opens up an opportunity for significantly accelerating screening and design of ligands for various targeted applications, from vast chemical spaces.
format Online
Article
Text
id pubmed-7459320
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-74593202020-09-01 Applied machine learning for predicting the lanthanide-ligand binding affinities Chaube, Suryanaman Goverapet Srinivasan, Sriram Rai, Beena Sci Rep Article Binding affinities of metal–ligand complexes are central to a multitude of applications like drug design, chelation therapy, designing reagents for solvent extraction etc. While state-of-the-art molecular modelling approaches are usually employed to gather structural and chemical insights about the metal complexation with ligands, their computational cost and the limited ability to predict metal–ligand stability constants with reasonable accuracy, renders them impractical to screen large chemical spaces. In this context, leveraging vast amounts of experimental data to learn the metal-binding affinities of ligands becomes a promising alternative. Here, we develop a machine learning framework for predicting binding affinities (logK(1)) of lanthanide cations with several structurally diverse molecular ligands. Six supervised machine learning algorithms—Random Forest (RF), k-Nearest Neighbours (KNN), Support Vector Machines (SVM), Kernel Ridge Regression (KRR), Multi Layered Perceptrons (MLP) and Adaptive Boosting (AdaBoost)—were trained on a dataset comprising thousands of experimental values of logK(1) and validated in an external 10-folds cross-validation procedure. This was followed by a thorough feature engineering and feature importance analysis to identify the molecular, metallic and solvent features most relevant to binding affinity prediction, along with an evaluation of performance metrics against the dimensionality of feature space. Having demonstrated the excellent predictive ability of our framework, we utilized the best performing AdaBoost model to predict the logK(1) values of lanthanide cations with nearly 71 million compounds present in the PubChem database. Our methodology opens up an opportunity for significantly accelerating screening and design of ligands for various targeted applications, from vast chemical spaces. Nature Publishing Group UK 2020-08-31 /pmc/articles/PMC7459320/ /pubmed/32868845 http://dx.doi.org/10.1038/s41598-020-71255-9 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Chaube, Suryanaman
Goverapet Srinivasan, Sriram
Rai, Beena
Applied machine learning for predicting the lanthanide-ligand binding affinities
title Applied machine learning for predicting the lanthanide-ligand binding affinities
title_full Applied machine learning for predicting the lanthanide-ligand binding affinities
title_fullStr Applied machine learning for predicting the lanthanide-ligand binding affinities
title_full_unstemmed Applied machine learning for predicting the lanthanide-ligand binding affinities
title_short Applied machine learning for predicting the lanthanide-ligand binding affinities
title_sort applied machine learning for predicting the lanthanide-ligand binding affinities
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7459320/
https://www.ncbi.nlm.nih.gov/pubmed/32868845
http://dx.doi.org/10.1038/s41598-020-71255-9
work_keys_str_mv AT chaubesuryanaman appliedmachinelearningforpredictingthelanthanideligandbindingaffinities
AT goverapetsrinivasansriram appliedmachinelearningforpredictingthelanthanideligandbindingaffinities
AT raibeena appliedmachinelearningforpredictingthelanthanideligandbindingaffinities