Cargando…

Applied machine learning for predicting the lanthanide-ligand binding affinities

Binding affinities of metal–ligand complexes are central to a multitude of applications like drug design, chelation therapy, designing reagents for solvent extraction etc. While state-of-the-art molecular modelling approaches are usually employed to gather structural and chemical insights about the...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chaube, Suryanaman, Goverapet Srinivasan, Sriram, Rai, Beena
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7459320/ https://www.ncbi.nlm.nih.gov/pubmed/32868845 http://dx.doi.org/10.1038/s41598-020-71255-9

_version_	1783576349967908864
author	Chaube, Suryanaman Goverapet Srinivasan, Sriram Rai, Beena
author_facet	Chaube, Suryanaman Goverapet Srinivasan, Sriram Rai, Beena
author_sort	Chaube, Suryanaman
collection	PubMed
description	Binding affinities of metal–ligand complexes are central to a multitude of applications like drug design, chelation therapy, designing reagents for solvent extraction etc. While state-of-the-art molecular modelling approaches are usually employed to gather structural and chemical insights about the metal complexation with ligands, their computational cost and the limited ability to predict metal–ligand stability constants with reasonable accuracy, renders them impractical to screen large chemical spaces. In this context, leveraging vast amounts of experimental data to learn the metal-binding affinities of ligands becomes a promising alternative. Here, we develop a machine learning framework for predicting binding affinities (logK(1)) of lanthanide cations with several structurally diverse molecular ligands. Six supervised machine learning algorithms—Random Forest (RF), k-Nearest Neighbours (KNN), Support Vector Machines (SVM), Kernel Ridge Regression (KRR), Multi Layered Perceptrons (MLP) and Adaptive Boosting (AdaBoost)—were trained on a dataset comprising thousands of experimental values of logK(1) and validated in an external 10-folds cross-validation procedure. This was followed by a thorough feature engineering and feature importance analysis to identify the molecular, metallic and solvent features most relevant to binding affinity prediction, along with an evaluation of performance metrics against the dimensionality of feature space. Having demonstrated the excellent predictive ability of our framework, we utilized the best performing AdaBoost model to predict the logK(1) values of lanthanide cations with nearly 71 million compounds present in the PubChem database. Our methodology opens up an opportunity for significantly accelerating screening and design of ligands for various targeted applications, from vast chemical spaces.
format	Online Article Text
id	pubmed-7459320
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-74593202020-09-01 Applied machine learning for predicting the lanthanide-ligand binding affinities Chaube, Suryanaman Goverapet Srinivasan, Sriram Rai, Beena Sci Rep Article Binding affinities of metal–ligand complexes are central to a multitude of applications like drug design, chelation therapy, designing reagents for solvent extraction etc. While state-of-the-art molecular modelling approaches are usually employed to gather structural and chemical insights about the metal complexation with ligands, their computational cost and the limited ability to predict metal–ligand stability constants with reasonable accuracy, renders them impractical to screen large chemical spaces. In this context, leveraging vast amounts of experimental data to learn the metal-binding affinities of ligands becomes a promising alternative. Here, we develop a machine learning framework for predicting binding affinities (logK(1)) of lanthanide cations with several structurally diverse molecular ligands. Six supervised machine learning algorithms—Random Forest (RF), k-Nearest Neighbours (KNN), Support Vector Machines (SVM), Kernel Ridge Regression (KRR), Multi Layered Perceptrons (MLP) and Adaptive Boosting (AdaBoost)—were trained on a dataset comprising thousands of experimental values of logK(1) and validated in an external 10-folds cross-validation procedure. This was followed by a thorough feature engineering and feature importance analysis to identify the molecular, metallic and solvent features most relevant to binding affinity prediction, along with an evaluation of performance metrics against the dimensionality of feature space. Having demonstrated the excellent predictive ability of our framework, we utilized the best performing AdaBoost model to predict the logK(1) values of lanthanide cations with nearly 71 million compounds present in the PubChem database. Our methodology opens up an opportunity for significantly accelerating screening and design of ligands for various targeted applications, from vast chemical spaces. Nature Publishing Group UK 2020-08-31 /pmc/articles/PMC7459320/ /pubmed/32868845 http://dx.doi.org/10.1038/s41598-020-71255-9 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle	Article Chaube, Suryanaman Goverapet Srinivasan, Sriram Rai, Beena Applied machine learning for predicting the lanthanide-ligand binding affinities
title	Applied machine learning for predicting the lanthanide-ligand binding affinities
title_full	Applied machine learning for predicting the lanthanide-ligand binding affinities
title_fullStr	Applied machine learning for predicting the lanthanide-ligand binding affinities
title_full_unstemmed	Applied machine learning for predicting the lanthanide-ligand binding affinities
title_short	Applied machine learning for predicting the lanthanide-ligand binding affinities
title_sort	applied machine learning for predicting the lanthanide-ligand binding affinities
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7459320/ https://www.ncbi.nlm.nih.gov/pubmed/32868845 http://dx.doi.org/10.1038/s41598-020-71255-9
work_keys_str_mv	AT chaubesuryanaman appliedmachinelearningforpredictingthelanthanideligandbindingaffinities AT goverapetsrinivasansriram appliedmachinelearningforpredictingthelanthanideligandbindingaffinities AT raibeena appliedmachinelearningforpredictingthelanthanideligandbindingaffinities

Applied machine learning for predicting the lanthanide-ligand binding affinities

Ejemplares similares