Cargando…

Application of Artificial Neural Networks to Predict the Intrinsic Solubility of Drug-Like Molecules

Machine learning (ML) approaches are receiving increasing attention from pharmaceutical companies and regulatory agencies, given their ability to mine knowledge from available data. In drug discovery, for example, they are employed in quantitative structure–property relationship (QSPR) models to pre...

Descripción completa

Detalles Bibliográficos
Autores principales: Tosca, Elena M., Bartolucci, Roberta, Magni, Paolo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8309152/
https://www.ncbi.nlm.nih.gov/pubmed/34371792
http://dx.doi.org/10.3390/pharmaceutics13071101
_version_ 1783728454806536192
author Tosca, Elena M.
Bartolucci, Roberta
Magni, Paolo
author_facet Tosca, Elena M.
Bartolucci, Roberta
Magni, Paolo
author_sort Tosca, Elena M.
collection PubMed
description Machine learning (ML) approaches are receiving increasing attention from pharmaceutical companies and regulatory agencies, given their ability to mine knowledge from available data. In drug discovery, for example, they are employed in quantitative structure–property relationship (QSPR) models to predict biological properties from the chemical structure of a drug molecule. In this paper, following the Second Solubility Challenge (SC-2), a QSPR model based on artificial neural networks (ANNs) was built to predict the intrinsic solubility (logS(0)) of the 100-compound low-variance tight set and the 32-compound high-variance loose set provided by SC-2 as test datasets. First, a training dataset of 270 drug-like molecules with logS(0) value experimentally determined was gathered from the literature. Then, a standard three-layer feed-forward neural network was defined by using 10 ChemGPS physico-chemical descriptors as input features. The developed ANN showed adequate predictive performances on both of the SC-2 test datasets. Benefits and limitations of ML approaches have been highlighted and discussed, starting from this case-study. The main findings confirmed that ML approaches are an attractive and promising tool to predict logS(0); however, many aspects, such as data quality, molecular descriptor computation and selection, and assessment of applicability domain, are crucial but often neglected, and should be carefully considered to improve predictions based on ML.
format Online
Article
Text
id pubmed-8309152
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83091522021-07-25 Application of Artificial Neural Networks to Predict the Intrinsic Solubility of Drug-Like Molecules Tosca, Elena M. Bartolucci, Roberta Magni, Paolo Pharmaceutics Article Machine learning (ML) approaches are receiving increasing attention from pharmaceutical companies and regulatory agencies, given their ability to mine knowledge from available data. In drug discovery, for example, they are employed in quantitative structure–property relationship (QSPR) models to predict biological properties from the chemical structure of a drug molecule. In this paper, following the Second Solubility Challenge (SC-2), a QSPR model based on artificial neural networks (ANNs) was built to predict the intrinsic solubility (logS(0)) of the 100-compound low-variance tight set and the 32-compound high-variance loose set provided by SC-2 as test datasets. First, a training dataset of 270 drug-like molecules with logS(0) value experimentally determined was gathered from the literature. Then, a standard three-layer feed-forward neural network was defined by using 10 ChemGPS physico-chemical descriptors as input features. The developed ANN showed adequate predictive performances on both of the SC-2 test datasets. Benefits and limitations of ML approaches have been highlighted and discussed, starting from this case-study. The main findings confirmed that ML approaches are an attractive and promising tool to predict logS(0); however, many aspects, such as data quality, molecular descriptor computation and selection, and assessment of applicability domain, are crucial but often neglected, and should be carefully considered to improve predictions based on ML. MDPI 2021-07-20 /pmc/articles/PMC8309152/ /pubmed/34371792 http://dx.doi.org/10.3390/pharmaceutics13071101 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Tosca, Elena M.
Bartolucci, Roberta
Magni, Paolo
Application of Artificial Neural Networks to Predict the Intrinsic Solubility of Drug-Like Molecules
title Application of Artificial Neural Networks to Predict the Intrinsic Solubility of Drug-Like Molecules
title_full Application of Artificial Neural Networks to Predict the Intrinsic Solubility of Drug-Like Molecules
title_fullStr Application of Artificial Neural Networks to Predict the Intrinsic Solubility of Drug-Like Molecules
title_full_unstemmed Application of Artificial Neural Networks to Predict the Intrinsic Solubility of Drug-Like Molecules
title_short Application of Artificial Neural Networks to Predict the Intrinsic Solubility of Drug-Like Molecules
title_sort application of artificial neural networks to predict the intrinsic solubility of drug-like molecules
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8309152/
https://www.ncbi.nlm.nih.gov/pubmed/34371792
http://dx.doi.org/10.3390/pharmaceutics13071101
work_keys_str_mv AT toscaelenam applicationofartificialneuralnetworkstopredicttheintrinsicsolubilityofdruglikemolecules
AT bartolucciroberta applicationofartificialneuralnetworkstopredicttheintrinsicsolubilityofdruglikemolecules
AT magnipaolo applicationofartificialneuralnetworkstopredicttheintrinsicsolubilityofdruglikemolecules