Cargando…
Application of Artificial Neural Networks to Predict the Intrinsic Solubility of Drug-Like Molecules
Machine learning (ML) approaches are receiving increasing attention from pharmaceutical companies and regulatory agencies, given their ability to mine knowledge from available data. In drug discovery, for example, they are employed in quantitative structure–property relationship (QSPR) models to pre...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8309152/ https://www.ncbi.nlm.nih.gov/pubmed/34371792 http://dx.doi.org/10.3390/pharmaceutics13071101 |
_version_ | 1783728454806536192 |
---|---|
author | Tosca, Elena M. Bartolucci, Roberta Magni, Paolo |
author_facet | Tosca, Elena M. Bartolucci, Roberta Magni, Paolo |
author_sort | Tosca, Elena M. |
collection | PubMed |
description | Machine learning (ML) approaches are receiving increasing attention from pharmaceutical companies and regulatory agencies, given their ability to mine knowledge from available data. In drug discovery, for example, they are employed in quantitative structure–property relationship (QSPR) models to predict biological properties from the chemical structure of a drug molecule. In this paper, following the Second Solubility Challenge (SC-2), a QSPR model based on artificial neural networks (ANNs) was built to predict the intrinsic solubility (logS(0)) of the 100-compound low-variance tight set and the 32-compound high-variance loose set provided by SC-2 as test datasets. First, a training dataset of 270 drug-like molecules with logS(0) value experimentally determined was gathered from the literature. Then, a standard three-layer feed-forward neural network was defined by using 10 ChemGPS physico-chemical descriptors as input features. The developed ANN showed adequate predictive performances on both of the SC-2 test datasets. Benefits and limitations of ML approaches have been highlighted and discussed, starting from this case-study. The main findings confirmed that ML approaches are an attractive and promising tool to predict logS(0); however, many aspects, such as data quality, molecular descriptor computation and selection, and assessment of applicability domain, are crucial but often neglected, and should be carefully considered to improve predictions based on ML. |
format | Online Article Text |
id | pubmed-8309152 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-83091522021-07-25 Application of Artificial Neural Networks to Predict the Intrinsic Solubility of Drug-Like Molecules Tosca, Elena M. Bartolucci, Roberta Magni, Paolo Pharmaceutics Article Machine learning (ML) approaches are receiving increasing attention from pharmaceutical companies and regulatory agencies, given their ability to mine knowledge from available data. In drug discovery, for example, they are employed in quantitative structure–property relationship (QSPR) models to predict biological properties from the chemical structure of a drug molecule. In this paper, following the Second Solubility Challenge (SC-2), a QSPR model based on artificial neural networks (ANNs) was built to predict the intrinsic solubility (logS(0)) of the 100-compound low-variance tight set and the 32-compound high-variance loose set provided by SC-2 as test datasets. First, a training dataset of 270 drug-like molecules with logS(0) value experimentally determined was gathered from the literature. Then, a standard three-layer feed-forward neural network was defined by using 10 ChemGPS physico-chemical descriptors as input features. The developed ANN showed adequate predictive performances on both of the SC-2 test datasets. Benefits and limitations of ML approaches have been highlighted and discussed, starting from this case-study. The main findings confirmed that ML approaches are an attractive and promising tool to predict logS(0); however, many aspects, such as data quality, molecular descriptor computation and selection, and assessment of applicability domain, are crucial but often neglected, and should be carefully considered to improve predictions based on ML. MDPI 2021-07-20 /pmc/articles/PMC8309152/ /pubmed/34371792 http://dx.doi.org/10.3390/pharmaceutics13071101 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Tosca, Elena M. Bartolucci, Roberta Magni, Paolo Application of Artificial Neural Networks to Predict the Intrinsic Solubility of Drug-Like Molecules |
title | Application of Artificial Neural Networks to Predict the Intrinsic Solubility of Drug-Like Molecules |
title_full | Application of Artificial Neural Networks to Predict the Intrinsic Solubility of Drug-Like Molecules |
title_fullStr | Application of Artificial Neural Networks to Predict the Intrinsic Solubility of Drug-Like Molecules |
title_full_unstemmed | Application of Artificial Neural Networks to Predict the Intrinsic Solubility of Drug-Like Molecules |
title_short | Application of Artificial Neural Networks to Predict the Intrinsic Solubility of Drug-Like Molecules |
title_sort | application of artificial neural networks to predict the intrinsic solubility of drug-like molecules |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8309152/ https://www.ncbi.nlm.nih.gov/pubmed/34371792 http://dx.doi.org/10.3390/pharmaceutics13071101 |
work_keys_str_mv | AT toscaelenam applicationofartificialneuralnetworkstopredicttheintrinsicsolubilityofdruglikemolecules AT bartolucciroberta applicationofartificialneuralnetworkstopredicttheintrinsicsolubilityofdruglikemolecules AT magnipaolo applicationofartificialneuralnetworkstopredicttheintrinsicsolubilityofdruglikemolecules |