Cargando…

Machine Learning Scoring Functions for Drug Discovery from Experimental and Computer-Generated Protein–Ligand Structures: Towards Per-Target Scoring Functions

In recent years, machine learning has been proposed as a promising strategy to build accurate scoring functions for computational docking finalized to numerically empowered drug discovery. However, the latest studies have suggested that over-optimistic results had been reported due to the correlatio...

Descripción completa

Detalles Bibliográficos
Autores principales:	Pellicani, Francesco, Dal Ben, Diego, Perali, Andrea, Pilati, Sebastiano
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9966217/ https://www.ncbi.nlm.nih.gov/pubmed/36838647 http://dx.doi.org/10.3390/molecules28041661

_version_	1784896961225162752
author	Pellicani, Francesco Dal Ben, Diego Perali, Andrea Pilati, Sebastiano
author_facet	Pellicani, Francesco Dal Ben, Diego Perali, Andrea Pilati, Sebastiano
author_sort	Pellicani, Francesco
collection	PubMed
description	In recent years, machine learning has been proposed as a promising strategy to build accurate scoring functions for computational docking finalized to numerically empowered drug discovery. However, the latest studies have suggested that over-optimistic results had been reported due to the correlations present in the experimental databases used for training and testing. Here, we investigate the performance of an artificial neural network in binding affinity predictions, comparing results obtained using both experimental protein–ligand structures as well as larger sets of computer-generated structures created using commercial software. Interestingly, similar performances are obtained on both databases. We find a noticeable performance suppression when moving from random horizontal tests to vertical tests performed on target proteins not included in the training data. The possibility to train the network on relatively easily created computer-generated databases leads us to explore per-target scoring functions, trained and tested ad-hoc on complexes including only one target protein. Encouraging results are obtained, depending on the type of protein being addressed.
format	Online Article Text
id	pubmed-9966217
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-99662172023-02-26 Machine Learning Scoring Functions for Drug Discovery from Experimental and Computer-Generated Protein–Ligand Structures: Towards Per-Target Scoring Functions Pellicani, Francesco Dal Ben, Diego Perali, Andrea Pilati, Sebastiano Molecules Article In recent years, machine learning has been proposed as a promising strategy to build accurate scoring functions for computational docking finalized to numerically empowered drug discovery. However, the latest studies have suggested that over-optimistic results had been reported due to the correlations present in the experimental databases used for training and testing. Here, we investigate the performance of an artificial neural network in binding affinity predictions, comparing results obtained using both experimental protein–ligand structures as well as larger sets of computer-generated structures created using commercial software. Interestingly, similar performances are obtained on both databases. We find a noticeable performance suppression when moving from random horizontal tests to vertical tests performed on target proteins not included in the training data. The possibility to train the network on relatively easily created computer-generated databases leads us to explore per-target scoring functions, trained and tested ad-hoc on complexes including only one target protein. Encouraging results are obtained, depending on the type of protein being addressed. MDPI 2023-02-09 /pmc/articles/PMC9966217/ /pubmed/36838647 http://dx.doi.org/10.3390/molecules28041661 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Pellicani, Francesco Dal Ben, Diego Perali, Andrea Pilati, Sebastiano Machine Learning Scoring Functions for Drug Discovery from Experimental and Computer-Generated Protein–Ligand Structures: Towards Per-Target Scoring Functions
title	Machine Learning Scoring Functions for Drug Discovery from Experimental and Computer-Generated Protein–Ligand Structures: Towards Per-Target Scoring Functions
title_full	Machine Learning Scoring Functions for Drug Discovery from Experimental and Computer-Generated Protein–Ligand Structures: Towards Per-Target Scoring Functions
title_fullStr	Machine Learning Scoring Functions for Drug Discovery from Experimental and Computer-Generated Protein–Ligand Structures: Towards Per-Target Scoring Functions
title_full_unstemmed	Machine Learning Scoring Functions for Drug Discovery from Experimental and Computer-Generated Protein–Ligand Structures: Towards Per-Target Scoring Functions
title_short	Machine Learning Scoring Functions for Drug Discovery from Experimental and Computer-Generated Protein–Ligand Structures: Towards Per-Target Scoring Functions
title_sort	machine learning scoring functions for drug discovery from experimental and computer-generated protein–ligand structures: towards per-target scoring functions
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9966217/ https://www.ncbi.nlm.nih.gov/pubmed/36838647 http://dx.doi.org/10.3390/molecules28041661
work_keys_str_mv	AT pellicanifrancesco machinelearningscoringfunctionsfordrugdiscoveryfromexperimentalandcomputergeneratedproteinligandstructurestowardspertargetscoringfunctions AT dalbendiego machinelearningscoringfunctionsfordrugdiscoveryfromexperimentalandcomputergeneratedproteinligandstructurestowardspertargetscoringfunctions AT peraliandrea machinelearningscoringfunctionsfordrugdiscoveryfromexperimentalandcomputergeneratedproteinligandstructurestowardspertargetscoringfunctions AT pilatisebastiano machinelearningscoringfunctionsfordrugdiscoveryfromexperimentalandcomputergeneratedproteinligandstructurestowardspertargetscoringfunctions

Machine Learning Scoring Functions for Drug Discovery from Experimental and Computer-Generated Protein–Ligand Structures: Towards Per-Target Scoring Functions

Ejemplares similares