Cargando…

The Impact of Data on Structure-Based Binding Affinity Predictions Using Deep Neural Networks

Artificial intelligence (AI) has gained significant traction in the field of drug discovery, with deep learning (DL) algorithms playing a crucial role in predicting protein–ligand binding affinities. Despite advancements in neural network architectures, system representation, and training techniques...

Descripción completa

Detalles Bibliográficos
Autores principales:	Libouban, Pierre-Yves, Aci-Sèche, Samia, Gómez-Tamayo, Jose Carlos, Tresadern, Gary, Bonnet, Pascal
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10671244/ https://www.ncbi.nlm.nih.gov/pubmed/38003312 http://dx.doi.org/10.3390/ijms242216120

_version_	1785140108776701952
author	Libouban, Pierre-Yves Aci-Sèche, Samia Gómez-Tamayo, Jose Carlos Tresadern, Gary Bonnet, Pascal
author_facet	Libouban, Pierre-Yves Aci-Sèche, Samia Gómez-Tamayo, Jose Carlos Tresadern, Gary Bonnet, Pascal
author_sort	Libouban, Pierre-Yves
collection	PubMed
description	Artificial intelligence (AI) has gained significant traction in the field of drug discovery, with deep learning (DL) algorithms playing a crucial role in predicting protein–ligand binding affinities. Despite advancements in neural network architectures, system representation, and training techniques, the performance of DL affinity prediction has reached a plateau, prompting the question of whether it is truly solved or if the current performance is overly optimistic and reliant on biased, easily predictable data. Like other DL-related problems, this issue seems to stem from the training and test sets used when building the models. In this work, we investigate the impact of several parameters related to the input data on the performance of neural network affinity prediction models. Notably, we identify the size of the binding pocket as a critical factor influencing the performance of our statistical models; furthermore, it is more important to train a model with as much data as possible than to restrict the training to only high-quality datasets. Finally, we also confirm the bias in the typically used current test sets. Therefore, several types of evaluation and benchmarking are required to understand models’ decision-making processes and accurately compare the performance of models.
format	Online Article Text
id	pubmed-10671244
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-106712442023-11-09 The Impact of Data on Structure-Based Binding Affinity Predictions Using Deep Neural Networks Libouban, Pierre-Yves Aci-Sèche, Samia Gómez-Tamayo, Jose Carlos Tresadern, Gary Bonnet, Pascal Int J Mol Sci Article Artificial intelligence (AI) has gained significant traction in the field of drug discovery, with deep learning (DL) algorithms playing a crucial role in predicting protein–ligand binding affinities. Despite advancements in neural network architectures, system representation, and training techniques, the performance of DL affinity prediction has reached a plateau, prompting the question of whether it is truly solved or if the current performance is overly optimistic and reliant on biased, easily predictable data. Like other DL-related problems, this issue seems to stem from the training and test sets used when building the models. In this work, we investigate the impact of several parameters related to the input data on the performance of neural network affinity prediction models. Notably, we identify the size of the binding pocket as a critical factor influencing the performance of our statistical models; furthermore, it is more important to train a model with as much data as possible than to restrict the training to only high-quality datasets. Finally, we also confirm the bias in the typically used current test sets. Therefore, several types of evaluation and benchmarking are required to understand models’ decision-making processes and accurately compare the performance of models. MDPI 2023-11-09 /pmc/articles/PMC10671244/ /pubmed/38003312 http://dx.doi.org/10.3390/ijms242216120 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Libouban, Pierre-Yves Aci-Sèche, Samia Gómez-Tamayo, Jose Carlos Tresadern, Gary Bonnet, Pascal The Impact of Data on Structure-Based Binding Affinity Predictions Using Deep Neural Networks
title	The Impact of Data on Structure-Based Binding Affinity Predictions Using Deep Neural Networks
title_full	The Impact of Data on Structure-Based Binding Affinity Predictions Using Deep Neural Networks
title_fullStr	The Impact of Data on Structure-Based Binding Affinity Predictions Using Deep Neural Networks
title_full_unstemmed	The Impact of Data on Structure-Based Binding Affinity Predictions Using Deep Neural Networks
title_short	The Impact of Data on Structure-Based Binding Affinity Predictions Using Deep Neural Networks
title_sort	impact of data on structure-based binding affinity predictions using deep neural networks
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10671244/ https://www.ncbi.nlm.nih.gov/pubmed/38003312 http://dx.doi.org/10.3390/ijms242216120
work_keys_str_mv	AT liboubanpierreyves theimpactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks AT acisechesamia theimpactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks AT gomeztamayojosecarlos theimpactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks AT tresaderngary theimpactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks AT bonnetpascal theimpactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks AT liboubanpierreyves impactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks AT acisechesamia impactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks AT gomeztamayojosecarlos impactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks AT tresaderngary impactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks AT bonnetpascal impactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks

The Impact of Data on Structure-Based Binding Affinity Predictions Using Deep Neural Networks

Ejemplares similares