Cargando…

The Impact of Data on Structure-Based Binding Affinity Predictions Using Deep Neural Networks

Artificial intelligence (AI) has gained significant traction in the field of drug discovery, with deep learning (DL) algorithms playing a crucial role in predicting protein–ligand binding affinities. Despite advancements in neural network architectures, system representation, and training techniques...

Descripción completa

Detalles Bibliográficos
Autores principales: Libouban, Pierre-Yves, Aci-Sèche, Samia, Gómez-Tamayo, Jose Carlos, Tresadern, Gary, Bonnet, Pascal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10671244/
https://www.ncbi.nlm.nih.gov/pubmed/38003312
http://dx.doi.org/10.3390/ijms242216120
_version_ 1785140108776701952
author Libouban, Pierre-Yves
Aci-Sèche, Samia
Gómez-Tamayo, Jose Carlos
Tresadern, Gary
Bonnet, Pascal
author_facet Libouban, Pierre-Yves
Aci-Sèche, Samia
Gómez-Tamayo, Jose Carlos
Tresadern, Gary
Bonnet, Pascal
author_sort Libouban, Pierre-Yves
collection PubMed
description Artificial intelligence (AI) has gained significant traction in the field of drug discovery, with deep learning (DL) algorithms playing a crucial role in predicting protein–ligand binding affinities. Despite advancements in neural network architectures, system representation, and training techniques, the performance of DL affinity prediction has reached a plateau, prompting the question of whether it is truly solved or if the current performance is overly optimistic and reliant on biased, easily predictable data. Like other DL-related problems, this issue seems to stem from the training and test sets used when building the models. In this work, we investigate the impact of several parameters related to the input data on the performance of neural network affinity prediction models. Notably, we identify the size of the binding pocket as a critical factor influencing the performance of our statistical models; furthermore, it is more important to train a model with as much data as possible than to restrict the training to only high-quality datasets. Finally, we also confirm the bias in the typically used current test sets. Therefore, several types of evaluation and benchmarking are required to understand models’ decision-making processes and accurately compare the performance of models.
format Online
Article
Text
id pubmed-10671244
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106712442023-11-09 The Impact of Data on Structure-Based Binding Affinity Predictions Using Deep Neural Networks Libouban, Pierre-Yves Aci-Sèche, Samia Gómez-Tamayo, Jose Carlos Tresadern, Gary Bonnet, Pascal Int J Mol Sci Article Artificial intelligence (AI) has gained significant traction in the field of drug discovery, with deep learning (DL) algorithms playing a crucial role in predicting protein–ligand binding affinities. Despite advancements in neural network architectures, system representation, and training techniques, the performance of DL affinity prediction has reached a plateau, prompting the question of whether it is truly solved or if the current performance is overly optimistic and reliant on biased, easily predictable data. Like other DL-related problems, this issue seems to stem from the training and test sets used when building the models. In this work, we investigate the impact of several parameters related to the input data on the performance of neural network affinity prediction models. Notably, we identify the size of the binding pocket as a critical factor influencing the performance of our statistical models; furthermore, it is more important to train a model with as much data as possible than to restrict the training to only high-quality datasets. Finally, we also confirm the bias in the typically used current test sets. Therefore, several types of evaluation and benchmarking are required to understand models’ decision-making processes and accurately compare the performance of models. MDPI 2023-11-09 /pmc/articles/PMC10671244/ /pubmed/38003312 http://dx.doi.org/10.3390/ijms242216120 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Libouban, Pierre-Yves
Aci-Sèche, Samia
Gómez-Tamayo, Jose Carlos
Tresadern, Gary
Bonnet, Pascal
The Impact of Data on Structure-Based Binding Affinity Predictions Using Deep Neural Networks
title The Impact of Data on Structure-Based Binding Affinity Predictions Using Deep Neural Networks
title_full The Impact of Data on Structure-Based Binding Affinity Predictions Using Deep Neural Networks
title_fullStr The Impact of Data on Structure-Based Binding Affinity Predictions Using Deep Neural Networks
title_full_unstemmed The Impact of Data on Structure-Based Binding Affinity Predictions Using Deep Neural Networks
title_short The Impact of Data on Structure-Based Binding Affinity Predictions Using Deep Neural Networks
title_sort impact of data on structure-based binding affinity predictions using deep neural networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10671244/
https://www.ncbi.nlm.nih.gov/pubmed/38003312
http://dx.doi.org/10.3390/ijms242216120
work_keys_str_mv AT liboubanpierreyves theimpactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks
AT acisechesamia theimpactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks
AT gomeztamayojosecarlos theimpactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks
AT tresaderngary theimpactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks
AT bonnetpascal theimpactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks
AT liboubanpierreyves impactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks
AT acisechesamia impactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks
AT gomeztamayojosecarlos impactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks
AT tresaderngary impactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks
AT bonnetpascal impactofdataonstructurebasedbindingaffinitypredictionsusingdeepneuralnetworks