Cargando…

Large-Scale Predictions of Compound Potency with Original and Modified Activity Classes Reveal General Prediction Characteristics and Intrinsic Limitations of Conventional Benchmarking Calculations

Predicting compound potency is a major task in computational medicinal chemistry, for which machine learning is often applied. This study systematically predicted compound potency values for 367 target-based compound activity classes from medicinal chemistry using a preferred machine learning approa...

Descripción completa

Detalles Bibliográficos
Autores principales: Janela, Tiago, Bajorath, Jürgen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10143224/
https://www.ncbi.nlm.nih.gov/pubmed/37111287
http://dx.doi.org/10.3390/ph16040530
_version_ 1785033800883896320
author Janela, Tiago
Bajorath, Jürgen
author_facet Janela, Tiago
Bajorath, Jürgen
author_sort Janela, Tiago
collection PubMed
description Predicting compound potency is a major task in computational medicinal chemistry, for which machine learning is often applied. This study systematically predicted compound potency values for 367 target-based compound activity classes from medicinal chemistry using a preferred machine learning approach and simple control methods. The predictions produced unexpectedly similar results for different classes and comparably high accuracy for machine learning and simple control models. Based on these findings, the influence of different data set modifications on relative prediction accuracies was explored, including potency range balancing, removal of nearest neighbors, and analog series-based compound partitioning. The predictions were surprisingly resistant to these modifications, leading to only small error margin increases. These findings also show that conventional benchmark settings are unsuitable for directly comparing potency prediction methods.
format Online
Article
Text
id pubmed-10143224
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-101432242023-04-29 Large-Scale Predictions of Compound Potency with Original and Modified Activity Classes Reveal General Prediction Characteristics and Intrinsic Limitations of Conventional Benchmarking Calculations Janela, Tiago Bajorath, Jürgen Pharmaceuticals (Basel) Article Predicting compound potency is a major task in computational medicinal chemistry, for which machine learning is often applied. This study systematically predicted compound potency values for 367 target-based compound activity classes from medicinal chemistry using a preferred machine learning approach and simple control methods. The predictions produced unexpectedly similar results for different classes and comparably high accuracy for machine learning and simple control models. Based on these findings, the influence of different data set modifications on relative prediction accuracies was explored, including potency range balancing, removal of nearest neighbors, and analog series-based compound partitioning. The predictions were surprisingly resistant to these modifications, leading to only small error margin increases. These findings also show that conventional benchmark settings are unsuitable for directly comparing potency prediction methods. MDPI 2023-04-02 /pmc/articles/PMC10143224/ /pubmed/37111287 http://dx.doi.org/10.3390/ph16040530 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Janela, Tiago
Bajorath, Jürgen
Large-Scale Predictions of Compound Potency with Original and Modified Activity Classes Reveal General Prediction Characteristics and Intrinsic Limitations of Conventional Benchmarking Calculations
title Large-Scale Predictions of Compound Potency with Original and Modified Activity Classes Reveal General Prediction Characteristics and Intrinsic Limitations of Conventional Benchmarking Calculations
title_full Large-Scale Predictions of Compound Potency with Original and Modified Activity Classes Reveal General Prediction Characteristics and Intrinsic Limitations of Conventional Benchmarking Calculations
title_fullStr Large-Scale Predictions of Compound Potency with Original and Modified Activity Classes Reveal General Prediction Characteristics and Intrinsic Limitations of Conventional Benchmarking Calculations
title_full_unstemmed Large-Scale Predictions of Compound Potency with Original and Modified Activity Classes Reveal General Prediction Characteristics and Intrinsic Limitations of Conventional Benchmarking Calculations
title_short Large-Scale Predictions of Compound Potency with Original and Modified Activity Classes Reveal General Prediction Characteristics and Intrinsic Limitations of Conventional Benchmarking Calculations
title_sort large-scale predictions of compound potency with original and modified activity classes reveal general prediction characteristics and intrinsic limitations of conventional benchmarking calculations
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10143224/
https://www.ncbi.nlm.nih.gov/pubmed/37111287
http://dx.doi.org/10.3390/ph16040530
work_keys_str_mv AT janelatiago largescalepredictionsofcompoundpotencywithoriginalandmodifiedactivityclassesrevealgeneralpredictioncharacteristicsandintrinsiclimitationsofconventionalbenchmarkingcalculations
AT bajorathjurgen largescalepredictionsofcompoundpotencywithoriginalandmodifiedactivityclassesrevealgeneralpredictioncharacteristicsandintrinsiclimitationsofconventionalbenchmarkingcalculations