Cargando…

ProfhEX: AI-based platform for small molecules liability profiling

Off-target drug interactions are a major reason for candidate failure in the drug discovery process. Anticipating potential drug’s adverse effects in the early stages is necessary to minimize health risks to patients, animal testing, and economical costs. With the constantly increasing size of virtu...

Descripción completa

Detalles Bibliográficos
Autores principales: Lunghini, Filippo, Fava, Anna, Pisapia, Vincenzo, Sacco, Francesco, Iaconis, Daniela, Beccari, Andrea Rosario
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10251600/
https://www.ncbi.nlm.nih.gov/pubmed/37296454
http://dx.doi.org/10.1186/s13321-023-00728-6
_version_ 1785055980765052928
author Lunghini, Filippo
Fava, Anna
Pisapia, Vincenzo
Sacco, Francesco
Iaconis, Daniela
Beccari, Andrea Rosario
author_facet Lunghini, Filippo
Fava, Anna
Pisapia, Vincenzo
Sacco, Francesco
Iaconis, Daniela
Beccari, Andrea Rosario
author_sort Lunghini, Filippo
collection PubMed
description Off-target drug interactions are a major reason for candidate failure in the drug discovery process. Anticipating potential drug’s adverse effects in the early stages is necessary to minimize health risks to patients, animal testing, and economical costs. With the constantly increasing size of virtual screening libraries, AI-driven methods can be exploited as first-tier screening tools to provide liability estimation for drug candidates. In this work we present ProfhEX, an AI-driven suite of 46 OECD-compliant machine learning models that can profile small molecules on 7 relevant liability groups: cardiovascular, central nervous system, gastrointestinal, endocrine, renal, pulmonary and immune system toxicities. Experimental affinity data was collected from public and commercial data sources. The entire chemical space comprised 289′202 activity data for a total of 210′116 unique compounds, spanning over 46 targets with dataset sizes ranging from 819 to 18896. Gradient boosting and random forest algorithms were initially employed and ensembled for the selection of a champion model. Models were validated according to the OECD principles, including robust internal (cross validation, bootstrap, y-scrambling) and external validation. Champion models achieved an average Pearson correlation coefficient of 0.84 (SD of 0.05), an R(2) determination coefficient of 0.68 (SD = 0.1) and a root mean squared error of 0.69 (SD of 0.08). All liability groups showed good hit-detection power with an average enrichment factor at 5% of 13.1 (SD of 4.5) and AUC of 0.92 (SD of 0.05). Benchmarking against already existing tools demonstrated the predictive power of ProfhEX models for large-scale liability profiling. This platform will be further expanded with the inclusion of new targets and through complementary modelling approaches, such as structure and pharmacophore-based models. ProfhEX is freely accessible at the following address: https://profhex.exscalate.eu/. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-023-00728-6.
format Online
Article
Text
id pubmed-10251600
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-102516002023-06-10 ProfhEX: AI-based platform for small molecules liability profiling Lunghini, Filippo Fava, Anna Pisapia, Vincenzo Sacco, Francesco Iaconis, Daniela Beccari, Andrea Rosario J Cheminform Research Off-target drug interactions are a major reason for candidate failure in the drug discovery process. Anticipating potential drug’s adverse effects in the early stages is necessary to minimize health risks to patients, animal testing, and economical costs. With the constantly increasing size of virtual screening libraries, AI-driven methods can be exploited as first-tier screening tools to provide liability estimation for drug candidates. In this work we present ProfhEX, an AI-driven suite of 46 OECD-compliant machine learning models that can profile small molecules on 7 relevant liability groups: cardiovascular, central nervous system, gastrointestinal, endocrine, renal, pulmonary and immune system toxicities. Experimental affinity data was collected from public and commercial data sources. The entire chemical space comprised 289′202 activity data for a total of 210′116 unique compounds, spanning over 46 targets with dataset sizes ranging from 819 to 18896. Gradient boosting and random forest algorithms were initially employed and ensembled for the selection of a champion model. Models were validated according to the OECD principles, including robust internal (cross validation, bootstrap, y-scrambling) and external validation. Champion models achieved an average Pearson correlation coefficient of 0.84 (SD of 0.05), an R(2) determination coefficient of 0.68 (SD = 0.1) and a root mean squared error of 0.69 (SD of 0.08). All liability groups showed good hit-detection power with an average enrichment factor at 5% of 13.1 (SD of 4.5) and AUC of 0.92 (SD of 0.05). Benchmarking against already existing tools demonstrated the predictive power of ProfhEX models for large-scale liability profiling. This platform will be further expanded with the inclusion of new targets and through complementary modelling approaches, such as structure and pharmacophore-based models. ProfhEX is freely accessible at the following address: https://profhex.exscalate.eu/. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-023-00728-6. Springer International Publishing 2023-06-09 /pmc/articles/PMC10251600/ /pubmed/37296454 http://dx.doi.org/10.1186/s13321-023-00728-6 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Lunghini, Filippo
Fava, Anna
Pisapia, Vincenzo
Sacco, Francesco
Iaconis, Daniela
Beccari, Andrea Rosario
ProfhEX: AI-based platform for small molecules liability profiling
title ProfhEX: AI-based platform for small molecules liability profiling
title_full ProfhEX: AI-based platform for small molecules liability profiling
title_fullStr ProfhEX: AI-based platform for small molecules liability profiling
title_full_unstemmed ProfhEX: AI-based platform for small molecules liability profiling
title_short ProfhEX: AI-based platform for small molecules liability profiling
title_sort profhex: ai-based platform for small molecules liability profiling
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10251600/
https://www.ncbi.nlm.nih.gov/pubmed/37296454
http://dx.doi.org/10.1186/s13321-023-00728-6
work_keys_str_mv AT lunghinifilippo profhexaibasedplatformforsmallmoleculesliabilityprofiling
AT favaanna profhexaibasedplatformforsmallmoleculesliabilityprofiling
AT pisapiavincenzo profhexaibasedplatformforsmallmoleculesliabilityprofiling
AT saccofrancesco profhexaibasedplatformforsmallmoleculesliabilityprofiling
AT iaconisdaniela profhexaibasedplatformforsmallmoleculesliabilityprofiling
AT beccariandrearosario profhexaibasedplatformforsmallmoleculesliabilityprofiling