Cargando…
ProfhEX: AI-based platform for small molecules liability profiling
Off-target drug interactions are a major reason for candidate failure in the drug discovery process. Anticipating potential drug’s adverse effects in the early stages is necessary to minimize health risks to patients, animal testing, and economical costs. With the constantly increasing size of virtu...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10251600/ https://www.ncbi.nlm.nih.gov/pubmed/37296454 http://dx.doi.org/10.1186/s13321-023-00728-6 |
_version_ | 1785055980765052928 |
---|---|
author | Lunghini, Filippo Fava, Anna Pisapia, Vincenzo Sacco, Francesco Iaconis, Daniela Beccari, Andrea Rosario |
author_facet | Lunghini, Filippo Fava, Anna Pisapia, Vincenzo Sacco, Francesco Iaconis, Daniela Beccari, Andrea Rosario |
author_sort | Lunghini, Filippo |
collection | PubMed |
description | Off-target drug interactions are a major reason for candidate failure in the drug discovery process. Anticipating potential drug’s adverse effects in the early stages is necessary to minimize health risks to patients, animal testing, and economical costs. With the constantly increasing size of virtual screening libraries, AI-driven methods can be exploited as first-tier screening tools to provide liability estimation for drug candidates. In this work we present ProfhEX, an AI-driven suite of 46 OECD-compliant machine learning models that can profile small molecules on 7 relevant liability groups: cardiovascular, central nervous system, gastrointestinal, endocrine, renal, pulmonary and immune system toxicities. Experimental affinity data was collected from public and commercial data sources. The entire chemical space comprised 289′202 activity data for a total of 210′116 unique compounds, spanning over 46 targets with dataset sizes ranging from 819 to 18896. Gradient boosting and random forest algorithms were initially employed and ensembled for the selection of a champion model. Models were validated according to the OECD principles, including robust internal (cross validation, bootstrap, y-scrambling) and external validation. Champion models achieved an average Pearson correlation coefficient of 0.84 (SD of 0.05), an R(2) determination coefficient of 0.68 (SD = 0.1) and a root mean squared error of 0.69 (SD of 0.08). All liability groups showed good hit-detection power with an average enrichment factor at 5% of 13.1 (SD of 4.5) and AUC of 0.92 (SD of 0.05). Benchmarking against already existing tools demonstrated the predictive power of ProfhEX models for large-scale liability profiling. This platform will be further expanded with the inclusion of new targets and through complementary modelling approaches, such as structure and pharmacophore-based models. ProfhEX is freely accessible at the following address: https://profhex.exscalate.eu/. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-023-00728-6. |
format | Online Article Text |
id | pubmed-10251600 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-102516002023-06-10 ProfhEX: AI-based platform for small molecules liability profiling Lunghini, Filippo Fava, Anna Pisapia, Vincenzo Sacco, Francesco Iaconis, Daniela Beccari, Andrea Rosario J Cheminform Research Off-target drug interactions are a major reason for candidate failure in the drug discovery process. Anticipating potential drug’s adverse effects in the early stages is necessary to minimize health risks to patients, animal testing, and economical costs. With the constantly increasing size of virtual screening libraries, AI-driven methods can be exploited as first-tier screening tools to provide liability estimation for drug candidates. In this work we present ProfhEX, an AI-driven suite of 46 OECD-compliant machine learning models that can profile small molecules on 7 relevant liability groups: cardiovascular, central nervous system, gastrointestinal, endocrine, renal, pulmonary and immune system toxicities. Experimental affinity data was collected from public and commercial data sources. The entire chemical space comprised 289′202 activity data for a total of 210′116 unique compounds, spanning over 46 targets with dataset sizes ranging from 819 to 18896. Gradient boosting and random forest algorithms were initially employed and ensembled for the selection of a champion model. Models were validated according to the OECD principles, including robust internal (cross validation, bootstrap, y-scrambling) and external validation. Champion models achieved an average Pearson correlation coefficient of 0.84 (SD of 0.05), an R(2) determination coefficient of 0.68 (SD = 0.1) and a root mean squared error of 0.69 (SD of 0.08). All liability groups showed good hit-detection power with an average enrichment factor at 5% of 13.1 (SD of 4.5) and AUC of 0.92 (SD of 0.05). Benchmarking against already existing tools demonstrated the predictive power of ProfhEX models for large-scale liability profiling. This platform will be further expanded with the inclusion of new targets and through complementary modelling approaches, such as structure and pharmacophore-based models. ProfhEX is freely accessible at the following address: https://profhex.exscalate.eu/. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-023-00728-6. Springer International Publishing 2023-06-09 /pmc/articles/PMC10251600/ /pubmed/37296454 http://dx.doi.org/10.1186/s13321-023-00728-6 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Lunghini, Filippo Fava, Anna Pisapia, Vincenzo Sacco, Francesco Iaconis, Daniela Beccari, Andrea Rosario ProfhEX: AI-based platform for small molecules liability profiling |
title | ProfhEX: AI-based platform for small molecules liability profiling |
title_full | ProfhEX: AI-based platform for small molecules liability profiling |
title_fullStr | ProfhEX: AI-based platform for small molecules liability profiling |
title_full_unstemmed | ProfhEX: AI-based platform for small molecules liability profiling |
title_short | ProfhEX: AI-based platform for small molecules liability profiling |
title_sort | profhex: ai-based platform for small molecules liability profiling |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10251600/ https://www.ncbi.nlm.nih.gov/pubmed/37296454 http://dx.doi.org/10.1186/s13321-023-00728-6 |
work_keys_str_mv | AT lunghinifilippo profhexaibasedplatformforsmallmoleculesliabilityprofiling AT favaanna profhexaibasedplatformforsmallmoleculesliabilityprofiling AT pisapiavincenzo profhexaibasedplatformforsmallmoleculesliabilityprofiling AT saccofrancesco profhexaibasedplatformforsmallmoleculesliabilityprofiling AT iaconisdaniela profhexaibasedplatformforsmallmoleculesliabilityprofiling AT beccariandrearosario profhexaibasedplatformforsmallmoleculesliabilityprofiling |