Cargando…

Nonadditivity in public and inhouse data: implications for drug design

Numerous ligand-based drug discovery projects are based on structure-activity relationship (SAR) analysis, such as Free-Wilson (FW) or matched molecular pair (MMP) analysis. Intrinsically they assume linearity and additivity of substituent contributions. These techniques are challenged by nonadditiv...

Descripción completa

Detalles Bibliográficos
Autores principales: Gogishvili, D., Nittinger, E., Margreitter, C., Tyrchan, C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8254291/
https://www.ncbi.nlm.nih.gov/pubmed/34215341
http://dx.doi.org/10.1186/s13321-021-00525-z
_version_ 1783717698952232960
author Gogishvili, D.
Nittinger, E.
Margreitter, C.
Tyrchan, C.
author_facet Gogishvili, D.
Nittinger, E.
Margreitter, C.
Tyrchan, C.
author_sort Gogishvili, D.
collection PubMed
description Numerous ligand-based drug discovery projects are based on structure-activity relationship (SAR) analysis, such as Free-Wilson (FW) or matched molecular pair (MMP) analysis. Intrinsically they assume linearity and additivity of substituent contributions. These techniques are challenged by nonadditivity (NA) in protein–ligand binding where the change of two functional groups in one molecule results in much higher or lower activity than expected from the respective single changes. Identifying nonlinear cases and possible underlying explanations is crucial for a drug design project since it might influence which lead to follow. By systematically analyzing all AstraZeneca (AZ) inhouse compound data and publicly available ChEMBL25 bioactivity data, we show significant NA events in almost every second assay among the inhouse and once in every third assay in public data sets. Furthermore, 9.4% of all compounds of the AZ database and 5.1% from public sources display significant additivity shifts indicating important SAR features or fundamental measurement errors. Using NA data in combination with machine learning showed that nonadditive data is challenging to predict and even the addition of nonadditive data into training did not result in an increase in predictivity. Overall, NA analysis should be applied on a regular basis in many areas of computational chemistry and can further improve rational drug design. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-021-00525-z.
format Online
Article
Text
id pubmed-8254291
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-82542912021-07-06 Nonadditivity in public and inhouse data: implications for drug design Gogishvili, D. Nittinger, E. Margreitter, C. Tyrchan, C. J Cheminform Research Article Numerous ligand-based drug discovery projects are based on structure-activity relationship (SAR) analysis, such as Free-Wilson (FW) or matched molecular pair (MMP) analysis. Intrinsically they assume linearity and additivity of substituent contributions. These techniques are challenged by nonadditivity (NA) in protein–ligand binding where the change of two functional groups in one molecule results in much higher or lower activity than expected from the respective single changes. Identifying nonlinear cases and possible underlying explanations is crucial for a drug design project since it might influence which lead to follow. By systematically analyzing all AstraZeneca (AZ) inhouse compound data and publicly available ChEMBL25 bioactivity data, we show significant NA events in almost every second assay among the inhouse and once in every third assay in public data sets. Furthermore, 9.4% of all compounds of the AZ database and 5.1% from public sources display significant additivity shifts indicating important SAR features or fundamental measurement errors. Using NA data in combination with machine learning showed that nonadditive data is challenging to predict and even the addition of nonadditive data into training did not result in an increase in predictivity. Overall, NA analysis should be applied on a regular basis in many areas of computational chemistry and can further improve rational drug design. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-021-00525-z. Springer International Publishing 2021-07-02 /pmc/articles/PMC8254291/ /pubmed/34215341 http://dx.doi.org/10.1186/s13321-021-00525-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Gogishvili, D.
Nittinger, E.
Margreitter, C.
Tyrchan, C.
Nonadditivity in public and inhouse data: implications for drug design
title Nonadditivity in public and inhouse data: implications for drug design
title_full Nonadditivity in public and inhouse data: implications for drug design
title_fullStr Nonadditivity in public and inhouse data: implications for drug design
title_full_unstemmed Nonadditivity in public and inhouse data: implications for drug design
title_short Nonadditivity in public and inhouse data: implications for drug design
title_sort nonadditivity in public and inhouse data: implications for drug design
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8254291/
https://www.ncbi.nlm.nih.gov/pubmed/34215341
http://dx.doi.org/10.1186/s13321-021-00525-z
work_keys_str_mv AT gogishvilid nonadditivityinpublicandinhousedataimplicationsfordrugdesign
AT nittingere nonadditivityinpublicandinhousedataimplicationsfordrugdesign
AT margreitterc nonadditivityinpublicandinhousedataimplicationsfordrugdesign
AT tyrchanc nonadditivityinpublicandinhousedataimplicationsfordrugdesign