Cargando…
Nonadditivity in public and inhouse data: implications for drug design
Numerous ligand-based drug discovery projects are based on structure-activity relationship (SAR) analysis, such as Free-Wilson (FW) or matched molecular pair (MMP) analysis. Intrinsically they assume linearity and additivity of substituent contributions. These techniques are challenged by nonadditiv...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8254291/ https://www.ncbi.nlm.nih.gov/pubmed/34215341 http://dx.doi.org/10.1186/s13321-021-00525-z |
_version_ | 1783717698952232960 |
---|---|
author | Gogishvili, D. Nittinger, E. Margreitter, C. Tyrchan, C. |
author_facet | Gogishvili, D. Nittinger, E. Margreitter, C. Tyrchan, C. |
author_sort | Gogishvili, D. |
collection | PubMed |
description | Numerous ligand-based drug discovery projects are based on structure-activity relationship (SAR) analysis, such as Free-Wilson (FW) or matched molecular pair (MMP) analysis. Intrinsically they assume linearity and additivity of substituent contributions. These techniques are challenged by nonadditivity (NA) in protein–ligand binding where the change of two functional groups in one molecule results in much higher or lower activity than expected from the respective single changes. Identifying nonlinear cases and possible underlying explanations is crucial for a drug design project since it might influence which lead to follow. By systematically analyzing all AstraZeneca (AZ) inhouse compound data and publicly available ChEMBL25 bioactivity data, we show significant NA events in almost every second assay among the inhouse and once in every third assay in public data sets. Furthermore, 9.4% of all compounds of the AZ database and 5.1% from public sources display significant additivity shifts indicating important SAR features or fundamental measurement errors. Using NA data in combination with machine learning showed that nonadditive data is challenging to predict and even the addition of nonadditive data into training did not result in an increase in predictivity. Overall, NA analysis should be applied on a regular basis in many areas of computational chemistry and can further improve rational drug design. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-021-00525-z. |
format | Online Article Text |
id | pubmed-8254291 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-82542912021-07-06 Nonadditivity in public and inhouse data: implications for drug design Gogishvili, D. Nittinger, E. Margreitter, C. Tyrchan, C. J Cheminform Research Article Numerous ligand-based drug discovery projects are based on structure-activity relationship (SAR) analysis, such as Free-Wilson (FW) or matched molecular pair (MMP) analysis. Intrinsically they assume linearity and additivity of substituent contributions. These techniques are challenged by nonadditivity (NA) in protein–ligand binding where the change of two functional groups in one molecule results in much higher or lower activity than expected from the respective single changes. Identifying nonlinear cases and possible underlying explanations is crucial for a drug design project since it might influence which lead to follow. By systematically analyzing all AstraZeneca (AZ) inhouse compound data and publicly available ChEMBL25 bioactivity data, we show significant NA events in almost every second assay among the inhouse and once in every third assay in public data sets. Furthermore, 9.4% of all compounds of the AZ database and 5.1% from public sources display significant additivity shifts indicating important SAR features or fundamental measurement errors. Using NA data in combination with machine learning showed that nonadditive data is challenging to predict and even the addition of nonadditive data into training did not result in an increase in predictivity. Overall, NA analysis should be applied on a regular basis in many areas of computational chemistry and can further improve rational drug design. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-021-00525-z. Springer International Publishing 2021-07-02 /pmc/articles/PMC8254291/ /pubmed/34215341 http://dx.doi.org/10.1186/s13321-021-00525-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Gogishvili, D. Nittinger, E. Margreitter, C. Tyrchan, C. Nonadditivity in public and inhouse data: implications for drug design |
title | Nonadditivity in public and inhouse data: implications for drug design |
title_full | Nonadditivity in public and inhouse data: implications for drug design |
title_fullStr | Nonadditivity in public and inhouse data: implications for drug design |
title_full_unstemmed | Nonadditivity in public and inhouse data: implications for drug design |
title_short | Nonadditivity in public and inhouse data: implications for drug design |
title_sort | nonadditivity in public and inhouse data: implications for drug design |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8254291/ https://www.ncbi.nlm.nih.gov/pubmed/34215341 http://dx.doi.org/10.1186/s13321-021-00525-z |
work_keys_str_mv | AT gogishvilid nonadditivityinpublicandinhousedataimplicationsfordrugdesign AT nittingere nonadditivityinpublicandinhousedataimplicationsfordrugdesign AT margreitterc nonadditivityinpublicandinhousedataimplicationsfordrugdesign AT tyrchanc nonadditivityinpublicandinhousedataimplicationsfordrugdesign |