Cargando…
On the validity versus utility of activity landscapes: are all activity cliffs statistically significant?
BACKGROUND: Most work on the topic of activity landscapes has focused on their quantitative description and visual representation, with the aim of aiding navigation of SAR. Recent developments have addressed applications such as quantifying the proportion of activity cliffs, investigating the predic...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4021161/ https://www.ncbi.nlm.nih.gov/pubmed/24694189 http://dx.doi.org/10.1186/1758-2946-6-11 |
_version_ | 1782316184135270400 |
---|---|
author | Guha, Rajarshi Medina-Franco, José L |
author_facet | Guha, Rajarshi Medina-Franco, José L |
author_sort | Guha, Rajarshi |
collection | PubMed |
description | BACKGROUND: Most work on the topic of activity landscapes has focused on their quantitative description and visual representation, with the aim of aiding navigation of SAR. Recent developments have addressed applications such as quantifying the proportion of activity cliffs, investigating the predictive abilities of activity landscape methods and so on. However, all these publications have worked under the assumption that the activity landscape models are “real” (i.e., statistically significant). RESULTS: The current study addresses for the first time, in a quantitative manner, the significance of a landscape or individual cliffs in the landscape. In particular, we question whether the activity landscape derived from observed (experimental) activity data is different from a randomly generated landscape. To address this we used the SALI measure with six different data sets tested against one or more molecular targets. We also assessed the significance of the landscapes for single and multiple representations. CONCLUSIONS: We find that non-random landscapes are data set and molecular representation dependent. For the data sets and representations used in this work, our results suggest that not all representations lead to non-random landscapes. This indicates that not all molecular representations should be used to a) interpret the SAR and b) combined to generate consensus models. Our results suggest that significance testing of activity landscape models and in particular, activity cliffs, is key, prior to the use of such models. |
format | Online Article Text |
id | pubmed-4021161 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40211612014-05-15 On the validity versus utility of activity landscapes: are all activity cliffs statistically significant? Guha, Rajarshi Medina-Franco, José L J Cheminform Research Article BACKGROUND: Most work on the topic of activity landscapes has focused on their quantitative description and visual representation, with the aim of aiding navigation of SAR. Recent developments have addressed applications such as quantifying the proportion of activity cliffs, investigating the predictive abilities of activity landscape methods and so on. However, all these publications have worked under the assumption that the activity landscape models are “real” (i.e., statistically significant). RESULTS: The current study addresses for the first time, in a quantitative manner, the significance of a landscape or individual cliffs in the landscape. In particular, we question whether the activity landscape derived from observed (experimental) activity data is different from a randomly generated landscape. To address this we used the SALI measure with six different data sets tested against one or more molecular targets. We also assessed the significance of the landscapes for single and multiple representations. CONCLUSIONS: We find that non-random landscapes are data set and molecular representation dependent. For the data sets and representations used in this work, our results suggest that not all representations lead to non-random landscapes. This indicates that not all molecular representations should be used to a) interpret the SAR and b) combined to generate consensus models. Our results suggest that significance testing of activity landscape models and in particular, activity cliffs, is key, prior to the use of such models. BioMed Central 2014-04-02 /pmc/articles/PMC4021161/ /pubmed/24694189 http://dx.doi.org/10.1186/1758-2946-6-11 Text en Copyright © 2014 Guha and Medina-Franco; licensee Chemistry Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Guha, Rajarshi Medina-Franco, José L On the validity versus utility of activity landscapes: are all activity cliffs statistically significant? |
title | On the validity versus utility of activity landscapes: are all activity cliffs statistically significant? |
title_full | On the validity versus utility of activity landscapes: are all activity cliffs statistically significant? |
title_fullStr | On the validity versus utility of activity landscapes: are all activity cliffs statistically significant? |
title_full_unstemmed | On the validity versus utility of activity landscapes: are all activity cliffs statistically significant? |
title_short | On the validity versus utility of activity landscapes: are all activity cliffs statistically significant? |
title_sort | on the validity versus utility of activity landscapes: are all activity cliffs statistically significant? |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4021161/ https://www.ncbi.nlm.nih.gov/pubmed/24694189 http://dx.doi.org/10.1186/1758-2946-6-11 |
work_keys_str_mv | AT guharajarshi onthevalidityversusutilityofactivitylandscapesareallactivitycliffsstatisticallysignificant AT medinafrancojosel onthevalidityversusutilityofactivitylandscapesareallactivitycliffsstatisticallysignificant |