Cargando…

On the validity versus utility of activity landscapes: are all activity cliffs statistically significant?

BACKGROUND: Most work on the topic of activity landscapes has focused on their quantitative description and visual representation, with the aim of aiding navigation of SAR. Recent developments have addressed applications such as quantifying the proportion of activity cliffs, investigating the predic...

Descripción completa

Detalles Bibliográficos
Autores principales: Guha, Rajarshi, Medina-Franco, José L
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4021161/
https://www.ncbi.nlm.nih.gov/pubmed/24694189
http://dx.doi.org/10.1186/1758-2946-6-11
_version_ 1782316184135270400
author Guha, Rajarshi
Medina-Franco, José L
author_facet Guha, Rajarshi
Medina-Franco, José L
author_sort Guha, Rajarshi
collection PubMed
description BACKGROUND: Most work on the topic of activity landscapes has focused on their quantitative description and visual representation, with the aim of aiding navigation of SAR. Recent developments have addressed applications such as quantifying the proportion of activity cliffs, investigating the predictive abilities of activity landscape methods and so on. However, all these publications have worked under the assumption that the activity landscape models are “real” (i.e., statistically significant). RESULTS: The current study addresses for the first time, in a quantitative manner, the significance of a landscape or individual cliffs in the landscape. In particular, we question whether the activity landscape derived from observed (experimental) activity data is different from a randomly generated landscape. To address this we used the SALI measure with six different data sets tested against one or more molecular targets. We also assessed the significance of the landscapes for single and multiple representations. CONCLUSIONS: We find that non-random landscapes are data set and molecular representation dependent. For the data sets and representations used in this work, our results suggest that not all representations lead to non-random landscapes. This indicates that not all molecular representations should be used to a) interpret the SAR and b) combined to generate consensus models. Our results suggest that significance testing of activity landscape models and in particular, activity cliffs, is key, prior to the use of such models.
format Online
Article
Text
id pubmed-4021161
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40211612014-05-15 On the validity versus utility of activity landscapes: are all activity cliffs statistically significant? Guha, Rajarshi Medina-Franco, José L J Cheminform Research Article BACKGROUND: Most work on the topic of activity landscapes has focused on their quantitative description and visual representation, with the aim of aiding navigation of SAR. Recent developments have addressed applications such as quantifying the proportion of activity cliffs, investigating the predictive abilities of activity landscape methods and so on. However, all these publications have worked under the assumption that the activity landscape models are “real” (i.e., statistically significant). RESULTS: The current study addresses for the first time, in a quantitative manner, the significance of a landscape or individual cliffs in the landscape. In particular, we question whether the activity landscape derived from observed (experimental) activity data is different from a randomly generated landscape. To address this we used the SALI measure with six different data sets tested against one or more molecular targets. We also assessed the significance of the landscapes for single and multiple representations. CONCLUSIONS: We find that non-random landscapes are data set and molecular representation dependent. For the data sets and representations used in this work, our results suggest that not all representations lead to non-random landscapes. This indicates that not all molecular representations should be used to a) interpret the SAR and b) combined to generate consensus models. Our results suggest that significance testing of activity landscape models and in particular, activity cliffs, is key, prior to the use of such models. BioMed Central 2014-04-02 /pmc/articles/PMC4021161/ /pubmed/24694189 http://dx.doi.org/10.1186/1758-2946-6-11 Text en Copyright © 2014 Guha and Medina-Franco; licensee Chemistry Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Guha, Rajarshi
Medina-Franco, José L
On the validity versus utility of activity landscapes: are all activity cliffs statistically significant?
title On the validity versus utility of activity landscapes: are all activity cliffs statistically significant?
title_full On the validity versus utility of activity landscapes: are all activity cliffs statistically significant?
title_fullStr On the validity versus utility of activity landscapes: are all activity cliffs statistically significant?
title_full_unstemmed On the validity versus utility of activity landscapes: are all activity cliffs statistically significant?
title_short On the validity versus utility of activity landscapes: are all activity cliffs statistically significant?
title_sort on the validity versus utility of activity landscapes: are all activity cliffs statistically significant?
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4021161/
https://www.ncbi.nlm.nih.gov/pubmed/24694189
http://dx.doi.org/10.1186/1758-2946-6-11
work_keys_str_mv AT guharajarshi onthevalidityversusutilityofactivitylandscapesareallactivitycliffsstatisticallysignificant
AT medinafrancojosel onthevalidityversusutilityofactivitylandscapesareallactivitycliffsstatisticallysignificant