Cargando…

Comparability of Mixed IC(50) Data – A Statistical Analysis

The biochemical half maximal inhibitory concentration (IC(50)) is the most commonly used metric for on-target activity in lead optimization. It is used to guide lead optimization, build large-scale chemogenomics analysis, off-target activity and toxicity models based on public data. However, the use...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kalliokoski, Tuomo, Kramer, Christian, Vulpetti, Anna, Gedeck, Peter
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2013
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3628986/ https://www.ncbi.nlm.nih.gov/pubmed/23613770 http://dx.doi.org/10.1371/journal.pone.0061007

_version_	1782266494680301568
author	Kalliokoski, Tuomo Kramer, Christian Vulpetti, Anna Gedeck, Peter
author_facet	Kalliokoski, Tuomo Kramer, Christian Vulpetti, Anna Gedeck, Peter
author_sort	Kalliokoski, Tuomo
collection	PubMed
description	The biochemical half maximal inhibitory concentration (IC(50)) is the most commonly used metric for on-target activity in lead optimization. It is used to guide lead optimization, build large-scale chemogenomics analysis, off-target activity and toxicity models based on public data. However, the use of public biochemical IC(50) data is problematic, because they are assay specific and comparable only under certain conditions. For large scale analysis it is not feasible to check each data entry manually and it is very tempting to mix all available IC(50) values from public database even if assay information is not reported. As previously reported for K(i) database analysis, we first analyzed the types of errors, the redundancy and the variability that can be found in ChEMBL IC(50) database. For assessing the variability of IC(50) data independently measured in two different labs at least ten IC(50) data for identical protein-ligand systems against the same target were searched in ChEMBL. As a not sufficient number of cases of this type are available, the variability of IC(50) data was assessed by comparing all pairs of independent IC(50) measurements on identical protein-ligand systems. The standard deviation of IC(50) data is only 25% larger than the standard deviation of K(i) data, suggesting that mixing IC(50) data from different assays, even not knowing assay conditions details, only adds a moderate amount of noise to the overall data. The standard deviation of public ChEMBL IC(50) data, as expected, resulted greater than the standard deviation of in-house intra-laboratory/inter-day IC(50) data. Augmenting mixed public IC(50) data by public K(i) data does not deteriorate the quality of the mixed IC(50) data, if the K(i) is corrected by an offset. For a broad dataset such as ChEMBL database a K(i)- IC(50) conversion factor of 2 was found to be the most reasonable.
format	Online Article Text
id	pubmed-3628986
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-36289862013-04-23 Comparability of Mixed IC(50) Data – A Statistical Analysis Kalliokoski, Tuomo Kramer, Christian Vulpetti, Anna Gedeck, Peter PLoS One Research Article The biochemical half maximal inhibitory concentration (IC(50)) is the most commonly used metric for on-target activity in lead optimization. It is used to guide lead optimization, build large-scale chemogenomics analysis, off-target activity and toxicity models based on public data. However, the use of public biochemical IC(50) data is problematic, because they are assay specific and comparable only under certain conditions. For large scale analysis it is not feasible to check each data entry manually and it is very tempting to mix all available IC(50) values from public database even if assay information is not reported. As previously reported for K(i) database analysis, we first analyzed the types of errors, the redundancy and the variability that can be found in ChEMBL IC(50) database. For assessing the variability of IC(50) data independently measured in two different labs at least ten IC(50) data for identical protein-ligand systems against the same target were searched in ChEMBL. As a not sufficient number of cases of this type are available, the variability of IC(50) data was assessed by comparing all pairs of independent IC(50) measurements on identical protein-ligand systems. The standard deviation of IC(50) data is only 25% larger than the standard deviation of K(i) data, suggesting that mixing IC(50) data from different assays, even not knowing assay conditions details, only adds a moderate amount of noise to the overall data. The standard deviation of public ChEMBL IC(50) data, as expected, resulted greater than the standard deviation of in-house intra-laboratory/inter-day IC(50) data. Augmenting mixed public IC(50) data by public K(i) data does not deteriorate the quality of the mixed IC(50) data, if the K(i) is corrected by an offset. For a broad dataset such as ChEMBL database a K(i)- IC(50) conversion factor of 2 was found to be the most reasonable. Public Library of Science 2013-04-16 /pmc/articles/PMC3628986/ /pubmed/23613770 http://dx.doi.org/10.1371/journal.pone.0061007 Text en © 2013 Kalliokoski et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle	Research Article Kalliokoski, Tuomo Kramer, Christian Vulpetti, Anna Gedeck, Peter Comparability of Mixed IC(50) Data – A Statistical Analysis
title	Comparability of Mixed IC(50) Data – A Statistical Analysis
title_full	Comparability of Mixed IC(50) Data – A Statistical Analysis
title_fullStr	Comparability of Mixed IC(50) Data – A Statistical Analysis
title_full_unstemmed	Comparability of Mixed IC(50) Data – A Statistical Analysis
title_short	Comparability of Mixed IC(50) Data – A Statistical Analysis
title_sort	comparability of mixed ic(50) data – a statistical analysis
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3628986/ https://www.ncbi.nlm.nih.gov/pubmed/23613770 http://dx.doi.org/10.1371/journal.pone.0061007
work_keys_str_mv	AT kalliokoskituomo comparabilityofmixedic50dataastatisticalanalysis AT kramerchristian comparabilityofmixedic50dataastatisticalanalysis AT vulpettianna comparabilityofmixedic50dataastatisticalanalysis AT gedeckpeter comparabilityofmixedic50dataastatisticalanalysis

Comparability of Mixed IC(50) Data – A Statistical Analysis

Ejemplares similares