Cargando…

Activity, assay and target data curation and quality in the ChEMBL database

The emergence of a number of publicly available bioactivity databases, such as ChEMBL, PubChem BioAssay and BindingDB, has raised awareness about the topics of data curation, quality and integrity. Here we provide an overview and discussion of the current and future approaches to activity, assay and...

Descripción completa

Detalles Bibliográficos
Autores principales: Papadatos, George, Gaulton, Anna, Hersey, Anne, Overington, John P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4607714/
https://www.ncbi.nlm.nih.gov/pubmed/26201396
http://dx.doi.org/10.1007/s10822-015-9860-5
_version_ 1782395546295599104
author Papadatos, George
Gaulton, Anna
Hersey, Anne
Overington, John P.
author_facet Papadatos, George
Gaulton, Anna
Hersey, Anne
Overington, John P.
author_sort Papadatos, George
collection PubMed
description The emergence of a number of publicly available bioactivity databases, such as ChEMBL, PubChem BioAssay and BindingDB, has raised awareness about the topics of data curation, quality and integrity. Here we provide an overview and discussion of the current and future approaches to activity, assay and target data curation of the ChEMBL database. This curation process involves several manual and automated steps and aims to: (1) maximise data accessibility and comparability; (2) improve data integrity and flag outliers, ambiguities and potential errors; and (3) add further curated annotations and mappings thus increasing the usefulness and accuracy of the ChEMBL data for all users and modellers in particular. Issues related to activity, assay and target data curation and integrity along with their potential impact for users of the data are discussed, alongside robust selection and filter strategies in order to avoid or minimise these, depending on the desired application.
format Online
Article
Text
id pubmed-4607714
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-46077142015-10-20 Activity, assay and target data curation and quality in the ChEMBL database Papadatos, George Gaulton, Anna Hersey, Anne Overington, John P. J Comput Aided Mol Des Article The emergence of a number of publicly available bioactivity databases, such as ChEMBL, PubChem BioAssay and BindingDB, has raised awareness about the topics of data curation, quality and integrity. Here we provide an overview and discussion of the current and future approaches to activity, assay and target data curation of the ChEMBL database. This curation process involves several manual and automated steps and aims to: (1) maximise data accessibility and comparability; (2) improve data integrity and flag outliers, ambiguities and potential errors; and (3) add further curated annotations and mappings thus increasing the usefulness and accuracy of the ChEMBL data for all users and modellers in particular. Issues related to activity, assay and target data curation and integrity along with their potential impact for users of the data are discussed, alongside robust selection and filter strategies in order to avoid or minimise these, depending on the desired application. Springer International Publishing 2015-07-23 2015 /pmc/articles/PMC4607714/ /pubmed/26201396 http://dx.doi.org/10.1007/s10822-015-9860-5 Text en © The Author(s) 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Article
Papadatos, George
Gaulton, Anna
Hersey, Anne
Overington, John P.
Activity, assay and target data curation and quality in the ChEMBL database
title Activity, assay and target data curation and quality in the ChEMBL database
title_full Activity, assay and target data curation and quality in the ChEMBL database
title_fullStr Activity, assay and target data curation and quality in the ChEMBL database
title_full_unstemmed Activity, assay and target data curation and quality in the ChEMBL database
title_short Activity, assay and target data curation and quality in the ChEMBL database
title_sort activity, assay and target data curation and quality in the chembl database
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4607714/
https://www.ncbi.nlm.nih.gov/pubmed/26201396
http://dx.doi.org/10.1007/s10822-015-9860-5
work_keys_str_mv AT papadatosgeorge activityassayandtargetdatacurationandqualityinthechembldatabase
AT gaultonanna activityassayandtargetdatacurationandqualityinthechembldatabase
AT herseyanne activityassayandtargetdatacurationandqualityinthechembldatabase
AT overingtonjohnp activityassayandtargetdatacurationandqualityinthechembldatabase