Cargando…

How Far Could We Go with Open Data – A Case Study for TRPV1 Antagonists

Publicly open databases of small compounds have become an indispensable tool for chemoinformaticians for collection and preparation of datasets suitable for drug discovery questions. Since these databases comprise compounds coming from structure-activity relationship (SAR) studies performed by diffe...

Descripción completa

Detalles Bibliográficos
Autores principales: Tsareva, Daria A, Ecker, Gerhard F
Formato: Online Artículo Texto
Lenguaje:English
Publicado: WILEY-VCH Verlag 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3743172/
https://www.ncbi.nlm.nih.gov/pubmed/23956804
http://dx.doi.org/10.1002/minf.201300019
_version_ 1782280456079671296
author Tsareva, Daria A
Ecker, Gerhard F
author_facet Tsareva, Daria A
Ecker, Gerhard F
author_sort Tsareva, Daria A
collection PubMed
description Publicly open databases of small compounds have become an indispensable tool for chemoinformaticians for collection and preparation of datasets suitable for drug discovery questions. Since these databases comprise compounds coming from structure-activity relationship (SAR) studies performed by different research groups, they are very diverse with respect to the biological assays used. In the present study we analyzed the applicability of a thoroughly curated dataset gathered from open sources for ligand-based studies, using the transient receptor potential vanilloid type 1 (TRPV1) as use case. Thorough curation of compounds according to the biological assay type and conditions led to a dataset of comparable bioactive chemicals. Subsequent exhaustive analysis of the obtained dataset using classification algorithms demonstrated that the models obtained in most of the cases possess reliable quality. Analysis of constantly misclassified compounds showed that they belong to local SAR series, where small changes in structure lead to different class labels. These small structural differences could not be captured by the classification algorithms. However application of the 3D alignment-independent QSAR technique GRIND for local, structurally related series overcomes this problem.
format Online
Article
Text
id pubmed-3743172
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher WILEY-VCH Verlag
record_format MEDLINE/PubMed
spelling pubmed-37431722013-08-15 How Far Could We Go with Open Data – A Case Study for TRPV1 Antagonists Tsareva, Daria A Ecker, Gerhard F Mol Inform Full Papers Publicly open databases of small compounds have become an indispensable tool for chemoinformaticians for collection and preparation of datasets suitable for drug discovery questions. Since these databases comprise compounds coming from structure-activity relationship (SAR) studies performed by different research groups, they are very diverse with respect to the biological assays used. In the present study we analyzed the applicability of a thoroughly curated dataset gathered from open sources for ligand-based studies, using the transient receptor potential vanilloid type 1 (TRPV1) as use case. Thorough curation of compounds according to the biological assay type and conditions led to a dataset of comparable bioactive chemicals. Subsequent exhaustive analysis of the obtained dataset using classification algorithms demonstrated that the models obtained in most of the cases possess reliable quality. Analysis of constantly misclassified compounds showed that they belong to local SAR series, where small changes in structure lead to different class labels. These small structural differences could not be captured by the classification algorithms. However application of the 3D alignment-independent QSAR technique GRIND for local, structurally related series overcomes this problem. WILEY-VCH Verlag 2013-06 2013-06-18 /pmc/articles/PMC3743172/ /pubmed/23956804 http://dx.doi.org/10.1002/minf.201300019 Text en © 2013 The Authors. Published by Wiley-VCH Verlag GmbH & Co KGaA http://creativecommons.org/licenses/by/2.5/ Re-use of this article is permitted in accordance with the Creative Commons Deed, Attribution 2.5, which does not permit commercial exploitation.
spellingShingle Full Papers
Tsareva, Daria A
Ecker, Gerhard F
How Far Could We Go with Open Data – A Case Study for TRPV1 Antagonists
title How Far Could We Go with Open Data – A Case Study for TRPV1 Antagonists
title_full How Far Could We Go with Open Data – A Case Study for TRPV1 Antagonists
title_fullStr How Far Could We Go with Open Data – A Case Study for TRPV1 Antagonists
title_full_unstemmed How Far Could We Go with Open Data – A Case Study for TRPV1 Antagonists
title_short How Far Could We Go with Open Data – A Case Study for TRPV1 Antagonists
title_sort how far could we go with open data – a case study for trpv1 antagonists
topic Full Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3743172/
https://www.ncbi.nlm.nih.gov/pubmed/23956804
http://dx.doi.org/10.1002/minf.201300019
work_keys_str_mv AT tsarevadariaa howfarcouldwegowithopendataacasestudyfortrpv1antagonists
AT eckergerhardf howfarcouldwegowithopendataacasestudyfortrpv1antagonists