Cargando…
How Far Could We Go with Open Data – A Case Study for TRPV1 Antagonists
Publicly open databases of small compounds have become an indispensable tool for chemoinformaticians for collection and preparation of datasets suitable for drug discovery questions. Since these databases comprise compounds coming from structure-activity relationship (SAR) studies performed by diffe...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
WILEY-VCH Verlag
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3743172/ https://www.ncbi.nlm.nih.gov/pubmed/23956804 http://dx.doi.org/10.1002/minf.201300019 |
_version_ | 1782280456079671296 |
---|---|
author | Tsareva, Daria A Ecker, Gerhard F |
author_facet | Tsareva, Daria A Ecker, Gerhard F |
author_sort | Tsareva, Daria A |
collection | PubMed |
description | Publicly open databases of small compounds have become an indispensable tool for chemoinformaticians for collection and preparation of datasets suitable for drug discovery questions. Since these databases comprise compounds coming from structure-activity relationship (SAR) studies performed by different research groups, they are very diverse with respect to the biological assays used. In the present study we analyzed the applicability of a thoroughly curated dataset gathered from open sources for ligand-based studies, using the transient receptor potential vanilloid type 1 (TRPV1) as use case. Thorough curation of compounds according to the biological assay type and conditions led to a dataset of comparable bioactive chemicals. Subsequent exhaustive analysis of the obtained dataset using classification algorithms demonstrated that the models obtained in most of the cases possess reliable quality. Analysis of constantly misclassified compounds showed that they belong to local SAR series, where small changes in structure lead to different class labels. These small structural differences could not be captured by the classification algorithms. However application of the 3D alignment-independent QSAR technique GRIND for local, structurally related series overcomes this problem. |
format | Online Article Text |
id | pubmed-3743172 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | WILEY-VCH Verlag |
record_format | MEDLINE/PubMed |
spelling | pubmed-37431722013-08-15 How Far Could We Go with Open Data – A Case Study for TRPV1 Antagonists Tsareva, Daria A Ecker, Gerhard F Mol Inform Full Papers Publicly open databases of small compounds have become an indispensable tool for chemoinformaticians for collection and preparation of datasets suitable for drug discovery questions. Since these databases comprise compounds coming from structure-activity relationship (SAR) studies performed by different research groups, they are very diverse with respect to the biological assays used. In the present study we analyzed the applicability of a thoroughly curated dataset gathered from open sources for ligand-based studies, using the transient receptor potential vanilloid type 1 (TRPV1) as use case. Thorough curation of compounds according to the biological assay type and conditions led to a dataset of comparable bioactive chemicals. Subsequent exhaustive analysis of the obtained dataset using classification algorithms demonstrated that the models obtained in most of the cases possess reliable quality. Analysis of constantly misclassified compounds showed that they belong to local SAR series, where small changes in structure lead to different class labels. These small structural differences could not be captured by the classification algorithms. However application of the 3D alignment-independent QSAR technique GRIND for local, structurally related series overcomes this problem. WILEY-VCH Verlag 2013-06 2013-06-18 /pmc/articles/PMC3743172/ /pubmed/23956804 http://dx.doi.org/10.1002/minf.201300019 Text en © 2013 The Authors. Published by Wiley-VCH Verlag GmbH & Co KGaA http://creativecommons.org/licenses/by/2.5/ Re-use of this article is permitted in accordance with the Creative Commons Deed, Attribution 2.5, which does not permit commercial exploitation. |
spellingShingle | Full Papers Tsareva, Daria A Ecker, Gerhard F How Far Could We Go with Open Data – A Case Study for TRPV1 Antagonists |
title | How Far Could We Go with Open Data – A Case Study for TRPV1 Antagonists |
title_full | How Far Could We Go with Open Data – A Case Study for TRPV1 Antagonists |
title_fullStr | How Far Could We Go with Open Data – A Case Study for TRPV1 Antagonists |
title_full_unstemmed | How Far Could We Go with Open Data – A Case Study for TRPV1 Antagonists |
title_short | How Far Could We Go with Open Data – A Case Study for TRPV1 Antagonists |
title_sort | how far could we go with open data – a case study for trpv1 antagonists |
topic | Full Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3743172/ https://www.ncbi.nlm.nih.gov/pubmed/23956804 http://dx.doi.org/10.1002/minf.201300019 |
work_keys_str_mv | AT tsarevadariaa howfarcouldwegowithopendataacasestudyfortrpv1antagonists AT eckergerhardf howfarcouldwegowithopendataacasestudyfortrpv1antagonists |