Cargando…
The CARLSBAD Database: A Confederated Database of Chemical Bioactivities
Many bioactivity databases offer information regarding the biological activity of small molecules on protein targets. Information in these databases is often hard to resolve with certainty because of subsetting different data in a variety of formats; use of different bioactivity metrics; use of diff...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3689437/ https://www.ncbi.nlm.nih.gov/pubmed/23794735 http://dx.doi.org/10.1093/database/bat044 |
_version_ | 1782274256648798208 |
---|---|
author | Mathias, Stephen L. Hines-Kay, Jarrett Yang, Jeremy J. Zahoransky-Kohalmi, Gergely Bologa, Cristian G. Ursu, Oleg Oprea, Tudor I. |
author_facet | Mathias, Stephen L. Hines-Kay, Jarrett Yang, Jeremy J. Zahoransky-Kohalmi, Gergely Bologa, Cristian G. Ursu, Oleg Oprea, Tudor I. |
author_sort | Mathias, Stephen L. |
collection | PubMed |
description | Many bioactivity databases offer information regarding the biological activity of small molecules on protein targets. Information in these databases is often hard to resolve with certainty because of subsetting different data in a variety of formats; use of different bioactivity metrics; use of different identifiers for chemicals and proteins; and having to access different query interfaces, respectively. Given the multitude of data sources, interfaces and standards, it is challenging to gather relevant facts and make appropriate connections and decisions regarding chemical–protein associations. The CARLSBAD database has been developed as an integrated resource, focused on high-quality subsets from several bioactivity databases, which are aggregated and presented in a uniform manner, suitable for the study of the relationships between small molecules and targets. In contrast to data collection resources, CARLSBAD provides a single normalized activity value of a given type for each unique chemical–protein target pair. Two types of scaffold perception methods have been implemented and are available for datamining: HierS (hierarchical scaffolds) and MCES (maximum common edge subgraph). The 2012 release of CARLSBAD contains 439 985 unique chemical structures, mapped onto 1,420 889 unique bioactivities, and annotated with 277 140 HierS scaffolds and 54 135 MCES chemical patterns, respectively. Of the 890 323 unique structure–target pairs curated in CARLSBAD, 13.95% are aggregated from multiple structure–target values: 94 975 are aggregated from two bioactivities, 14 544 from three, 7 930 from four and 2214 have five bioactivities, respectively. CARLSBAD captures bioactivities and tags for 1435 unique chemical structures of active pharmaceutical ingredients (i.e. ‘drugs’). CARLSBAD processing resulted in a net 17.3% data reduction for chemicals, 34.3% reduction for bioactivities, 23% reduction for HierS and 25% reduction for MCES, respectively. The CARLSBAD database supports a knowledge mining system that provides non-specialists with novel integrative ways of exploring chemical biology space to facilitate knowledge mining in drug discovery and repurposing. Database URL: http://carlsbad.health.unm.edu/carlsbad/. |
format | Online Article Text |
id | pubmed-3689437 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-36894372013-06-24 The CARLSBAD Database: A Confederated Database of Chemical Bioactivities Mathias, Stephen L. Hines-Kay, Jarrett Yang, Jeremy J. Zahoransky-Kohalmi, Gergely Bologa, Cristian G. Ursu, Oleg Oprea, Tudor I. Database (Oxford) Original Article Many bioactivity databases offer information regarding the biological activity of small molecules on protein targets. Information in these databases is often hard to resolve with certainty because of subsetting different data in a variety of formats; use of different bioactivity metrics; use of different identifiers for chemicals and proteins; and having to access different query interfaces, respectively. Given the multitude of data sources, interfaces and standards, it is challenging to gather relevant facts and make appropriate connections and decisions regarding chemical–protein associations. The CARLSBAD database has been developed as an integrated resource, focused on high-quality subsets from several bioactivity databases, which are aggregated and presented in a uniform manner, suitable for the study of the relationships between small molecules and targets. In contrast to data collection resources, CARLSBAD provides a single normalized activity value of a given type for each unique chemical–protein target pair. Two types of scaffold perception methods have been implemented and are available for datamining: HierS (hierarchical scaffolds) and MCES (maximum common edge subgraph). The 2012 release of CARLSBAD contains 439 985 unique chemical structures, mapped onto 1,420 889 unique bioactivities, and annotated with 277 140 HierS scaffolds and 54 135 MCES chemical patterns, respectively. Of the 890 323 unique structure–target pairs curated in CARLSBAD, 13.95% are aggregated from multiple structure–target values: 94 975 are aggregated from two bioactivities, 14 544 from three, 7 930 from four and 2214 have five bioactivities, respectively. CARLSBAD captures bioactivities and tags for 1435 unique chemical structures of active pharmaceutical ingredients (i.e. ‘drugs’). CARLSBAD processing resulted in a net 17.3% data reduction for chemicals, 34.3% reduction for bioactivities, 23% reduction for HierS and 25% reduction for MCES, respectively. The CARLSBAD database supports a knowledge mining system that provides non-specialists with novel integrative ways of exploring chemical biology space to facilitate knowledge mining in drug discovery and repurposing. Database URL: http://carlsbad.health.unm.edu/carlsbad/. Oxford University Press 2013-06-21 /pmc/articles/PMC3689437/ /pubmed/23794735 http://dx.doi.org/10.1093/database/bat044 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Mathias, Stephen L. Hines-Kay, Jarrett Yang, Jeremy J. Zahoransky-Kohalmi, Gergely Bologa, Cristian G. Ursu, Oleg Oprea, Tudor I. The CARLSBAD Database: A Confederated Database of Chemical Bioactivities |
title | The CARLSBAD Database: A Confederated Database of Chemical Bioactivities |
title_full | The CARLSBAD Database: A Confederated Database of Chemical Bioactivities |
title_fullStr | The CARLSBAD Database: A Confederated Database of Chemical Bioactivities |
title_full_unstemmed | The CARLSBAD Database: A Confederated Database of Chemical Bioactivities |
title_short | The CARLSBAD Database: A Confederated Database of Chemical Bioactivities |
title_sort | carlsbad database: a confederated database of chemical bioactivities |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3689437/ https://www.ncbi.nlm.nih.gov/pubmed/23794735 http://dx.doi.org/10.1093/database/bat044 |
work_keys_str_mv | AT mathiasstephenl thecarlsbaddatabaseaconfederateddatabaseofchemicalbioactivities AT hineskayjarrett thecarlsbaddatabaseaconfederateddatabaseofchemicalbioactivities AT yangjeremyj thecarlsbaddatabaseaconfederateddatabaseofchemicalbioactivities AT zahoranskykohalmigergely thecarlsbaddatabaseaconfederateddatabaseofchemicalbioactivities AT bologacristiang thecarlsbaddatabaseaconfederateddatabaseofchemicalbioactivities AT ursuoleg thecarlsbaddatabaseaconfederateddatabaseofchemicalbioactivities AT opreatudori thecarlsbaddatabaseaconfederateddatabaseofchemicalbioactivities AT mathiasstephenl carlsbaddatabaseaconfederateddatabaseofchemicalbioactivities AT hineskayjarrett carlsbaddatabaseaconfederateddatabaseofchemicalbioactivities AT yangjeremyj carlsbaddatabaseaconfederateddatabaseofchemicalbioactivities AT zahoranskykohalmigergely carlsbaddatabaseaconfederateddatabaseofchemicalbioactivities AT bologacristiang carlsbaddatabaseaconfederateddatabaseofchemicalbioactivities AT ursuoleg carlsbaddatabaseaconfederateddatabaseofchemicalbioactivities AT opreatudori carlsbaddatabaseaconfederateddatabaseofchemicalbioactivities |