Cargando…
SARConnect: A Tool to Interrogate the Connectivity Between Proteins, Chemical Structures and Activity Data
The access and use of large-scale structure-activity relationships (SAR) is increasing as the range of targets and availability of bioactive compound-to-protein mappings expands. However, effective exploitation requires merging and normalisation of activity data, mappings to target classifications a...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
WILEY-VCH Verlag
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3535785/ https://www.ncbi.nlm.nih.gov/pubmed/23308082 http://dx.doi.org/10.1002/minf.201200030 |
_version_ | 1782254712304697344 |
---|---|
author | Eriksson, Mats Nilsson, Ingemar Kogej, Thierry Southan, Christopher Johansson, Martin Tyrchan, Christian Muresan, Sorel Blomberg, Niklas Bjäreland, Marcus |
author_facet | Eriksson, Mats Nilsson, Ingemar Kogej, Thierry Southan, Christopher Johansson, Martin Tyrchan, Christian Muresan, Sorel Blomberg, Niklas Bjäreland, Marcus |
author_sort | Eriksson, Mats |
collection | PubMed |
description | The access and use of large-scale structure-activity relationships (SAR) is increasing as the range of targets and availability of bioactive compound-to-protein mappings expands. However, effective exploitation requires merging and normalisation of activity data, mappings to target classifications as well as visual display of chemical structure relationships. This work describes the development of the application “SARConnect” to address these issues. We discuss options for delivery and analysis of large-scale SAR data together with a set of use-cases to illustrate the design choices and utility. The main activity sources of ChEMBL,1 GOSTAR2 and AstraZeneca’s internal system IBIS, had already been integrated in Chemistry Connect.3 For target relationships we selected human UniProtKB/Swiss-Prot4 as our primary source of a heuristic target classification. Similarly, to explore chemical relationships we combined several methods for framework and scaffold analysis into a unified, hierarchical classification where ease of navigation was the primary goal. An application was built on TIBCO Spotfire to retrieve data for visual display. Consequently, users can explore relationships between target, activity and structure across internal, external and commercial sources that encompass approximately 3 million compounds, 2000 human proteins and 10 million activity values. Examples showing the utility of the application are given. |
format | Online Article Text |
id | pubmed-3535785 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | WILEY-VCH Verlag |
record_format | MEDLINE/PubMed |
spelling | pubmed-35357852013-01-08 SARConnect: A Tool to Interrogate the Connectivity Between Proteins, Chemical Structures and Activity Data Eriksson, Mats Nilsson, Ingemar Kogej, Thierry Southan, Christopher Johansson, Martin Tyrchan, Christian Muresan, Sorel Blomberg, Niklas Bjäreland, Marcus Mol Inform Full Papers The access and use of large-scale structure-activity relationships (SAR) is increasing as the range of targets and availability of bioactive compound-to-protein mappings expands. However, effective exploitation requires merging and normalisation of activity data, mappings to target classifications as well as visual display of chemical structure relationships. This work describes the development of the application “SARConnect” to address these issues. We discuss options for delivery and analysis of large-scale SAR data together with a set of use-cases to illustrate the design choices and utility. The main activity sources of ChEMBL,1 GOSTAR2 and AstraZeneca’s internal system IBIS, had already been integrated in Chemistry Connect.3 For target relationships we selected human UniProtKB/Swiss-Prot4 as our primary source of a heuristic target classification. Similarly, to explore chemical relationships we combined several methods for framework and scaffold analysis into a unified, hierarchical classification where ease of navigation was the primary goal. An application was built on TIBCO Spotfire to retrieve data for visual display. Consequently, users can explore relationships between target, activity and structure across internal, external and commercial sources that encompass approximately 3 million compounds, 2000 human proteins and 10 million activity values. Examples showing the utility of the application are given. WILEY-VCH Verlag 2012-08 2012-08-07 /pmc/articles/PMC3535785/ /pubmed/23308082 http://dx.doi.org/10.1002/minf.201200030 Text en Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim http://creativecommons.org/licenses/by/2.5/ Re-use of this article is permitted in accordance with the Creative Commons Deed, Attribution 2.5, which does not permit commercial exploitation. |
spellingShingle | Full Papers Eriksson, Mats Nilsson, Ingemar Kogej, Thierry Southan, Christopher Johansson, Martin Tyrchan, Christian Muresan, Sorel Blomberg, Niklas Bjäreland, Marcus SARConnect: A Tool to Interrogate the Connectivity Between Proteins, Chemical Structures and Activity Data |
title | SARConnect: A Tool to Interrogate the Connectivity Between Proteins,
Chemical Structures and Activity Data |
title_full | SARConnect: A Tool to Interrogate the Connectivity Between Proteins,
Chemical Structures and Activity Data |
title_fullStr | SARConnect: A Tool to Interrogate the Connectivity Between Proteins,
Chemical Structures and Activity Data |
title_full_unstemmed | SARConnect: A Tool to Interrogate the Connectivity Between Proteins,
Chemical Structures and Activity Data |
title_short | SARConnect: A Tool to Interrogate the Connectivity Between Proteins,
Chemical Structures and Activity Data |
title_sort | sarconnect: a tool to interrogate the connectivity between proteins,
chemical structures and activity data |
topic | Full Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3535785/ https://www.ncbi.nlm.nih.gov/pubmed/23308082 http://dx.doi.org/10.1002/minf.201200030 |
work_keys_str_mv | AT erikssonmats sarconnectatooltointerrogatetheconnectivitybetweenproteinschemicalstructuresandactivitydata AT nilssoningemar sarconnectatooltointerrogatetheconnectivitybetweenproteinschemicalstructuresandactivitydata AT kogejthierry sarconnectatooltointerrogatetheconnectivitybetweenproteinschemicalstructuresandactivitydata AT southanchristopher sarconnectatooltointerrogatetheconnectivitybetweenproteinschemicalstructuresandactivitydata AT johanssonmartin sarconnectatooltointerrogatetheconnectivitybetweenproteinschemicalstructuresandactivitydata AT tyrchanchristian sarconnectatooltointerrogatetheconnectivitybetweenproteinschemicalstructuresandactivitydata AT muresansorel sarconnectatooltointerrogatetheconnectivitybetweenproteinschemicalstructuresandactivitydata AT blombergniklas sarconnectatooltointerrogatetheconnectivitybetweenproteinschemicalstructuresandactivitydata AT bjarelandmarcus sarconnectatooltointerrogatetheconnectivitybetweenproteinschemicalstructuresandactivitydata |