Cargando…

SARConnect: A Tool to Interrogate the Connectivity Between Proteins, Chemical Structures and Activity Data

The access and use of large-scale structure-activity relationships (SAR) is increasing as the range of targets and availability of bioactive compound-to-protein mappings expands. However, effective exploitation requires merging and normalisation of activity data, mappings to target classifications a...

Descripción completa

Detalles Bibliográficos
Autores principales: Eriksson, Mats, Nilsson, Ingemar, Kogej, Thierry, Southan, Christopher, Johansson, Martin, Tyrchan, Christian, Muresan, Sorel, Blomberg, Niklas, Bjäreland, Marcus
Formato: Online Artículo Texto
Lenguaje:English
Publicado: WILEY-VCH Verlag 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3535785/
https://www.ncbi.nlm.nih.gov/pubmed/23308082
http://dx.doi.org/10.1002/minf.201200030
_version_ 1782254712304697344
author Eriksson, Mats
Nilsson, Ingemar
Kogej, Thierry
Southan, Christopher
Johansson, Martin
Tyrchan, Christian
Muresan, Sorel
Blomberg, Niklas
Bjäreland, Marcus
author_facet Eriksson, Mats
Nilsson, Ingemar
Kogej, Thierry
Southan, Christopher
Johansson, Martin
Tyrchan, Christian
Muresan, Sorel
Blomberg, Niklas
Bjäreland, Marcus
author_sort Eriksson, Mats
collection PubMed
description The access and use of large-scale structure-activity relationships (SAR) is increasing as the range of targets and availability of bioactive compound-to-protein mappings expands. However, effective exploitation requires merging and normalisation of activity data, mappings to target classifications as well as visual display of chemical structure relationships. This work describes the development of the application “SARConnect” to address these issues. We discuss options for delivery and analysis of large-scale SAR data together with a set of use-cases to illustrate the design choices and utility. The main activity sources of ChEMBL,1 GOSTAR2 and AstraZeneca’s internal system IBIS, had already been integrated in Chemistry Connect.3 For target relationships we selected human UniProtKB/Swiss-Prot4 as our primary source of a heuristic target classification. Similarly, to explore chemical relationships we combined several methods for framework and scaffold analysis into a unified, hierarchical classification where ease of navigation was the primary goal. An application was built on TIBCO Spotfire to retrieve data for visual display. Consequently, users can explore relationships between target, activity and structure across internal, external and commercial sources that encompass approximately 3 million compounds, 2000 human proteins and 10 million activity values. Examples showing the utility of the application are given.
format Online
Article
Text
id pubmed-3535785
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher WILEY-VCH Verlag
record_format MEDLINE/PubMed
spelling pubmed-35357852013-01-08 SARConnect: A Tool to Interrogate the Connectivity Between Proteins, Chemical Structures and Activity Data Eriksson, Mats Nilsson, Ingemar Kogej, Thierry Southan, Christopher Johansson, Martin Tyrchan, Christian Muresan, Sorel Blomberg, Niklas Bjäreland, Marcus Mol Inform Full Papers The access and use of large-scale structure-activity relationships (SAR) is increasing as the range of targets and availability of bioactive compound-to-protein mappings expands. However, effective exploitation requires merging and normalisation of activity data, mappings to target classifications as well as visual display of chemical structure relationships. This work describes the development of the application “SARConnect” to address these issues. We discuss options for delivery and analysis of large-scale SAR data together with a set of use-cases to illustrate the design choices and utility. The main activity sources of ChEMBL,1 GOSTAR2 and AstraZeneca’s internal system IBIS, had already been integrated in Chemistry Connect.3 For target relationships we selected human UniProtKB/Swiss-Prot4 as our primary source of a heuristic target classification. Similarly, to explore chemical relationships we combined several methods for framework and scaffold analysis into a unified, hierarchical classification where ease of navigation was the primary goal. An application was built on TIBCO Spotfire to retrieve data for visual display. Consequently, users can explore relationships between target, activity and structure across internal, external and commercial sources that encompass approximately 3 million compounds, 2000 human proteins and 10 million activity values. Examples showing the utility of the application are given. WILEY-VCH Verlag 2012-08 2012-08-07 /pmc/articles/PMC3535785/ /pubmed/23308082 http://dx.doi.org/10.1002/minf.201200030 Text en Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim http://creativecommons.org/licenses/by/2.5/ Re-use of this article is permitted in accordance with the Creative Commons Deed, Attribution 2.5, which does not permit commercial exploitation.
spellingShingle Full Papers
Eriksson, Mats
Nilsson, Ingemar
Kogej, Thierry
Southan, Christopher
Johansson, Martin
Tyrchan, Christian
Muresan, Sorel
Blomberg, Niklas
Bjäreland, Marcus
SARConnect: A Tool to Interrogate the Connectivity Between Proteins, Chemical Structures and Activity Data
title SARConnect: A Tool to Interrogate the Connectivity Between Proteins, Chemical Structures and Activity Data
title_full SARConnect: A Tool to Interrogate the Connectivity Between Proteins, Chemical Structures and Activity Data
title_fullStr SARConnect: A Tool to Interrogate the Connectivity Between Proteins, Chemical Structures and Activity Data
title_full_unstemmed SARConnect: A Tool to Interrogate the Connectivity Between Proteins, Chemical Structures and Activity Data
title_short SARConnect: A Tool to Interrogate the Connectivity Between Proteins, Chemical Structures and Activity Data
title_sort sarconnect: a tool to interrogate the connectivity between proteins, chemical structures and activity data
topic Full Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3535785/
https://www.ncbi.nlm.nih.gov/pubmed/23308082
http://dx.doi.org/10.1002/minf.201200030
work_keys_str_mv AT erikssonmats sarconnectatooltointerrogatetheconnectivitybetweenproteinschemicalstructuresandactivitydata
AT nilssoningemar sarconnectatooltointerrogatetheconnectivitybetweenproteinschemicalstructuresandactivitydata
AT kogejthierry sarconnectatooltointerrogatetheconnectivitybetweenproteinschemicalstructuresandactivitydata
AT southanchristopher sarconnectatooltointerrogatetheconnectivitybetweenproteinschemicalstructuresandactivitydata
AT johanssonmartin sarconnectatooltointerrogatetheconnectivitybetweenproteinschemicalstructuresandactivitydata
AT tyrchanchristian sarconnectatooltointerrogatetheconnectivitybetweenproteinschemicalstructuresandactivitydata
AT muresansorel sarconnectatooltointerrogatetheconnectivitybetweenproteinschemicalstructuresandactivitydata
AT blombergniklas sarconnectatooltointerrogatetheconnectivitybetweenproteinschemicalstructuresandactivitydata
AT bjarelandmarcus sarconnectatooltointerrogatetheconnectivitybetweenproteinschemicalstructuresandactivitydata