Cargando…

UniChem: a unified chemical structure cross-referencing and identifier tracking system

UniChem is a freely available compound identifier mapping service on the internet, designed to optimize the efficiency with which structure-based hyperlinks may be built and maintained between chemistry-based resources. In the past, the creation and maintenance of such links at EMBL-EBI, where sever...

Descripción completa

Detalles Bibliográficos
Autores principales: Chambers, Jon, Davies, Mark, Gaulton, Anna, Hersey, Anne, Velankar, Sameer, Petryszak, Robert, Hastings, Janna, Bellis, Louisa, McGlinchey, Shaun, Overington, John P
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3616875/
https://www.ncbi.nlm.nih.gov/pubmed/23317286
http://dx.doi.org/10.1186/1758-2946-5-3
_version_ 1782265180973957120
author Chambers, Jon
Davies, Mark
Gaulton, Anna
Hersey, Anne
Velankar, Sameer
Petryszak, Robert
Hastings, Janna
Bellis, Louisa
McGlinchey, Shaun
Overington, John P
author_facet Chambers, Jon
Davies, Mark
Gaulton, Anna
Hersey, Anne
Velankar, Sameer
Petryszak, Robert
Hastings, Janna
Bellis, Louisa
McGlinchey, Shaun
Overington, John P
author_sort Chambers, Jon
collection PubMed
description UniChem is a freely available compound identifier mapping service on the internet, designed to optimize the efficiency with which structure-based hyperlinks may be built and maintained between chemistry-based resources. In the past, the creation and maintenance of such links at EMBL-EBI, where several chemistry-based resources exist, has required independent efforts by each of the separate teams. These efforts were complicated by the different data models, release schedules, and differing business rules for compound normalization and identifier nomenclature that exist across the organization. UniChem, a large-scale, non-redundant database of Standard InChIs with pointers between these structures and chemical identifiers from all the separate chemistry resources, was developed as a means of efficiently sharing the maintenance overhead of creating these links. Thus, for each source represented in UniChem, all links to and from all other sources are automatically calculated and immediately available for all to use. Updated mappings are immediately available upon loading of new data releases from the sources. Web services in UniChem provide users with a single simple automatable mechanism for maintaining all links from their resource to all other sources represented in UniChem. In addition, functionality to track changes in identifier usage allows users to monitor which identifiers are current, and which are obsolete. Lastly, UniChem has been deliberately designed to allow additional resources to be included with minimal effort. Indeed, the recent inclusion of data sources external to EMBL-EBI has provided a simple means of providing users with an even wider selection of resources with which to link to, all at no extra cost, while at the same time providing a simple mechanism for external resources to link to all EMBL-EBI chemistry resources.
format Online
Article
Text
id pubmed-3616875
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36168752013-04-05 UniChem: a unified chemical structure cross-referencing and identifier tracking system Chambers, Jon Davies, Mark Gaulton, Anna Hersey, Anne Velankar, Sameer Petryszak, Robert Hastings, Janna Bellis, Louisa McGlinchey, Shaun Overington, John P J Cheminform Database UniChem is a freely available compound identifier mapping service on the internet, designed to optimize the efficiency with which structure-based hyperlinks may be built and maintained between chemistry-based resources. In the past, the creation and maintenance of such links at EMBL-EBI, where several chemistry-based resources exist, has required independent efforts by each of the separate teams. These efforts were complicated by the different data models, release schedules, and differing business rules for compound normalization and identifier nomenclature that exist across the organization. UniChem, a large-scale, non-redundant database of Standard InChIs with pointers between these structures and chemical identifiers from all the separate chemistry resources, was developed as a means of efficiently sharing the maintenance overhead of creating these links. Thus, for each source represented in UniChem, all links to and from all other sources are automatically calculated and immediately available for all to use. Updated mappings are immediately available upon loading of new data releases from the sources. Web services in UniChem provide users with a single simple automatable mechanism for maintaining all links from their resource to all other sources represented in UniChem. In addition, functionality to track changes in identifier usage allows users to monitor which identifiers are current, and which are obsolete. Lastly, UniChem has been deliberately designed to allow additional resources to be included with minimal effort. Indeed, the recent inclusion of data sources external to EMBL-EBI has provided a simple means of providing users with an even wider selection of resources with which to link to, all at no extra cost, while at the same time providing a simple mechanism for external resources to link to all EMBL-EBI chemistry resources. BioMed Central 2013-01-14 /pmc/articles/PMC3616875/ /pubmed/23317286 http://dx.doi.org/10.1186/1758-2946-5-3 Text en Copyright © 2013 Chambers et al; licensee Chemistry Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database
Chambers, Jon
Davies, Mark
Gaulton, Anna
Hersey, Anne
Velankar, Sameer
Petryszak, Robert
Hastings, Janna
Bellis, Louisa
McGlinchey, Shaun
Overington, John P
UniChem: a unified chemical structure cross-referencing and identifier tracking system
title UniChem: a unified chemical structure cross-referencing and identifier tracking system
title_full UniChem: a unified chemical structure cross-referencing and identifier tracking system
title_fullStr UniChem: a unified chemical structure cross-referencing and identifier tracking system
title_full_unstemmed UniChem: a unified chemical structure cross-referencing and identifier tracking system
title_short UniChem: a unified chemical structure cross-referencing and identifier tracking system
title_sort unichem: a unified chemical structure cross-referencing and identifier tracking system
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3616875/
https://www.ncbi.nlm.nih.gov/pubmed/23317286
http://dx.doi.org/10.1186/1758-2946-5-3
work_keys_str_mv AT chambersjon unichemaunifiedchemicalstructurecrossreferencingandidentifiertrackingsystem
AT daviesmark unichemaunifiedchemicalstructurecrossreferencingandidentifiertrackingsystem
AT gaultonanna unichemaunifiedchemicalstructurecrossreferencingandidentifiertrackingsystem
AT herseyanne unichemaunifiedchemicalstructurecrossreferencingandidentifiertrackingsystem
AT velankarsameer unichemaunifiedchemicalstructurecrossreferencingandidentifiertrackingsystem
AT petryszakrobert unichemaunifiedchemicalstructurecrossreferencingandidentifiertrackingsystem
AT hastingsjanna unichemaunifiedchemicalstructurecrossreferencingandidentifiertrackingsystem
AT bellislouisa unichemaunifiedchemicalstructurecrossreferencingandidentifiertrackingsystem
AT mcglincheyshaun unichemaunifiedchemicalstructurecrossreferencingandidentifiertrackingsystem
AT overingtonjohnp unichemaunifiedchemicalstructurecrossreferencingandidentifiertrackingsystem