Cargando…

Building an R&D chemical registration system

Small molecule chemistry is of central importance to a number of R&D companies in diverse areas such as the pharmaceutical, nutraceutical, food flavoring, and cosmeceutical industries. In order to store and manage thousands of chemical compounds in such an environment, we have built a state-of-t...

Descripción completa

Detalles Bibliográficos
Autores principales: Martin, Elyette, Monge, Aurélien, Duret, Jacques-Antoine, Gualandi, Federico, Peitsch, Manuel C, Pospisil, Pavel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3430593/
https://www.ncbi.nlm.nih.gov/pubmed/22650418
http://dx.doi.org/10.1186/1758-2946-4-11
_version_ 1782241958366806016
author Martin, Elyette
Monge, Aurélien
Duret, Jacques-Antoine
Gualandi, Federico
Peitsch, Manuel C
Pospisil, Pavel
author_facet Martin, Elyette
Monge, Aurélien
Duret, Jacques-Antoine
Gualandi, Federico
Peitsch, Manuel C
Pospisil, Pavel
author_sort Martin, Elyette
collection PubMed
description Small molecule chemistry is of central importance to a number of R&D companies in diverse areas such as the pharmaceutical, nutraceutical, food flavoring, and cosmeceutical industries. In order to store and manage thousands of chemical compounds in such an environment, we have built a state-of-the-art master chemical database with unique structure identifiers. Here, we present the concept and methodology we used to build the system that we call the Unique Compound Database (UCD). In the UCD, each molecule is registered only once (uniqueness), structures with alternative representations are entered in a uniform way (normalization), and the chemical structure drawings are recognizable to chemists and to a cartridge. In brief, structural molecules are entered as neutral entities which can be associated with a salt. The salts are listed in a dictionary and bound to the molecule with the appropriate stoichiometric coefficient in an entity called “substance”. The substances are associated with batches. Once a molecule is registered, some properties (e.g., ADMET prediction, IUPAC name, chemical properties) are calculated automatically. The UCD has both automated and manual data controls. Moreover, the UCD concept enables the management of user errors in the structure entry by reassigning or archiving the batches. It also allows updating of the records to include newly discovered properties of individual structures. As our research spans a wide variety of scientific fields, the database enables registration of mixtures of compounds, enantiomers, tautomers, and compounds with unknown stereochemistries.
format Online
Article
Text
id pubmed-3430593
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34305932012-08-30 Building an R&D chemical registration system Martin, Elyette Monge, Aurélien Duret, Jacques-Antoine Gualandi, Federico Peitsch, Manuel C Pospisil, Pavel J Cheminform Methodology Small molecule chemistry is of central importance to a number of R&D companies in diverse areas such as the pharmaceutical, nutraceutical, food flavoring, and cosmeceutical industries. In order to store and manage thousands of chemical compounds in such an environment, we have built a state-of-the-art master chemical database with unique structure identifiers. Here, we present the concept and methodology we used to build the system that we call the Unique Compound Database (UCD). In the UCD, each molecule is registered only once (uniqueness), structures with alternative representations are entered in a uniform way (normalization), and the chemical structure drawings are recognizable to chemists and to a cartridge. In brief, structural molecules are entered as neutral entities which can be associated with a salt. The salts are listed in a dictionary and bound to the molecule with the appropriate stoichiometric coefficient in an entity called “substance”. The substances are associated with batches. Once a molecule is registered, some properties (e.g., ADMET prediction, IUPAC name, chemical properties) are calculated automatically. The UCD has both automated and manual data controls. Moreover, the UCD concept enables the management of user errors in the structure entry by reassigning or archiving the batches. It also allows updating of the records to include newly discovered properties of individual structures. As our research spans a wide variety of scientific fields, the database enables registration of mixtures of compounds, enantiomers, tautomers, and compounds with unknown stereochemistries. BioMed Central 2012-05-31 /pmc/articles/PMC3430593/ /pubmed/22650418 http://dx.doi.org/10.1186/1758-2946-4-11 Text en Copyright ©2012 Martin et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology
Martin, Elyette
Monge, Aurélien
Duret, Jacques-Antoine
Gualandi, Federico
Peitsch, Manuel C
Pospisil, Pavel
Building an R&D chemical registration system
title Building an R&D chemical registration system
title_full Building an R&D chemical registration system
title_fullStr Building an R&D chemical registration system
title_full_unstemmed Building an R&D chemical registration system
title_short Building an R&D chemical registration system
title_sort building an r&d chemical registration system
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3430593/
https://www.ncbi.nlm.nih.gov/pubmed/22650418
http://dx.doi.org/10.1186/1758-2946-4-11
work_keys_str_mv AT martinelyette buildinganrdchemicalregistrationsystem
AT mongeaurelien buildinganrdchemicalregistrationsystem
AT duretjacquesantoine buildinganrdchemicalregistrationsystem
AT gualandifederico buildinganrdchemicalregistrationsystem
AT peitschmanuelc buildinganrdchemicalregistrationsystem
AT pospisilpavel buildinganrdchemicalregistrationsystem