Cargando…

Unique identifiers for small molecules enable rigorous labeling of their atoms

Rigorous characterization of small organic molecules in terms of their structural and biological properties is vital to biomedical research. The three-dimensional structure of a molecule, its ‘photo ID’, is inefficient for searching and matching tasks. Instead, identifiers play a key role in accessi...

Descripción completa

Detalles Bibliográficos
Autores principales: Dashti, Hesam, Westler, William M., Markley, John L., Eghbalnia, Hamid R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5441290/
https://www.ncbi.nlm.nih.gov/pubmed/28534867
http://dx.doi.org/10.1038/sdata.2017.73
_version_ 1783238236753100800
author Dashti, Hesam
Westler, William M.
Markley, John L.
Eghbalnia, Hamid R.
author_facet Dashti, Hesam
Westler, William M.
Markley, John L.
Eghbalnia, Hamid R.
author_sort Dashti, Hesam
collection PubMed
description Rigorous characterization of small organic molecules in terms of their structural and biological properties is vital to biomedical research. The three-dimensional structure of a molecule, its ‘photo ID’, is inefficient for searching and matching tasks. Instead, identifiers play a key role in accessing compound data. Unique and reproducible molecule and atom identifiers are required to ensure the correct cross-referencing of properties associated with compounds archived in databases. The best approach to this requirement is the International Chemical Identifier (InChI). However, the current implementation of InChI fails to provide a complete standard for atom nomenclature, and incorrect use of the InChI standard has resulted in the proliferation of non-unique identifiers. We propose a methodology and associated software tools, named ALATIS, that overcomes these shortcomings. ALATIS is an adaptation of InChI, which operates fully within the InChI convention to provide unique and reproducible molecule and all atom identifiers. ALATIS includes an InChI extension for unique atom labeling of symmetric molecules. ALATIS forms the basis for improving reproducibility and unifying cross-referencing across databases.
format Online
Article
Text
id pubmed-5441290
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-54412902017-06-07 Unique identifiers for small molecules enable rigorous labeling of their atoms Dashti, Hesam Westler, William M. Markley, John L. Eghbalnia, Hamid R. Sci Data Article Rigorous characterization of small organic molecules in terms of their structural and biological properties is vital to biomedical research. The three-dimensional structure of a molecule, its ‘photo ID’, is inefficient for searching and matching tasks. Instead, identifiers play a key role in accessing compound data. Unique and reproducible molecule and atom identifiers are required to ensure the correct cross-referencing of properties associated with compounds archived in databases. The best approach to this requirement is the International Chemical Identifier (InChI). However, the current implementation of InChI fails to provide a complete standard for atom nomenclature, and incorrect use of the InChI standard has resulted in the proliferation of non-unique identifiers. We propose a methodology and associated software tools, named ALATIS, that overcomes these shortcomings. ALATIS is an adaptation of InChI, which operates fully within the InChI convention to provide unique and reproducible molecule and all atom identifiers. ALATIS includes an InChI extension for unique atom labeling of symmetric molecules. ALATIS forms the basis for improving reproducibility and unifying cross-referencing across databases. Nature Publishing Group 2017-05-23 /pmc/articles/PMC5441290/ /pubmed/28534867 http://dx.doi.org/10.1038/sdata.2017.73 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Dashti, Hesam
Westler, William M.
Markley, John L.
Eghbalnia, Hamid R.
Unique identifiers for small molecules enable rigorous labeling of their atoms
title Unique identifiers for small molecules enable rigorous labeling of their atoms
title_full Unique identifiers for small molecules enable rigorous labeling of their atoms
title_fullStr Unique identifiers for small molecules enable rigorous labeling of their atoms
title_full_unstemmed Unique identifiers for small molecules enable rigorous labeling of their atoms
title_short Unique identifiers for small molecules enable rigorous labeling of their atoms
title_sort unique identifiers for small molecules enable rigorous labeling of their atoms
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5441290/
https://www.ncbi.nlm.nih.gov/pubmed/28534867
http://dx.doi.org/10.1038/sdata.2017.73
work_keys_str_mv AT dashtihesam uniqueidentifiersforsmallmoleculesenablerigorouslabelingoftheiratoms
AT westlerwilliamm uniqueidentifiersforsmallmoleculesenablerigorouslabelingoftheiratoms
AT markleyjohnl uniqueidentifiersforsmallmoleculesenablerigorouslabelingoftheiratoms
AT eghbalniahamidr uniqueidentifiersforsmallmoleculesenablerigorouslabelingoftheiratoms