Cargando…
InChI version 1.06: now more than 99.99% reliable
The software for the IUPAC Chemical Identifier, InChI, is extraordinarily reliable. It has been tested on large databases around the world, and has proved itself to be an essential tool in the handling and integration of large chemical databases. InChI version 1.05 was released in January 2017 and v...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8147039/ https://www.ncbi.nlm.nih.gov/pubmed/34030732 http://dx.doi.org/10.1186/s13321-021-00517-z |
_version_ | 1783697537507524608 |
---|---|
author | Goodman, Jonathan M. Pletnev, Igor Thiessen, Paul Bolton, Evan Heller, Stephen R. |
author_facet | Goodman, Jonathan M. Pletnev, Igor Thiessen, Paul Bolton, Evan Heller, Stephen R. |
author_sort | Goodman, Jonathan M. |
collection | PubMed |
description | The software for the IUPAC Chemical Identifier, InChI, is extraordinarily reliable. It has been tested on large databases around the world, and has proved itself to be an essential tool in the handling and integration of large chemical databases. InChI version 1.05 was released in January 2017 and version 1.06 in December 2020. In this paper, we report on the current state of the InChI Software, the details of the improvements in the v1.06 release, and the results of a test of the InChI run on PubChem, a database of more than a hundred million molecules. The upgrade introduces significant new features, including support for pseudo-element atoms and an improved description of polymers. We expect that few, if any, applications using the standard InChI will need to change as a result of the changes in version 1.06. Numerical instability was discovered for 0.002% of this database, and a small number of other molecules were discovered for which the algorithm did not run smoothly. On the basis of PubChem data, we can demonstrate that InChI version 1.05 was 99.996% accurate, and InChI version 1.06 represents a step closer to perfection. Finally, we look forward to future releases and extensions for the InChI Chemical identifier. |
format | Online Article Text |
id | pubmed-8147039 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-81470392021-05-25 InChI version 1.06: now more than 99.99% reliable Goodman, Jonathan M. Pletnev, Igor Thiessen, Paul Bolton, Evan Heller, Stephen R. J Cheminform Research Article The software for the IUPAC Chemical Identifier, InChI, is extraordinarily reliable. It has been tested on large databases around the world, and has proved itself to be an essential tool in the handling and integration of large chemical databases. InChI version 1.05 was released in January 2017 and version 1.06 in December 2020. In this paper, we report on the current state of the InChI Software, the details of the improvements in the v1.06 release, and the results of a test of the InChI run on PubChem, a database of more than a hundred million molecules. The upgrade introduces significant new features, including support for pseudo-element atoms and an improved description of polymers. We expect that few, if any, applications using the standard InChI will need to change as a result of the changes in version 1.06. Numerical instability was discovered for 0.002% of this database, and a small number of other molecules were discovered for which the algorithm did not run smoothly. On the basis of PubChem data, we can demonstrate that InChI version 1.05 was 99.996% accurate, and InChI version 1.06 represents a step closer to perfection. Finally, we look forward to future releases and extensions for the InChI Chemical identifier. Springer International Publishing 2021-05-24 /pmc/articles/PMC8147039/ /pubmed/34030732 http://dx.doi.org/10.1186/s13321-021-00517-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Goodman, Jonathan M. Pletnev, Igor Thiessen, Paul Bolton, Evan Heller, Stephen R. InChI version 1.06: now more than 99.99% reliable |
title | InChI version 1.06: now more than 99.99% reliable |
title_full | InChI version 1.06: now more than 99.99% reliable |
title_fullStr | InChI version 1.06: now more than 99.99% reliable |
title_full_unstemmed | InChI version 1.06: now more than 99.99% reliable |
title_short | InChI version 1.06: now more than 99.99% reliable |
title_sort | inchi version 1.06: now more than 99.99% reliable |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8147039/ https://www.ncbi.nlm.nih.gov/pubmed/34030732 http://dx.doi.org/10.1186/s13321-021-00517-z |
work_keys_str_mv | AT goodmanjonathanm inchiversion106nowmorethan9999reliable AT pletnevigor inchiversion106nowmorethan9999reliable AT thiessenpaul inchiversion106nowmorethan9999reliable AT boltonevan inchiversion106nowmorethan9999reliable AT hellerstephenr inchiversion106nowmorethan9999reliable |