Graph isomorphism-based algorithm for cross-checking chemical and crystallographic descriptions


Bibliographic Details
Main Authors: Merkys, Andrius; Vaitkus, Antanas; Grybauskas, Algirdas; Konovalovas, Aleksandras; Quirós, Miguel; Gražulis, Saulius
Format: Online Article Text
Language: English
Published: Springer International Publishing 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9948373/
https://www.ncbi.nlm.nih.gov/pubmed/36814296
http://dx.doi.org/10.1186/s13321-023-00692-1
Description
Summary: Published reports of chemical compounds often contain multiple machine-readable descriptions which may supplement each other in order to yield coherent and complete chemical representations. This publication presents a method to cross-check such descriptions using a canonical representation and isomorphism of molecular graphs. If immediate agreement between compound descriptions is not found, the algorithm derives the minimal set of simplifications required for both descriptions to arrive at a matching form (if any). The proposed algorithm is used to cross-check chemical descriptions from the Crystallography Open Database to identify coherently described entries as well as those requiring further curation. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-023-00692-1.
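
The idea in the summary, checking two descriptions for graph isomorphism and, failing that, searching for the smallest set of simplifications that makes them match, can be illustrated with a minimal Python sketch. This is not the authors' implementation. The molecule encoding, the two simplifications ("ignore bond orders", "ignore hydrogens"), and all function names are assumptions made for illustration. The isomorphism test is a brute-force permutation search suitable only for very small graphs, and it stands in for the canonical representation mentioned in the summary.

```python
from itertools import combinations, permutations

# Hypothetical encoding: a molecule is (atoms, bonds), where atoms is a list of
# element symbols and bonds maps frozenset({i, j}) -> bond order.

def drop_bond_orders(mol):
    """Simplification (assumed): treat every bond as a plain connection."""
    atoms, bonds = mol
    return atoms, {pair: 1 for pair in bonds}

def drop_hydrogens(mol):
    """Simplification (assumed): remove hydrogen atoms and their bonds."""
    atoms, bonds = mol
    keep = [i for i, element in enumerate(atoms) if element != "H"]
    new_index = {old: new for new, old in enumerate(keep)}
    new_bonds = {
        frozenset(new_index[i] for i in pair): order
        for pair, order in bonds.items()
        if all(i in new_index for i in pair)
    }
    return [atoms[i] for i in keep], new_bonds

SIMPLIFICATIONS = {
    "ignore bond orders": drop_bond_orders,
    "ignore hydrogens": drop_hydrogens,
}

def isomorphic(a, b):
    """Brute-force labelled-graph isomorphism; exponential, small graphs only."""
    atoms_a, bonds_a = a
    atoms_b, bonds_b = b
    if sorted(atoms_a) != sorted(atoms_b) or len(bonds_a) != len(bonds_b):
        return False
    n = len(atoms_a)
    for perm in permutations(range(n)):
        # The mapping must preserve element labels.
        if any(atoms_a[i] != atoms_b[perm[i]] for i in range(n)):
            continue
        mapped = {frozenset(perm[i] for i in pair): order
                  for pair, order in bonds_a.items()}
        if mapped == bonds_b:
            return True
    return False

def minimal_simplifications(a, b):
    """Return a smallest tuple of simplification names under which a and b
    become isomorphic, or None if no combination makes them match."""
    names = list(SIMPLIFICATIONS)
    for size in range(len(names) + 1):  # try the fewest simplifications first
        for subset in combinations(names, size):
            sa, sb = a, b
            for name in subset:
                sa = SIMPLIFICATIONS[name](sa)
                sb = SIMPLIFICATIONS[name](sb)
            if isomorphic(sa, sb):
                return subset
    return None

# Formaldehyde with explicit hydrogens and a double bond.
full = (["C", "O", "H", "H"],
        {frozenset({0, 1}): 2, frozenset({0, 2}): 1, frozenset({0, 3}): 1})
# Same connectivity, different atom order, bond orders not recorded.
no_orders = (["O", "C", "H", "H"],
             {frozenset({0, 1}): 1, frozenset({1, 2}): 1, frozenset({1, 3}): 1})
# Heavy-atom skeleton only.
skeleton = (["O", "C"], {frozenset({0, 1}): 1})

print(minimal_simplifications(full, full))
print(minimal_simplifications(full, no_orders))
print(minimal_simplifications(full, skeleton))
```

An empty tuple means the two descriptions agree immediately, and `None` means no combination of the listed simplifications reconciles them, which in the terms of the summary would flag an entry for further curation.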