Cargando…

Capturing mixture composition: an open machine-readable format for representing mixed substances

We describe a file format that is designed to represent mixtures of compounds in a way that is fully machine readable. This Mixfile format is intended to fill the same role for substances that are composed of multiple components as the venerable Molfile does for specifying individual structures. Thi...

Descripción completa

Detalles Bibliográficos
Autores principales: Clark, Alex M., McEwen, Leah R., Gedeck, Peter, Bunin, Barry A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6533230/
https://www.ncbi.nlm.nih.gov/pubmed/31124006
http://dx.doi.org/10.1186/s13321-019-0357-4
_version_ 1783421148267020288
author Clark, Alex M.
McEwen, Leah R.
Gedeck, Peter
Bunin, Barry A.
author_facet Clark, Alex M.
McEwen, Leah R.
Gedeck, Peter
Bunin, Barry A.
author_sort Clark, Alex M.
collection PubMed
description We describe a file format that is designed to represent mixtures of compounds in a way that is fully machine readable. This Mixfile format is intended to fill the same role for substances that are composed of multiple components as the venerable Molfile does for specifying individual structures. This much needed datastructure is intended to replace current practices for communicating information about mixtures, which usually relies on human-readable text descriptions, drawing several species within a single molecular diagram, or mutually incompatible ad hoc solutions. We describe an open source software application for editing mixture files, which can also be used as web-ready tools for manipulating the file format. We also present a corpus of mixture examples, which we have extracted from collections of text-based descriptions. Furthermore, we present an early look at the proposed IUPAC Mixtures InChI specification, instances of which can be automatically generated using the Mixfile format as a precursor.
format Online
Article
Text
id pubmed-6533230
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-65332302019-05-30 Capturing mixture composition: an open machine-readable format for representing mixed substances Clark, Alex M. McEwen, Leah R. Gedeck, Peter Bunin, Barry A. J Cheminform Research Article We describe a file format that is designed to represent mixtures of compounds in a way that is fully machine readable. This Mixfile format is intended to fill the same role for substances that are composed of multiple components as the venerable Molfile does for specifying individual structures. This much needed datastructure is intended to replace current practices for communicating information about mixtures, which usually relies on human-readable text descriptions, drawing several species within a single molecular diagram, or mutually incompatible ad hoc solutions. We describe an open source software application for editing mixture files, which can also be used as web-ready tools for manipulating the file format. We also present a corpus of mixture examples, which we have extracted from collections of text-based descriptions. Furthermore, we present an early look at the proposed IUPAC Mixtures InChI specification, instances of which can be automatically generated using the Mixfile format as a precursor. Springer International Publishing 2019-05-23 /pmc/articles/PMC6533230/ /pubmed/31124006 http://dx.doi.org/10.1186/s13321-019-0357-4 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Clark, Alex M.
McEwen, Leah R.
Gedeck, Peter
Bunin, Barry A.
Capturing mixture composition: an open machine-readable format for representing mixed substances
title Capturing mixture composition: an open machine-readable format for representing mixed substances
title_full Capturing mixture composition: an open machine-readable format for representing mixed substances
title_fullStr Capturing mixture composition: an open machine-readable format for representing mixed substances
title_full_unstemmed Capturing mixture composition: an open machine-readable format for representing mixed substances
title_short Capturing mixture composition: an open machine-readable format for representing mixed substances
title_sort capturing mixture composition: an open machine-readable format for representing mixed substances
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6533230/
https://www.ncbi.nlm.nih.gov/pubmed/31124006
http://dx.doi.org/10.1186/s13321-019-0357-4
work_keys_str_mv AT clarkalexm capturingmixturecompositionanopenmachinereadableformatforrepresentingmixedsubstances
AT mcewenleahr capturingmixturecompositionanopenmachinereadableformatforrepresentingmixedsubstances
AT gedeckpeter capturingmixturecompositionanopenmachinereadableformatforrepresentingmixedsubstances
AT buninbarrya capturingmixturecompositionanopenmachinereadableformatforrepresentingmixedsubstances