Cargando…

Automatic NMR-Based Identification of Chemical Reaction Types in Mixtures of Co-Occurring Reactions

The combination of chemoinformatics approaches with NMR techniques and the increasing availability of data allow the resolution of problems far beyond the original application of NMR in structure elucidation/verification. The diversity of applications can range from process monitoring, metabolic pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Latino, Diogo A. R. S., Aires-de-Sousa, João
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3923800/
https://www.ncbi.nlm.nih.gov/pubmed/24551112
http://dx.doi.org/10.1371/journal.pone.0088499
_version_ 1782303660541214720
author Latino, Diogo A. R. S.
Aires-de-Sousa, João
author_facet Latino, Diogo A. R. S.
Aires-de-Sousa, João
author_sort Latino, Diogo A. R. S.
collection PubMed
description The combination of chemoinformatics approaches with NMR techniques and the increasing availability of data allow the resolution of problems far beyond the original application of NMR in structure elucidation/verification. The diversity of applications can range from process monitoring, metabolic profiling, authentication of products, to quality control. An application related to the automatic analysis of complex mixtures concerns mixtures of chemical reactions. We encoded mixtures of chemical reactions with the difference between the (1)H NMR spectra of the products and the reactants. All the signals arising from all the reactants of the co-occurring reactions were taken together (a simulated spectrum of the mixture of reactants) and the same was done for products. The difference spectrum is taken as the representation of the mixture of chemical reactions. A data set of 181 chemical reactions was used, each reaction manually assigned to one of 6 types. From this dataset, we simulated mixtures where two reactions of different types would occur simultaneously. Automatic learning methods were trained to classify the reactions occurring in a mixture from the (1)H NMR-based descriptor of the mixture. Unsupervised learning methods (self-organizing maps) produced a reasonable clustering of the mixtures by reaction type, and allowed the correct classification of 80% and 63% of the mixtures in two independent test sets of different similarity to the training set. With random forests (RF), the percentage of correct classifications was increased to 99% and 80% for the same test sets. The RF probability associated to the predictions yielded a robust indication of their reliability. This study demonstrates the possibility of applying machine learning methods to automatically identify types of co-occurring chemical reactions from NMR data. Using no explicit structural information about the reactions participants, reaction elucidation is performed without structure elucidation of the molecules in the mixtures.
format Online
Article
Text
id pubmed-3923800
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39238002014-02-18 Automatic NMR-Based Identification of Chemical Reaction Types in Mixtures of Co-Occurring Reactions Latino, Diogo A. R. S. Aires-de-Sousa, João PLoS One Research Article The combination of chemoinformatics approaches with NMR techniques and the increasing availability of data allow the resolution of problems far beyond the original application of NMR in structure elucidation/verification. The diversity of applications can range from process monitoring, metabolic profiling, authentication of products, to quality control. An application related to the automatic analysis of complex mixtures concerns mixtures of chemical reactions. We encoded mixtures of chemical reactions with the difference between the (1)H NMR spectra of the products and the reactants. All the signals arising from all the reactants of the co-occurring reactions were taken together (a simulated spectrum of the mixture of reactants) and the same was done for products. The difference spectrum is taken as the representation of the mixture of chemical reactions. A data set of 181 chemical reactions was used, each reaction manually assigned to one of 6 types. From this dataset, we simulated mixtures where two reactions of different types would occur simultaneously. Automatic learning methods were trained to classify the reactions occurring in a mixture from the (1)H NMR-based descriptor of the mixture. Unsupervised learning methods (self-organizing maps) produced a reasonable clustering of the mixtures by reaction type, and allowed the correct classification of 80% and 63% of the mixtures in two independent test sets of different similarity to the training set. With random forests (RF), the percentage of correct classifications was increased to 99% and 80% for the same test sets. The RF probability associated to the predictions yielded a robust indication of their reliability. This study demonstrates the possibility of applying machine learning methods to automatically identify types of co-occurring chemical reactions from NMR data. Using no explicit structural information about the reactions participants, reaction elucidation is performed without structure elucidation of the molecules in the mixtures. Public Library of Science 2014-02-13 /pmc/articles/PMC3923800/ /pubmed/24551112 http://dx.doi.org/10.1371/journal.pone.0088499 Text en © 2014 Latino, Aires-de-Sousa http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Latino, Diogo A. R. S.
Aires-de-Sousa, João
Automatic NMR-Based Identification of Chemical Reaction Types in Mixtures of Co-Occurring Reactions
title Automatic NMR-Based Identification of Chemical Reaction Types in Mixtures of Co-Occurring Reactions
title_full Automatic NMR-Based Identification of Chemical Reaction Types in Mixtures of Co-Occurring Reactions
title_fullStr Automatic NMR-Based Identification of Chemical Reaction Types in Mixtures of Co-Occurring Reactions
title_full_unstemmed Automatic NMR-Based Identification of Chemical Reaction Types in Mixtures of Co-Occurring Reactions
title_short Automatic NMR-Based Identification of Chemical Reaction Types in Mixtures of Co-Occurring Reactions
title_sort automatic nmr-based identification of chemical reaction types in mixtures of co-occurring reactions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3923800/
https://www.ncbi.nlm.nih.gov/pubmed/24551112
http://dx.doi.org/10.1371/journal.pone.0088499
work_keys_str_mv AT latinodiogoars automaticnmrbasedidentificationofchemicalreactiontypesinmixturesofcooccurringreactions
AT airesdesousajoao automaticnmrbasedidentificationofchemicalreactiontypesinmixturesofcooccurringreactions