An initial investigation of accuracy required for the identification of small molecules in complex samples using quantum chemical calculated NMR chemical shifts

The majority of primary and secondary metabolites in nature have yet to be identified, representing a major challenge for metabolomics studies that currently require reference libraries from analyses of authentic compounds. Using currently available analytical methods, complete chemical characteriza...

Bibliographic Details
Main Authors: Yesiltepe, Yasemin; Govind, Niranjan; Metz, Thomas O.; Renslow, Ryan S.
Format: Online Article Text
Language: English
Published: Springer International Publishing 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9499888/
https://www.ncbi.nlm.nih.gov/pubmed/36138446
http://dx.doi.org/10.1186/s13321-022-00587-7
_version_ 1784795097510969344
author Yesiltepe, Yasemin
Govind, Niranjan
Metz, Thomas O.
Renslow, Ryan S.
collection PubMed
description The majority of primary and secondary metabolites in nature have yet to be identified, representing a major challenge for metabolomics studies that currently require reference libraries from analyses of authentic compounds. Using currently available analytical methods, complete chemical characterization of metabolomes is infeasible for both technical and economic reasons. For example, unambiguous identification of metabolites is limited by the availability of authentic chemical standards, which, for the majority of molecules, do not exist. Computationally predicted or calculated data are a viable solution to expand the currently limited metabolite reference libraries, if such methods are shown to be sufficiently accurate. For example, determining nuclear magnetic resonance (NMR) spectroscopy spectra in silico has shown promise in the identification and delineation of metabolite structures. Many researchers have been taking advantage of density functional theory (DFT), a computationally inexpensive yet reputable method for the prediction of carbon and proton NMR spectra of metabolites. However, such methods are expected to have some error in predicted ¹³C and ¹H NMR spectra with respect to experimentally measured values. This leads us to the question: what accuracy is required in predicted ¹³C and ¹H NMR chemical shifts for confident metabolite identification? Using the set of 11,716 small molecules found in the Human Metabolome Database (HMDB), we simulated both experimental and theoretical NMR chemical shift databases. We investigated the level of accuracy required for identification of metabolites in simulated pure and impure samples by matching predicted chemical shifts to experimental data. We found that 90% or more of molecules in simulated pure samples can be successfully identified when errors of ¹H and ¹³C chemical shifts are below 0.6 and 7.1 ppm, respectively, in water, and below 0.5 and 4.6 ppm, respectively, in chloroform. In simulated complex mixtures, as the complexity of the mixture increased, greater accuracy of the calculated chemical shifts was required, as expected. However, if the number of molecules in the mixture is known, e.g., when NMR is combined with MS and sample complexity is low, the likelihood of confident molecular identification increased by 90%. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-022-00587-7.
format Online
Article
Text
id pubmed-9499888
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-9499888 2022-09-24 An initial investigation of accuracy required for the identification of small molecules in complex samples using quantum chemical calculated NMR chemical shifts Yesiltepe, Yasemin Govind, Niranjan Metz, Thomas O. Renslow, Ryan S. J Cheminform Research Article Springer International Publishing 2022-09-22 /pmc/articles/PMC9499888/ /pubmed/36138446 http://dx.doi.org/10.1186/s13321-022-00587-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/ Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (https://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
title An initial investigation of accuracy required for the identification of small molecules in complex samples using quantum chemical calculated NMR chemical shifts
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9499888/
https://www.ncbi.nlm.nih.gov/pubmed/36138446
http://dx.doi.org/10.1186/s13321-022-00587-7