Cargando…

NMR-TS: de novo molecule identification from NMR spectra

Nuclear magnetic resonance (NMR) spectroscopy is an effective tool for identifying molecules in a sample. Although many previously observed NMR spectra are accumulated in public databases, they cover only a tiny fraction of the chemical space, and molecule identification is typically accomplished ma...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Jinzhe, Terayama, Kei, Sumita, Masato, Yoshizoe, Kazuki, Ito, Kengo, Kikuchi, Jun, Tsuda, Koji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Taylor & Francis 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7476483/
https://www.ncbi.nlm.nih.gov/pubmed/32939179
http://dx.doi.org/10.1080/14686996.2020.1793382
_version_ 1783579709189128192
author Zhang, Jinzhe
Terayama, Kei
Sumita, Masato
Yoshizoe, Kazuki
Ito, Kengo
Kikuchi, Jun
Tsuda, Koji
author_facet Zhang, Jinzhe
Terayama, Kei
Sumita, Masato
Yoshizoe, Kazuki
Ito, Kengo
Kikuchi, Jun
Tsuda, Koji
author_sort Zhang, Jinzhe
collection PubMed
description Nuclear magnetic resonance (NMR) spectroscopy is an effective tool for identifying molecules in a sample. Although many previously observed NMR spectra are accumulated in public databases, they cover only a tiny fraction of the chemical space, and molecule identification is typically accomplished manually based on expert knowledge. Herein, we propose NMR-TS, a machine-learning-based python library, to automatically identify a molecule from its NMR spectrum. NMR-TS discovers candidate molecules whose NMR spectra match the target spectrum by using deep learning and density functional theory (DFT)-computed spectra. As a proof-of-concept, we identify prototypical metabolites from their computed spectra. After an average 5451 DFT runs for each spectrum, six of the nine molecules are identified correctly, and proximal molecules are obtained in the other cases. This encouraging result implies that de novo molecule generation can contribute to the fully automated identification of chemical structures. NMR-TS is available at https://github.com/tsudalab/NMR-TS.
format Online
Article
Text
id pubmed-7476483
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Taylor & Francis
record_format MEDLINE/PubMed
spelling pubmed-74764832020-09-15 NMR-TS: de novo molecule identification from NMR spectra Zhang, Jinzhe Terayama, Kei Sumita, Masato Yoshizoe, Kazuki Ito, Kengo Kikuchi, Jun Tsuda, Koji Sci Technol Adv Mater New topics/Others Nuclear magnetic resonance (NMR) spectroscopy is an effective tool for identifying molecules in a sample. Although many previously observed NMR spectra are accumulated in public databases, they cover only a tiny fraction of the chemical space, and molecule identification is typically accomplished manually based on expert knowledge. Herein, we propose NMR-TS, a machine-learning-based python library, to automatically identify a molecule from its NMR spectrum. NMR-TS discovers candidate molecules whose NMR spectra match the target spectrum by using deep learning and density functional theory (DFT)-computed spectra. As a proof-of-concept, we identify prototypical metabolites from their computed spectra. After an average 5451 DFT runs for each spectrum, six of the nine molecules are identified correctly, and proximal molecules are obtained in the other cases. This encouraging result implies that de novo molecule generation can contribute to the fully automated identification of chemical structures. NMR-TS is available at https://github.com/tsudalab/NMR-TS. Taylor & Francis 2020-07-30 /pmc/articles/PMC7476483/ /pubmed/32939179 http://dx.doi.org/10.1080/14686996.2020.1793382 Text en © 2020 The Author(s). Published by National Institute for Materials Science in partnership with Taylor & Francis Group. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle New topics/Others
Zhang, Jinzhe
Terayama, Kei
Sumita, Masato
Yoshizoe, Kazuki
Ito, Kengo
Kikuchi, Jun
Tsuda, Koji
NMR-TS: de novo molecule identification from NMR spectra
title NMR-TS: de novo molecule identification from NMR spectra
title_full NMR-TS: de novo molecule identification from NMR spectra
title_fullStr NMR-TS: de novo molecule identification from NMR spectra
title_full_unstemmed NMR-TS: de novo molecule identification from NMR spectra
title_short NMR-TS: de novo molecule identification from NMR spectra
title_sort nmr-ts: de novo molecule identification from nmr spectra
topic New topics/Others
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7476483/
https://www.ncbi.nlm.nih.gov/pubmed/32939179
http://dx.doi.org/10.1080/14686996.2020.1793382
work_keys_str_mv AT zhangjinzhe nmrtsdenovomoleculeidentificationfromnmrspectra
AT terayamakei nmrtsdenovomoleculeidentificationfromnmrspectra
AT sumitamasato nmrtsdenovomoleculeidentificationfromnmrspectra
AT yoshizoekazuki nmrtsdenovomoleculeidentificationfromnmrspectra
AT itokengo nmrtsdenovomoleculeidentificationfromnmrspectra
AT kikuchijun nmrtsdenovomoleculeidentificationfromnmrspectra
AT tsudakoji nmrtsdenovomoleculeidentificationfromnmrspectra