Cargando…

Building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking

BACKGROUND: In metabolomics experiments, spectral fingerprints of metabolites with no known structural identity are detected routinely. Computer-assisted structure elucidation (CASE) has been used to determine the structural identities of unknown compounds. It is generally accepted that a single 1D...

Descripción completa

Detalles Bibliográficos
Autores principales: Jayaseelan, Kalai Vanii, Steinbeck, Christoph
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4089557/
https://www.ncbi.nlm.nih.gov/pubmed/24996690
http://dx.doi.org/10.1186/1471-2105-15-234
_version_ 1782325125399445504
author Jayaseelan, Kalai Vanii
Steinbeck, Christoph
author_facet Jayaseelan, Kalai Vanii
Steinbeck, Christoph
author_sort Jayaseelan, Kalai Vanii
collection PubMed
description BACKGROUND: In metabolomics experiments, spectral fingerprints of metabolites with no known structural identity are detected routinely. Computer-assisted structure elucidation (CASE) has been used to determine the structural identities of unknown compounds. It is generally accepted that a single 1D NMR spectrum or mass spectrum is usually not sufficient to establish the identity of a hitherto unknown compound. When a suite of spectra from 1D and 2D NMR experiments supplemented with a molecular formula are available, the successful elucidation of the chemical structure for candidates with up to 30 heavy atoms has been reported previously by one of the authors. In high-throughput metabolomics, usually 1D NMR or mass spectrometry experiments alone are conducted for rapid analysis of samples. This method subsequently requires that the spectral patterns are analyzed automatically to quickly identify known and unknown structures. In this study, we investigated whether additional existing knowledge, such as the fact that the unknown compound is a natural product, can be used to improve the ranking of the correct structure in the result list after the structure elucidation process. RESULTS: To identify unknowns using as little spectroscopic information as possible, we implemented an evolutionary algorithm-based CASE mechanism to elucidate candidates in a fully automated fashion, with input of the molecular formula and (13)C NMR spectrum of the isolated compound. We also tested how filters like natural product-likeness, a measure that calculates the similarity of the compounds to known natural product space, might enhance the performance and quality of the structure elucidation. The evolutionary algorithm is implemented within the SENECA package for CASE reported previously, and is available for free download under artistic license at http://sourceforge.net/projects/seneca/. The natural product-likeness calculator is incorporated as a plugin within SENECA and is available as a GUI client and command-line executable. Significant improvements in candidate ranking were demonstrated for 41 small test molecules when the CASE system was supplemented by a natural product-likeness filter. CONCLUSIONS: In spectroscopically underdetermined structure elucidation problems, natural product-likeness can contribute to a better ranking of the correct structure in the results list.
format Online
Article
Text
id pubmed-4089557
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40895572014-07-23 Building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking Jayaseelan, Kalai Vanii Steinbeck, Christoph BMC Bioinformatics Methodology Article BACKGROUND: In metabolomics experiments, spectral fingerprints of metabolites with no known structural identity are detected routinely. Computer-assisted structure elucidation (CASE) has been used to determine the structural identities of unknown compounds. It is generally accepted that a single 1D NMR spectrum or mass spectrum is usually not sufficient to establish the identity of a hitherto unknown compound. When a suite of spectra from 1D and 2D NMR experiments supplemented with a molecular formula are available, the successful elucidation of the chemical structure for candidates with up to 30 heavy atoms has been reported previously by one of the authors. In high-throughput metabolomics, usually 1D NMR or mass spectrometry experiments alone are conducted for rapid analysis of samples. This method subsequently requires that the spectral patterns are analyzed automatically to quickly identify known and unknown structures. In this study, we investigated whether additional existing knowledge, such as the fact that the unknown compound is a natural product, can be used to improve the ranking of the correct structure in the result list after the structure elucidation process. RESULTS: To identify unknowns using as little spectroscopic information as possible, we implemented an evolutionary algorithm-based CASE mechanism to elucidate candidates in a fully automated fashion, with input of the molecular formula and (13)C NMR spectrum of the isolated compound. We also tested how filters like natural product-likeness, a measure that calculates the similarity of the compounds to known natural product space, might enhance the performance and quality of the structure elucidation. The evolutionary algorithm is implemented within the SENECA package for CASE reported previously, and is available for free download under artistic license at http://sourceforge.net/projects/seneca/. The natural product-likeness calculator is incorporated as a plugin within SENECA and is available as a GUI client and command-line executable. Significant improvements in candidate ranking were demonstrated for 41 small test molecules when the CASE system was supplemented by a natural product-likeness filter. CONCLUSIONS: In spectroscopically underdetermined structure elucidation problems, natural product-likeness can contribute to a better ranking of the correct structure in the results list. BioMed Central 2014-07-05 /pmc/articles/PMC4089557/ /pubmed/24996690 http://dx.doi.org/10.1186/1471-2105-15-234 Text en Copyright © 2014 Jayaseelan and Steinbeck; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Jayaseelan, Kalai Vanii
Steinbeck, Christoph
Building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking
title Building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking
title_full Building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking
title_fullStr Building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking
title_full_unstemmed Building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking
title_short Building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking
title_sort building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4089557/
https://www.ncbi.nlm.nih.gov/pubmed/24996690
http://dx.doi.org/10.1186/1471-2105-15-234
work_keys_str_mv AT jayaseelankalaivanii buildingblocksforautomatedelucidationofmetabolitesnaturalproductlikenessforcandidateranking
AT steinbeckchristoph buildingblocksforautomatedelucidationofmetabolitesnaturalproductlikenessforcandidateranking