Cargando…
Building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking
BACKGROUND: In metabolomics experiments, spectral fingerprints of metabolites with no known structural identity are detected routinely. Computer-assisted structure elucidation (CASE) has been used to determine the structural identities of unknown compounds. It is generally accepted that a single 1D...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4089557/ https://www.ncbi.nlm.nih.gov/pubmed/24996690 http://dx.doi.org/10.1186/1471-2105-15-234 |
_version_ | 1782325125399445504 |
---|---|
author | Jayaseelan, Kalai Vanii Steinbeck, Christoph |
author_facet | Jayaseelan, Kalai Vanii Steinbeck, Christoph |
author_sort | Jayaseelan, Kalai Vanii |
collection | PubMed |
description | BACKGROUND: In metabolomics experiments, spectral fingerprints of metabolites with no known structural identity are detected routinely. Computer-assisted structure elucidation (CASE) has been used to determine the structural identities of unknown compounds. It is generally accepted that a single 1D NMR spectrum or mass spectrum is usually not sufficient to establish the identity of a hitherto unknown compound. When a suite of spectra from 1D and 2D NMR experiments supplemented with a molecular formula are available, the successful elucidation of the chemical structure for candidates with up to 30 heavy atoms has been reported previously by one of the authors. In high-throughput metabolomics, usually 1D NMR or mass spectrometry experiments alone are conducted for rapid analysis of samples. This method subsequently requires that the spectral patterns are analyzed automatically to quickly identify known and unknown structures. In this study, we investigated whether additional existing knowledge, such as the fact that the unknown compound is a natural product, can be used to improve the ranking of the correct structure in the result list after the structure elucidation process. RESULTS: To identify unknowns using as little spectroscopic information as possible, we implemented an evolutionary algorithm-based CASE mechanism to elucidate candidates in a fully automated fashion, with input of the molecular formula and (13)C NMR spectrum of the isolated compound. We also tested how filters like natural product-likeness, a measure that calculates the similarity of the compounds to known natural product space, might enhance the performance and quality of the structure elucidation. The evolutionary algorithm is implemented within the SENECA package for CASE reported previously, and is available for free download under artistic license at http://sourceforge.net/projects/seneca/. The natural product-likeness calculator is incorporated as a plugin within SENECA and is available as a GUI client and command-line executable. Significant improvements in candidate ranking were demonstrated for 41 small test molecules when the CASE system was supplemented by a natural product-likeness filter. CONCLUSIONS: In spectroscopically underdetermined structure elucidation problems, natural product-likeness can contribute to a better ranking of the correct structure in the results list. |
format | Online Article Text |
id | pubmed-4089557 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40895572014-07-23 Building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking Jayaseelan, Kalai Vanii Steinbeck, Christoph BMC Bioinformatics Methodology Article BACKGROUND: In metabolomics experiments, spectral fingerprints of metabolites with no known structural identity are detected routinely. Computer-assisted structure elucidation (CASE) has been used to determine the structural identities of unknown compounds. It is generally accepted that a single 1D NMR spectrum or mass spectrum is usually not sufficient to establish the identity of a hitherto unknown compound. When a suite of spectra from 1D and 2D NMR experiments supplemented with a molecular formula are available, the successful elucidation of the chemical structure for candidates with up to 30 heavy atoms has been reported previously by one of the authors. In high-throughput metabolomics, usually 1D NMR or mass spectrometry experiments alone are conducted for rapid analysis of samples. This method subsequently requires that the spectral patterns are analyzed automatically to quickly identify known and unknown structures. In this study, we investigated whether additional existing knowledge, such as the fact that the unknown compound is a natural product, can be used to improve the ranking of the correct structure in the result list after the structure elucidation process. RESULTS: To identify unknowns using as little spectroscopic information as possible, we implemented an evolutionary algorithm-based CASE mechanism to elucidate candidates in a fully automated fashion, with input of the molecular formula and (13)C NMR spectrum of the isolated compound. We also tested how filters like natural product-likeness, a measure that calculates the similarity of the compounds to known natural product space, might enhance the performance and quality of the structure elucidation. The evolutionary algorithm is implemented within the SENECA package for CASE reported previously, and is available for free download under artistic license at http://sourceforge.net/projects/seneca/. The natural product-likeness calculator is incorporated as a plugin within SENECA and is available as a GUI client and command-line executable. Significant improvements in candidate ranking were demonstrated for 41 small test molecules when the CASE system was supplemented by a natural product-likeness filter. CONCLUSIONS: In spectroscopically underdetermined structure elucidation problems, natural product-likeness can contribute to a better ranking of the correct structure in the results list. BioMed Central 2014-07-05 /pmc/articles/PMC4089557/ /pubmed/24996690 http://dx.doi.org/10.1186/1471-2105-15-234 Text en Copyright © 2014 Jayaseelan and Steinbeck; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Jayaseelan, Kalai Vanii Steinbeck, Christoph Building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking |
title | Building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking |
title_full | Building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking |
title_fullStr | Building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking |
title_full_unstemmed | Building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking |
title_short | Building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking |
title_sort | building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4089557/ https://www.ncbi.nlm.nih.gov/pubmed/24996690 http://dx.doi.org/10.1186/1471-2105-15-234 |
work_keys_str_mv | AT jayaseelankalaivanii buildingblocksforautomatedelucidationofmetabolitesnaturalproductlikenessforcandidateranking AT steinbeckchristoph buildingblocksforautomatedelucidationofmetabolitesnaturalproductlikenessforcandidateranking |