The volatile compound BinBase mass spectral database

BACKGROUND: Volatile compounds comprise diverse chemical groups with wide-ranging sources and functions. These compounds originate from major pathways of secondary metabolism in many organisms and play essential roles in chemical ecology in both plant and animal kingdoms. In past decades, sampling m...

Bibliographic Details
Main Authors: Skogerson, Kirsten, Wohlgemuth, Gert, Barupal, Dinesh K, Fiehn, Oliver
Format: Online Article Text
Language: English
Published: BioMed Central 2011
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3199763/
https://www.ncbi.nlm.nih.gov/pubmed/21816034
http://dx.doi.org/10.1186/1471-2105-12-321
_version_ 1782214590967316480
author Skogerson, Kirsten
Wohlgemuth, Gert
Barupal, Dinesh K
Fiehn, Oliver
author_facet Skogerson, Kirsten
Wohlgemuth, Gert
Barupal, Dinesh K
Fiehn, Oliver
author_sort Skogerson, Kirsten
collection PubMed
description BACKGROUND: Volatile compounds comprise diverse chemical groups with wide-ranging sources and functions. These compounds originate from major pathways of secondary metabolism in many organisms and play essential roles in chemical ecology in both plant and animal kingdoms. In past decades, sampling methods and instrumentation for the analysis of complex volatile mixtures have improved; however, design and implementation of database tools to process and store the complex datasets have lagged behind. DESCRIPTION: The volatile compound BinBase (vocBinBase) is an automated peak annotation and database system developed for the analysis of GC-TOF-MS data derived from complex volatile mixtures. The vocBinBase DB is an extension of the previously reported metabolite BinBase software developed to track and identify derivatized metabolites. The BinBase algorithm uses deconvoluted spectra and peak metadata (retention index, unique ion, spectral similarity, peak signal-to-noise ratio, and peak purity) from the Leco ChromaTOF software, and annotates peaks using a multi-tiered filtering system with stringent thresholds. The vocBinBase algorithm assigns the identity of compounds existing in the database. Volatile compound assignments are supported by the Adams mass spectral-retention index library, which contains over 2,000 plant-derived volatile compounds. Novel molecules that are not found within vocBinBase are automatically added using strict mass spectral and experimental criteria. Users obtain fully annotated data sheets with quantitative information for all volatile compounds for studies that may consist of thousands of chromatograms. The vocBinBase database may also be queried across different studies, comprising currently 1,537 unique mass spectra generated from 1.7 million deconvoluted mass spectra of 3,435 samples (18 species). 
Mass spectra with retention indices and volatile profiles are available as free download under the CC-BY agreement (http://vocbinbase.fiehnlab.ucdavis.edu). CONCLUSIONS: The BinBase database algorithms have been successfully modified to allow for tracking and identification of volatile compounds in complex mixtures. The database is capable of annotating large datasets (hundreds to thousands of samples) and is well-suited for between-study comparisons such as chemotaxonomy investigations. This novel volatile compound database tool is applicable to research fields spanning chemical ecology to human health. The BinBase source code is freely available at http://binbase.sourceforge.net/ under the LGPL 2.0 license agreement.
format Online
Article
Text
id pubmed-3199763
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31997632011-10-24 The volatile compound BinBase mass spectral database Skogerson, Kirsten Wohlgemuth, Gert Barupal, Dinesh K Fiehn, Oliver BMC Bioinformatics Database BioMed Central 2011-08-04 /pmc/articles/PMC3199763/ /pubmed/21816034 http://dx.doi.org/10.1186/1471-2105-12-321 Text en Copyright ©2011 Skogerson et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database
Skogerson, Kirsten
Wohlgemuth, Gert
Barupal, Dinesh K
Fiehn, Oliver
The volatile compound BinBase mass spectral database
title The volatile compound BinBase mass spectral database
title_full The volatile compound BinBase mass spectral database
title_fullStr The volatile compound BinBase mass spectral database
title_full_unstemmed The volatile compound BinBase mass spectral database
title_short The volatile compound BinBase mass spectral database
title_sort volatile compound binbase mass spectral database
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3199763/
https://www.ncbi.nlm.nih.gov/pubmed/21816034
http://dx.doi.org/10.1186/1471-2105-12-321
work_keys_str_mv AT skogersonkirsten thevolatilecompoundbinbasemassspectraldatabase
AT wohlgemuthgert thevolatilecompoundbinbasemassspectraldatabase
AT barupaldineshk thevolatilecompoundbinbasemassspectraldatabase
AT fiehnoliver thevolatilecompoundbinbasemassspectraldatabase
AT skogersonkirsten volatilecompoundbinbasemassspectraldatabase
AT wohlgemuthgert volatilecompoundbinbasemassspectraldatabase
AT barupaldineshk volatilecompoundbinbasemassspectraldatabase
AT fiehnoliver volatilecompoundbinbasemassspectraldatabase