Cargando…

A modular computational framework for automated peak extraction from ion mobility spectra

BACKGROUND: An ion mobility (IM) spectrometer coupled with a multi-capillary column (MCC) measures volatile organic compounds (VOCs) in the air or in exhaled breath. This technique is utilized in several biotechnological and medical applications. Each peak in an MCC/IM measurement represents a certa...

Descripción completa

Detalles Bibliográficos
Autores principales: D’Addario, Marianna, Kopczynski, Dominik, Baumbach, Jörg Ingo, Rahmann, Sven
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3930762/
https://www.ncbi.nlm.nih.gov/pubmed/24450533
http://dx.doi.org/10.1186/1471-2105-15-25
_version_ 1782304583414972416
author D’Addario, Marianna
Kopczynski, Dominik
Baumbach, Jörg Ingo
Rahmann, Sven
author_facet D’Addario, Marianna
Kopczynski, Dominik
Baumbach, Jörg Ingo
Rahmann, Sven
author_sort D’Addario, Marianna
collection PubMed
description BACKGROUND: An ion mobility (IM) spectrometer coupled with a multi-capillary column (MCC) measures volatile organic compounds (VOCs) in the air or in exhaled breath. This technique is utilized in several biotechnological and medical applications. Each peak in an MCC/IM measurement represents a certain compound, which may be known or unknown. For clustering and classification of measurements, the raw data matrix must be reduced to a set of peaks. Each peak is described by its coordinates (retention time in the MCC and reduced inverse ion mobility) and shape (signal intensity, further shape parameters). This fundamental step is referred to as peak extraction. It is the basis for identifying discriminating peaks, and hence putative biomarkers, between two classes of measurements, such as a healthy control group and a group of patients with a confirmed disease. Current state-of-the-art peak extraction methods require human interaction, such as hand-picking approximate peak locations, assisted by a visualization of the data matrix. In a high-throughput context, however, it is preferable to have robust methods for fully automated peak extraction. RESULTS: We introduce PEAX, a modular framework for automated peak extraction. The framework consists of several steps in a pipeline architecture. Each step performs a specific sub-task and can be instantiated by different methods implemented as modules. We provide open-source software for the framework and several modules for each step. Additionally, an interface that allows easy extension by a new module is provided. Combining the modules in all reasonable ways leads to a large number of peak extraction methods. We evaluate all combinations using intrinsic error measures and by comparing the resulting peak sets with an expert-picked one. CONCLUSIONS: Our software PEAX is able to automatically extract peaks from MCC/IM measurements within a few seconds. The automatically obtained results keep up with the results provided by current state-of-the-art peak extraction methods. This opens a high-throughput context for the MCC/IM application field. Our software is available at http://www.rahmannlab.de/research/ims.
format Online
Article
Text
id pubmed-3930762
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-39307622014-03-04 A modular computational framework for automated peak extraction from ion mobility spectra D’Addario, Marianna Kopczynski, Dominik Baumbach, Jörg Ingo Rahmann, Sven BMC Bioinformatics Methodology Article BACKGROUND: An ion mobility (IM) spectrometer coupled with a multi-capillary column (MCC) measures volatile organic compounds (VOCs) in the air or in exhaled breath. This technique is utilized in several biotechnological and medical applications. Each peak in an MCC/IM measurement represents a certain compound, which may be known or unknown. For clustering and classification of measurements, the raw data matrix must be reduced to a set of peaks. Each peak is described by its coordinates (retention time in the MCC and reduced inverse ion mobility) and shape (signal intensity, further shape parameters). This fundamental step is referred to as peak extraction. It is the basis for identifying discriminating peaks, and hence putative biomarkers, between two classes of measurements, such as a healthy control group and a group of patients with a confirmed disease. Current state-of-the-art peak extraction methods require human interaction, such as hand-picking approximate peak locations, assisted by a visualization of the data matrix. In a high-throughput context, however, it is preferable to have robust methods for fully automated peak extraction. RESULTS: We introduce PEAX, a modular framework for automated peak extraction. The framework consists of several steps in a pipeline architecture. Each step performs a specific sub-task and can be instantiated by different methods implemented as modules. We provide open-source software for the framework and several modules for each step. Additionally, an interface that allows easy extension by a new module is provided. Combining the modules in all reasonable ways leads to a large number of peak extraction methods. We evaluate all combinations using intrinsic error measures and by comparing the resulting peak sets with an expert-picked one. CONCLUSIONS: Our software PEAX is able to automatically extract peaks from MCC/IM measurements within a few seconds. The automatically obtained results keep up with the results provided by current state-of-the-art peak extraction methods. This opens a high-throughput context for the MCC/IM application field. Our software is available at http://www.rahmannlab.de/research/ims. BioMed Central 2014-01-22 /pmc/articles/PMC3930762/ /pubmed/24450533 http://dx.doi.org/10.1186/1471-2105-15-25 Text en Copyright © 2014 D’Addario et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
D’Addario, Marianna
Kopczynski, Dominik
Baumbach, Jörg Ingo
Rahmann, Sven
A modular computational framework for automated peak extraction from ion mobility spectra
title A modular computational framework for automated peak extraction from ion mobility spectra
title_full A modular computational framework for automated peak extraction from ion mobility spectra
title_fullStr A modular computational framework for automated peak extraction from ion mobility spectra
title_full_unstemmed A modular computational framework for automated peak extraction from ion mobility spectra
title_short A modular computational framework for automated peak extraction from ion mobility spectra
title_sort modular computational framework for automated peak extraction from ion mobility spectra
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3930762/
https://www.ncbi.nlm.nih.gov/pubmed/24450533
http://dx.doi.org/10.1186/1471-2105-15-25
work_keys_str_mv AT daddariomarianna amodularcomputationalframeworkforautomatedpeakextractionfromionmobilityspectra
AT kopczynskidominik amodularcomputationalframeworkforautomatedpeakextractionfromionmobilityspectra
AT baumbachjorgingo amodularcomputationalframeworkforautomatedpeakextractionfromionmobilityspectra
AT rahmannsven amodularcomputationalframeworkforautomatedpeakextractionfromionmobilityspectra
AT daddariomarianna modularcomputationalframeworkforautomatedpeakextractionfromionmobilityspectra
AT kopczynskidominik modularcomputationalframeworkforautomatedpeakextractionfromionmobilityspectra
AT baumbachjorgingo modularcomputationalframeworkforautomatedpeakextractionfromionmobilityspectra
AT rahmannsven modularcomputationalframeworkforautomatedpeakextractionfromionmobilityspectra