Cargando…

IPO: a tool for automated optimization of XCMS parameters

BACKGROUND: Untargeted metabolomics generates a huge amount of data. Software packages for automated data processing are crucial to successfully process these data. A variety of such software packages exist, but the outcome of data processing strongly depends on algorithm parameter settings. If they...

Descripción completa

Detalles Bibliográficos
Autores principales: Libiseller, Gunnar, Dvorzak, Michaela, Kleb, Ulrike, Gander, Edgar, Eisenberg, Tobias, Madeo, Frank, Neumann, Steffen, Trausinger, Gert, Sinner, Frank, Pieber, Thomas, Magnes, Christoph
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4404568/
https://www.ncbi.nlm.nih.gov/pubmed/25888443
http://dx.doi.org/10.1186/s12859-015-0562-8
_version_ 1782367512106631168
author Libiseller, Gunnar
Dvorzak, Michaela
Kleb, Ulrike
Gander, Edgar
Eisenberg, Tobias
Madeo, Frank
Neumann, Steffen
Trausinger, Gert
Sinner, Frank
Pieber, Thomas
Magnes, Christoph
author_facet Libiseller, Gunnar
Dvorzak, Michaela
Kleb, Ulrike
Gander, Edgar
Eisenberg, Tobias
Madeo, Frank
Neumann, Steffen
Trausinger, Gert
Sinner, Frank
Pieber, Thomas
Magnes, Christoph
author_sort Libiseller, Gunnar
collection PubMed
description BACKGROUND: Untargeted metabolomics generates a huge amount of data. Software packages for automated data processing are crucial to successfully process these data. A variety of such software packages exist, but the outcome of data processing strongly depends on algorithm parameter settings. If they are not carefully chosen, suboptimal parameter settings can easily lead to biased results. Therefore, parameter settings also require optimization. Several parameter optimization approaches have already been proposed, but a software package for parameter optimization which is free of intricate experimental labeling steps, fast and widely applicable is still missing. RESULTS: We implemented the software package IPO (‘Isotopologue Parameter Optimization’) which is fast and free of labeling steps, and applicable to data from different kinds of samples and data from different methods of liquid chromatography - high resolution mass spectrometry and data from different instruments. IPO optimizes XCMS peak picking parameters by using natural, stable (13)C isotopic peaks to calculate a peak picking score. Retention time correction is optimized by minimizing relative retention time differences within peak groups. Grouping parameters are optimized by maximizing the number of peak groups that show one peak from each injection of a pooled sample. The different parameter settings are achieved by design of experiments, and the resulting scores are evaluated using response surface models. IPO was tested on three different data sets, each consisting of a training set and test set. IPO resulted in an increase of reliable groups (146% - 361%), a decrease of non-reliable groups (3% - 8%) and a decrease of the retention time deviation to one third. CONCLUSIONS: IPO was successfully applied to data derived from liquid chromatography coupled to high resolution mass spectrometry from three studies with different sample types and different chromatographic methods and devices. We were also able to show the potential of IPO to increase the reliability of metabolomics data. The source code is implemented in R, tested on Linux and Windows and it is freely available for download at https://github.com/glibiseller/IPO. The training sets and test sets can be downloaded from https://health.joanneum.at/IPO. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0562-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4404568
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44045682015-04-22 IPO: a tool for automated optimization of XCMS parameters Libiseller, Gunnar Dvorzak, Michaela Kleb, Ulrike Gander, Edgar Eisenberg, Tobias Madeo, Frank Neumann, Steffen Trausinger, Gert Sinner, Frank Pieber, Thomas Magnes, Christoph BMC Bioinformatics Software BACKGROUND: Untargeted metabolomics generates a huge amount of data. Software packages for automated data processing are crucial to successfully process these data. A variety of such software packages exist, but the outcome of data processing strongly depends on algorithm parameter settings. If they are not carefully chosen, suboptimal parameter settings can easily lead to biased results. Therefore, parameter settings also require optimization. Several parameter optimization approaches have already been proposed, but a software package for parameter optimization which is free of intricate experimental labeling steps, fast and widely applicable is still missing. RESULTS: We implemented the software package IPO (‘Isotopologue Parameter Optimization’) which is fast and free of labeling steps, and applicable to data from different kinds of samples and data from different methods of liquid chromatography - high resolution mass spectrometry and data from different instruments. IPO optimizes XCMS peak picking parameters by using natural, stable (13)C isotopic peaks to calculate a peak picking score. Retention time correction is optimized by minimizing relative retention time differences within peak groups. Grouping parameters are optimized by maximizing the number of peak groups that show one peak from each injection of a pooled sample. The different parameter settings are achieved by design of experiments, and the resulting scores are evaluated using response surface models. IPO was tested on three different data sets, each consisting of a training set and test set. IPO resulted in an increase of reliable groups (146% - 361%), a decrease of non-reliable groups (3% - 8%) and a decrease of the retention time deviation to one third. CONCLUSIONS: IPO was successfully applied to data derived from liquid chromatography coupled to high resolution mass spectrometry from three studies with different sample types and different chromatographic methods and devices. We were also able to show the potential of IPO to increase the reliability of metabolomics data. The source code is implemented in R, tested on Linux and Windows and it is freely available for download at https://github.com/glibiseller/IPO. The training sets and test sets can be downloaded from https://health.joanneum.at/IPO. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0562-8) contains supplementary material, which is available to authorized users. BioMed Central 2015-04-16 /pmc/articles/PMC4404568/ /pubmed/25888443 http://dx.doi.org/10.1186/s12859-015-0562-8 Text en © Libiseller et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Libiseller, Gunnar
Dvorzak, Michaela
Kleb, Ulrike
Gander, Edgar
Eisenberg, Tobias
Madeo, Frank
Neumann, Steffen
Trausinger, Gert
Sinner, Frank
Pieber, Thomas
Magnes, Christoph
IPO: a tool for automated optimization of XCMS parameters
title IPO: a tool for automated optimization of XCMS parameters
title_full IPO: a tool for automated optimization of XCMS parameters
title_fullStr IPO: a tool for automated optimization of XCMS parameters
title_full_unstemmed IPO: a tool for automated optimization of XCMS parameters
title_short IPO: a tool for automated optimization of XCMS parameters
title_sort ipo: a tool for automated optimization of xcms parameters
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4404568/
https://www.ncbi.nlm.nih.gov/pubmed/25888443
http://dx.doi.org/10.1186/s12859-015-0562-8
work_keys_str_mv AT libisellergunnar ipoatoolforautomatedoptimizationofxcmsparameters
AT dvorzakmichaela ipoatoolforautomatedoptimizationofxcmsparameters
AT klebulrike ipoatoolforautomatedoptimizationofxcmsparameters
AT ganderedgar ipoatoolforautomatedoptimizationofxcmsparameters
AT eisenbergtobias ipoatoolforautomatedoptimizationofxcmsparameters
AT madeofrank ipoatoolforautomatedoptimizationofxcmsparameters
AT neumannsteffen ipoatoolforautomatedoptimizationofxcmsparameters
AT trausingergert ipoatoolforautomatedoptimizationofxcmsparameters
AT sinnerfrank ipoatoolforautomatedoptimizationofxcmsparameters
AT pieberthomas ipoatoolforautomatedoptimizationofxcmsparameters
AT magneschristoph ipoatoolforautomatedoptimizationofxcmsparameters