Cargando…

PeakBot: machine-learning-based chromatographic peak picking

MOTIVATION: Chromatographic peak picking is among the first steps in data processing workflows of raw LC-HRMS datasets in untargeted metabolomics applications. Its performance is crucial for the holistic detection of all metabolic features as well as their relative quantification for statistical ana...

Descripción completa

Detalles Bibliográficos
Autores principales: Bueschl, Christoph, Doppler, Maria, Varga, Elisabeth, Seidl, Bernhard, Flasch, Mira, Warth, Benedikt, Zanghellini, Juergen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9237678/
https://www.ncbi.nlm.nih.gov/pubmed/35604083
http://dx.doi.org/10.1093/bioinformatics/btac344
_version_ 1784736854225977344
author Bueschl, Christoph
Doppler, Maria
Varga, Elisabeth
Seidl, Bernhard
Flasch, Mira
Warth, Benedikt
Zanghellini, Juergen
author_facet Bueschl, Christoph
Doppler, Maria
Varga, Elisabeth
Seidl, Bernhard
Flasch, Mira
Warth, Benedikt
Zanghellini, Juergen
author_sort Bueschl, Christoph
collection PubMed
description MOTIVATION: Chromatographic peak picking is among the first steps in data processing workflows of raw LC-HRMS datasets in untargeted metabolomics applications. Its performance is crucial for the holistic detection of all metabolic features as well as their relative quantification for statistical analysis and metabolite identification. Random noise, non-baseline separated compounds and unspecific background signals complicate this task. RESULTS: A machine-learning-based approach entitled PeakBot was developed for detecting chromatographic peaks in LC-HRMS profile-mode data. It first detects all local signal maxima in a chromatogram, which are then extracted as super-sampled standardized areas (retention-time versus m/z). These are subsequently inspected by a custom-trained convolutional neural network that forms the basis of PeakBot’s architecture. The model reports if the respective local maximum is the apex of a chromatographic peak or not as well as its peak center and bounding box. In training and independent validation datasets used for development, PeakBot achieved a high performance with respect to discriminating between chromatographic peaks and background signals (accuracy of 0.99). For training the machine-learning model a minimum of 100 reference features are needed to learn their characteristics to achieve high-quality peak-picking results for detecting such chromatographic peaks in an untargeted fashion. PeakBot is implemented in python (3.8) and uses the TensorFlow (2.5.0) package for machine-learning related tasks. It has been tested on Linux and Windows OSs. AVAILABILITY AND IMPLEMENTATION: The package is available free of charge for non-commercial use (CC BY-NC-SA). It is available at https://github.com/christophuv/PeakBot. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9237678
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-92376782022-06-29 PeakBot: machine-learning-based chromatographic peak picking Bueschl, Christoph Doppler, Maria Varga, Elisabeth Seidl, Bernhard Flasch, Mira Warth, Benedikt Zanghellini, Juergen Bioinformatics Original Papers MOTIVATION: Chromatographic peak picking is among the first steps in data processing workflows of raw LC-HRMS datasets in untargeted metabolomics applications. Its performance is crucial for the holistic detection of all metabolic features as well as their relative quantification for statistical analysis and metabolite identification. Random noise, non-baseline separated compounds and unspecific background signals complicate this task. RESULTS: A machine-learning-based approach entitled PeakBot was developed for detecting chromatographic peaks in LC-HRMS profile-mode data. It first detects all local signal maxima in a chromatogram, which are then extracted as super-sampled standardized areas (retention-time versus m/z). These are subsequently inspected by a custom-trained convolutional neural network that forms the basis of PeakBot’s architecture. The model reports if the respective local maximum is the apex of a chromatographic peak or not as well as its peak center and bounding box. In training and independent validation datasets used for development, PeakBot achieved a high performance with respect to discriminating between chromatographic peaks and background signals (accuracy of 0.99). For training the machine-learning model a minimum of 100 reference features are needed to learn their characteristics to achieve high-quality peak-picking results for detecting such chromatographic peaks in an untargeted fashion. PeakBot is implemented in python (3.8) and uses the TensorFlow (2.5.0) package for machine-learning related tasks. It has been tested on Linux and Windows OSs. AVAILABILITY AND IMPLEMENTATION: The package is available free of charge for non-commercial use (CC BY-NC-SA). It is available at https://github.com/christophuv/PeakBot. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-05-23 /pmc/articles/PMC9237678/ /pubmed/35604083 http://dx.doi.org/10.1093/bioinformatics/btac344 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Bueschl, Christoph
Doppler, Maria
Varga, Elisabeth
Seidl, Bernhard
Flasch, Mira
Warth, Benedikt
Zanghellini, Juergen
PeakBot: machine-learning-based chromatographic peak picking
title PeakBot: machine-learning-based chromatographic peak picking
title_full PeakBot: machine-learning-based chromatographic peak picking
title_fullStr PeakBot: machine-learning-based chromatographic peak picking
title_full_unstemmed PeakBot: machine-learning-based chromatographic peak picking
title_short PeakBot: machine-learning-based chromatographic peak picking
title_sort peakbot: machine-learning-based chromatographic peak picking
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9237678/
https://www.ncbi.nlm.nih.gov/pubmed/35604083
http://dx.doi.org/10.1093/bioinformatics/btac344
work_keys_str_mv AT bueschlchristoph peakbotmachinelearningbasedchromatographicpeakpicking
AT dopplermaria peakbotmachinelearningbasedchromatographicpeakpicking
AT vargaelisabeth peakbotmachinelearningbasedchromatographicpeakpicking
AT seidlbernhard peakbotmachinelearningbasedchromatographicpeakpicking
AT flaschmira peakbotmachinelearningbasedchromatographicpeakpicking
AT warthbenedikt peakbotmachinelearningbasedchromatographicpeakpicking
AT zanghellinijuergen peakbotmachinelearningbasedchromatographicpeakpicking