Cargando…

automRm: An R Package for Fully Automatic LC-QQQ-MS Data Preprocessing Powered by Machine Learning

[Image: see text] Preprocessing of liquid chromatography-mass spectrometry (LC-MS) raw data facilitates downstream statistical and biological data analyses. In the case of targeted LC-MS data, consistent recognition of chromatographic peaks is a main challenge, in particular, for low abundant signal...

Descripción completa

Detalles Bibliográficos
Autores principales: Eilertz, Daniel, Mitterer, Michael, Buescher, Joerg M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9047440/
https://www.ncbi.nlm.nih.gov/pubmed/35412809
http://dx.doi.org/10.1021/acs.analchem.1c05224
_version_ 1784695726764195840
author Eilertz, Daniel
Mitterer, Michael
Buescher, Joerg M.
author_facet Eilertz, Daniel
Mitterer, Michael
Buescher, Joerg M.
author_sort Eilertz, Daniel
collection PubMed
description [Image: see text] Preprocessing of liquid chromatography-mass spectrometry (LC-MS) raw data facilitates downstream statistical and biological data analyses. In the case of targeted LC-MS data, consistent recognition of chromatographic peaks is a main challenge, in particular, for low abundant signals. Fully automatic preprocessing is faster than manual peak review and does not depend on the individual operator. Here, we present the R package automRm for fully automatic preprocessing of LC-MS data recorded in MRM mode. Using machine learning (ML) for detection of chromatographic peaks and quality control of reported results enables the automatic recognition of complex patterns in raw data. In addition, this approach renders automRm generally applicable to a wide range of analytical methods including hydrophilic interaction liquid chromatography (HILIC), which is known for sample-to-sample variations in peak shape and retention time. We demonstrate the impact of the choice of training data set, of the applied ML algorithm, and of individual peak characteristics on automRm’s ability to correctly report chromatographic peaks. Next, we show that automRm can replicate results obtained by manual peak review on published data. Moreover, automRm outperforms alternative software solutions regarding the variation in peak integration among replicate measurements and the number of correctly reported peaks when applied to a HILIC-MS data set. The R package is freely available from gitlab (https://gitlab.gwdg.de/joerg.buescher/automrm).
format Online
Article
Text
id pubmed-9047440
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-90474402022-04-29 automRm: An R Package for Fully Automatic LC-QQQ-MS Data Preprocessing Powered by Machine Learning Eilertz, Daniel Mitterer, Michael Buescher, Joerg M. Anal Chem [Image: see text] Preprocessing of liquid chromatography-mass spectrometry (LC-MS) raw data facilitates downstream statistical and biological data analyses. In the case of targeted LC-MS data, consistent recognition of chromatographic peaks is a main challenge, in particular, for low abundant signals. Fully automatic preprocessing is faster than manual peak review and does not depend on the individual operator. Here, we present the R package automRm for fully automatic preprocessing of LC-MS data recorded in MRM mode. Using machine learning (ML) for detection of chromatographic peaks and quality control of reported results enables the automatic recognition of complex patterns in raw data. In addition, this approach renders automRm generally applicable to a wide range of analytical methods including hydrophilic interaction liquid chromatography (HILIC), which is known for sample-to-sample variations in peak shape and retention time. We demonstrate the impact of the choice of training data set, of the applied ML algorithm, and of individual peak characteristics on automRm’s ability to correctly report chromatographic peaks. Next, we show that automRm can replicate results obtained by manual peak review on published data. Moreover, automRm outperforms alternative software solutions regarding the variation in peak integration among replicate measurements and the number of correctly reported peaks when applied to a HILIC-MS data set. The R package is freely available from gitlab (https://gitlab.gwdg.de/joerg.buescher/automrm). American Chemical Society 2022-04-12 2022-04-26 /pmc/articles/PMC9047440/ /pubmed/35412809 http://dx.doi.org/10.1021/acs.analchem.1c05224 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Eilertz, Daniel
Mitterer, Michael
Buescher, Joerg M.
automRm: An R Package for Fully Automatic LC-QQQ-MS Data Preprocessing Powered by Machine Learning
title automRm: An R Package for Fully Automatic LC-QQQ-MS Data Preprocessing Powered by Machine Learning
title_full automRm: An R Package for Fully Automatic LC-QQQ-MS Data Preprocessing Powered by Machine Learning
title_fullStr automRm: An R Package for Fully Automatic LC-QQQ-MS Data Preprocessing Powered by Machine Learning
title_full_unstemmed automRm: An R Package for Fully Automatic LC-QQQ-MS Data Preprocessing Powered by Machine Learning
title_short automRm: An R Package for Fully Automatic LC-QQQ-MS Data Preprocessing Powered by Machine Learning
title_sort automrm: an r package for fully automatic lc-qqq-ms data preprocessing powered by machine learning
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9047440/
https://www.ncbi.nlm.nih.gov/pubmed/35412809
http://dx.doi.org/10.1021/acs.analchem.1c05224
work_keys_str_mv AT eilertzdaniel automrmanrpackageforfullyautomaticlcqqqmsdatapreprocessingpoweredbymachinelearning
AT mitterermichael automrmanrpackageforfullyautomaticlcqqqmsdatapreprocessingpoweredbymachinelearning
AT buescherjoergm automrmanrpackageforfullyautomaticlcqqqmsdatapreprocessingpoweredbymachinelearning