Cargando…

Automated supervised learning pipeline for non-targeted GC-MS data analysis

Non-targeted analysis is nowadays applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. Conventional processing strategies for GC-MS data include baseline correction, feature detection, and retention time alignment before multivariate modeli...

Descripción completa

Detalles Bibliográficos
Autores principales: Sirén, Kimmo, Fischer, Ulrich, Vestner, Jochen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7587030/
https://www.ncbi.nlm.nih.gov/pubmed/33117972
http://dx.doi.org/10.1016/j.acax.2019.100005
_version_ 1783600109913636864
author Sirén, Kimmo
Fischer, Ulrich
Vestner, Jochen
author_facet Sirén, Kimmo
Fischer, Ulrich
Vestner, Jochen
author_sort Sirén, Kimmo
collection PubMed
description Non-targeted analysis is nowadays applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. Conventional processing strategies for GC-MS data include baseline correction, feature detection, and retention time alignment before multivariate modeling. These techniques can be prone to errors and therefore time-consuming manual corrections are generally necessary. We introduce here a novel fully automated approach to non-targeted GC-MS data processing. This new approach avoids feature extraction and retention time alignment. Supervised machine learning on decomposed tensors of segmented chromatographic raw data signal is used to rank regions in the chromatograms contributing to differentiation between sample classes. The performance of this novel data analysis approach is demonstrated on three published datasets.
format Online
Article
Text
id pubmed-7587030
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-75870302020-10-27 Automated supervised learning pipeline for non-targeted GC-MS data analysis Sirén, Kimmo Fischer, Ulrich Vestner, Jochen Anal Chim Acta X Article Non-targeted analysis is nowadays applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. Conventional processing strategies for GC-MS data include baseline correction, feature detection, and retention time alignment before multivariate modeling. These techniques can be prone to errors and therefore time-consuming manual corrections are generally necessary. We introduce here a novel fully automated approach to non-targeted GC-MS data processing. This new approach avoids feature extraction and retention time alignment. Supervised machine learning on decomposed tensors of segmented chromatographic raw data signal is used to rank regions in the chromatograms contributing to differentiation between sample classes. The performance of this novel data analysis approach is demonstrated on three published datasets. Elsevier 2019-01-10 /pmc/articles/PMC7587030/ /pubmed/33117972 http://dx.doi.org/10.1016/j.acax.2019.100005 Text en © 2019 Published by Elsevier B.V. http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Sirén, Kimmo
Fischer, Ulrich
Vestner, Jochen
Automated supervised learning pipeline for non-targeted GC-MS data analysis
title Automated supervised learning pipeline for non-targeted GC-MS data analysis
title_full Automated supervised learning pipeline for non-targeted GC-MS data analysis
title_fullStr Automated supervised learning pipeline for non-targeted GC-MS data analysis
title_full_unstemmed Automated supervised learning pipeline for non-targeted GC-MS data analysis
title_short Automated supervised learning pipeline for non-targeted GC-MS data analysis
title_sort automated supervised learning pipeline for non-targeted gc-ms data analysis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7587030/
https://www.ncbi.nlm.nih.gov/pubmed/33117972
http://dx.doi.org/10.1016/j.acax.2019.100005
work_keys_str_mv AT sirenkimmo automatedsupervisedlearningpipelinefornontargetedgcmsdataanalysis
AT fischerulrich automatedsupervisedlearningpipelinefornontargetedgcmsdataanalysis
AT vestnerjochen automatedsupervisedlearningpipelinefornontargetedgcmsdataanalysis