Cargando…
Automated supervised learning pipeline for non-targeted GC-MS data analysis
Non-targeted analysis is nowadays applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. Conventional processing strategies for GC-MS data include baseline correction, feature detection, and retention time alignment before multivariate modeli...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7587030/ https://www.ncbi.nlm.nih.gov/pubmed/33117972 http://dx.doi.org/10.1016/j.acax.2019.100005 |
_version_ | 1783600109913636864 |
---|---|
author | Sirén, Kimmo Fischer, Ulrich Vestner, Jochen |
author_facet | Sirén, Kimmo Fischer, Ulrich Vestner, Jochen |
author_sort | Sirén, Kimmo |
collection | PubMed |
description | Non-targeted analysis is nowadays applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. Conventional processing strategies for GC-MS data include baseline correction, feature detection, and retention time alignment before multivariate modeling. These techniques can be prone to errors and therefore time-consuming manual corrections are generally necessary. We introduce here a novel fully automated approach to non-targeted GC-MS data processing. This new approach avoids feature extraction and retention time alignment. Supervised machine learning on decomposed tensors of segmented chromatographic raw data signal is used to rank regions in the chromatograms contributing to differentiation between sample classes. The performance of this novel data analysis approach is demonstrated on three published datasets. |
format | Online Article Text |
id | pubmed-7587030 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-75870302020-10-27 Automated supervised learning pipeline for non-targeted GC-MS data analysis Sirén, Kimmo Fischer, Ulrich Vestner, Jochen Anal Chim Acta X Article Non-targeted analysis is nowadays applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. Conventional processing strategies for GC-MS data include baseline correction, feature detection, and retention time alignment before multivariate modeling. These techniques can be prone to errors and therefore time-consuming manual corrections are generally necessary. We introduce here a novel fully automated approach to non-targeted GC-MS data processing. This new approach avoids feature extraction and retention time alignment. Supervised machine learning on decomposed tensors of segmented chromatographic raw data signal is used to rank regions in the chromatograms contributing to differentiation between sample classes. The performance of this novel data analysis approach is demonstrated on three published datasets. Elsevier 2019-01-10 /pmc/articles/PMC7587030/ /pubmed/33117972 http://dx.doi.org/10.1016/j.acax.2019.100005 Text en © 2019 Published by Elsevier B.V. http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Article Sirén, Kimmo Fischer, Ulrich Vestner, Jochen Automated supervised learning pipeline for non-targeted GC-MS data analysis |
title | Automated supervised learning pipeline for non-targeted GC-MS data analysis |
title_full | Automated supervised learning pipeline for non-targeted GC-MS data analysis |
title_fullStr | Automated supervised learning pipeline for non-targeted GC-MS data analysis |
title_full_unstemmed | Automated supervised learning pipeline for non-targeted GC-MS data analysis |
title_short | Automated supervised learning pipeline for non-targeted GC-MS data analysis |
title_sort | automated supervised learning pipeline for non-targeted gc-ms data analysis |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7587030/ https://www.ncbi.nlm.nih.gov/pubmed/33117972 http://dx.doi.org/10.1016/j.acax.2019.100005 |
work_keys_str_mv | AT sirenkimmo automatedsupervisedlearningpipelinefornontargetedgcmsdataanalysis AT fischerulrich automatedsupervisedlearningpipelinefornontargetedgcmsdataanalysis AT vestnerjochen automatedsupervisedlearningpipelinefornontargetedgcmsdataanalysis |