Cargando…

A Comprehensive Evaluation of Metabolomics Data Preprocessing Methods for Deep Learning

Machine learning has greatly advanced over the past decade, owing to advances in algorithmic innovations, hardware acceleration, and benchmark datasets to train on domains such as computer vision, natural-language processing, and more recently the life sciences. In particular, the subfield of machin...

Descripción completa

Detalles Bibliográficos
Autores principales: Abram, Krzysztof Jan, McCloskey, Douglas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8948616/
https://www.ncbi.nlm.nih.gov/pubmed/35323644
http://dx.doi.org/10.3390/metabo12030202
_version_ 1784674695659913216
author Abram, Krzysztof Jan
McCloskey, Douglas
author_facet Abram, Krzysztof Jan
McCloskey, Douglas
author_sort Abram, Krzysztof Jan
collection PubMed
description Machine learning has greatly advanced over the past decade, owing to advances in algorithmic innovations, hardware acceleration, and benchmark datasets to train on domains such as computer vision, natural-language processing, and more recently the life sciences. In particular, the subfield of machine learning known as deep learning has found applications in genomics, proteomics, and metabolomics. However, a thorough assessment of how the data preprocessing methods required for the analysis of life science data affect the performance of deep learning is lacking. This work contributes to filling that gap by assessing the impact of commonly used as well as newly developed methods employed in data preprocessing workflows for metabolomics that span from raw data to processed data. The results from these analyses are summarized into a set of best practices that can be used by researchers as a starting point for downstream classification and reconstruction tasks using deep learning.
format Online
Article
Text
id pubmed-8948616
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-89486162022-03-26 A Comprehensive Evaluation of Metabolomics Data Preprocessing Methods for Deep Learning Abram, Krzysztof Jan McCloskey, Douglas Metabolites Article Machine learning has greatly advanced over the past decade, owing to advances in algorithmic innovations, hardware acceleration, and benchmark datasets to train on domains such as computer vision, natural-language processing, and more recently the life sciences. In particular, the subfield of machine learning known as deep learning has found applications in genomics, proteomics, and metabolomics. However, a thorough assessment of how the data preprocessing methods required for the analysis of life science data affect the performance of deep learning is lacking. This work contributes to filling that gap by assessing the impact of commonly used as well as newly developed methods employed in data preprocessing workflows for metabolomics that span from raw data to processed data. The results from these analyses are summarized into a set of best practices that can be used by researchers as a starting point for downstream classification and reconstruction tasks using deep learning. MDPI 2022-02-24 /pmc/articles/PMC8948616/ /pubmed/35323644 http://dx.doi.org/10.3390/metabo12030202 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Abram, Krzysztof Jan
McCloskey, Douglas
A Comprehensive Evaluation of Metabolomics Data Preprocessing Methods for Deep Learning
title A Comprehensive Evaluation of Metabolomics Data Preprocessing Methods for Deep Learning
title_full A Comprehensive Evaluation of Metabolomics Data Preprocessing Methods for Deep Learning
title_fullStr A Comprehensive Evaluation of Metabolomics Data Preprocessing Methods for Deep Learning
title_full_unstemmed A Comprehensive Evaluation of Metabolomics Data Preprocessing Methods for Deep Learning
title_short A Comprehensive Evaluation of Metabolomics Data Preprocessing Methods for Deep Learning
title_sort comprehensive evaluation of metabolomics data preprocessing methods for deep learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8948616/
https://www.ncbi.nlm.nih.gov/pubmed/35323644
http://dx.doi.org/10.3390/metabo12030202
work_keys_str_mv AT abramkrzysztofjan acomprehensiveevaluationofmetabolomicsdatapreprocessingmethodsfordeeplearning
AT mccloskeydouglas acomprehensiveevaluationofmetabolomicsdatapreprocessingmethodsfordeeplearning
AT abramkrzysztofjan comprehensiveevaluationofmetabolomicsdatapreprocessingmethodsfordeeplearning
AT mccloskeydouglas comprehensiveevaluationofmetabolomicsdatapreprocessingmethodsfordeeplearning