Cargando…

Supervised machine learning in the mass spectrometry laboratory: A tutorial

As the demand for laboratory testing by mass spectrometry increases, so does the need for automated methods for data analysis. Clinical mass spectrometry (MS) data is particularly well-suited for machine learning (ML) methods, which deal nicely with structured and discrete data elements. The alignme...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Edward S., Durant, Thomas J.S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8692990/
https://www.ncbi.nlm.nih.gov/pubmed/34984411
http://dx.doi.org/10.1016/j.jmsacl.2021.12.001
_version_ 1784619050138075136
author Lee, Edward S.
Durant, Thomas J.S.
author_facet Lee, Edward S.
Durant, Thomas J.S.
author_sort Lee, Edward S.
collection PubMed
description As the demand for laboratory testing by mass spectrometry increases, so does the need for automated methods for data analysis. Clinical mass spectrometry (MS) data is particularly well-suited for machine learning (ML) methods, which deal nicely with structured and discrete data elements. The alignment of these two fields offers a promising synergy that can be used to optimize workflows, improve result quality, and enhance our understanding of high-dimensional datasets and their inherent relationship with disease. In recent years, there has been an increasing number of publications that examine the capabilities of ML-based software in the context of chromatography and MS. However, given the historically distant nature between the fields of clinical chemistry and computer science, there is an opportunity to improve technological literacy of ML-based software within the clinical laboratory scientist community. To this end, we present a basic overview of ML and a tutorial of an ML-based experiment using a previously published MS dataset. The purpose of this paper is to describe the fundamental principles of supervised ML, outline the steps that are classically involved in an ML-based experiment, and discuss the purpose of good ML practice in the context of a binary MS classification problem.
format Online
Article
Text
id pubmed-8692990
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-86929902022-01-03 Supervised machine learning in the mass spectrometry laboratory: A tutorial Lee, Edward S. Durant, Thomas J.S. J Mass Spectrom Adv Clin Lab Special issue on Data Science As the demand for laboratory testing by mass spectrometry increases, so does the need for automated methods for data analysis. Clinical mass spectrometry (MS) data is particularly well-suited for machine learning (ML) methods, which deal nicely with structured and discrete data elements. The alignment of these two fields offers a promising synergy that can be used to optimize workflows, improve result quality, and enhance our understanding of high-dimensional datasets and their inherent relationship with disease. In recent years, there has been an increasing number of publications that examine the capabilities of ML-based software in the context of chromatography and MS. However, given the historically distant nature between the fields of clinical chemistry and computer science, there is an opportunity to improve technological literacy of ML-based software within the clinical laboratory scientist community. To this end, we present a basic overview of ML and a tutorial of an ML-based experiment using a previously published MS dataset. The purpose of this paper is to describe the fundamental principles of supervised ML, outline the steps that are classically involved in an ML-based experiment, and discuss the purpose of good ML practice in the context of a binary MS classification problem. Elsevier 2021-12-13 /pmc/articles/PMC8692990/ /pubmed/34984411 http://dx.doi.org/10.1016/j.jmsacl.2021.12.001 Text en © 2021 THE AUTHORS https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Special issue on Data Science
Lee, Edward S.
Durant, Thomas J.S.
Supervised machine learning in the mass spectrometry laboratory: A tutorial
title Supervised machine learning in the mass spectrometry laboratory: A tutorial
title_full Supervised machine learning in the mass spectrometry laboratory: A tutorial
title_fullStr Supervised machine learning in the mass spectrometry laboratory: A tutorial
title_full_unstemmed Supervised machine learning in the mass spectrometry laboratory: A tutorial
title_short Supervised machine learning in the mass spectrometry laboratory: A tutorial
title_sort supervised machine learning in the mass spectrometry laboratory: a tutorial
topic Special issue on Data Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8692990/
https://www.ncbi.nlm.nih.gov/pubmed/34984411
http://dx.doi.org/10.1016/j.jmsacl.2021.12.001
work_keys_str_mv AT leeedwards supervisedmachinelearninginthemassspectrometrylaboratoryatutorial
AT durantthomasjs supervisedmachinelearninginthemassspectrometrylaboratoryatutorial