Cargando…

Predicting human health from biofluid-based metabolomics using machine learning

Biofluid-based metabolomics has the potential to provide highly accurate, minimally invasive diagnostics. Metabolomics studies using mass spectrometry typically reduce the high-dimensional data to only a small number of statistically significant features, that are often chemically identified—where e...

Descripción completa

Detalles Bibliográficos
Autores principales: Evans, Ethan D., Duvallet, Claire, Chu, Nathaniel D., Oberst, Michael K., Murphy, Michael A., Rockafellow, Isaac, Sontag, David, Alm, Eric J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7572502/
https://www.ncbi.nlm.nih.gov/pubmed/33077825
http://dx.doi.org/10.1038/s41598-020-74823-1
_version_ 1783597346139930624
author Evans, Ethan D.
Duvallet, Claire
Chu, Nathaniel D.
Oberst, Michael K.
Murphy, Michael A.
Rockafellow, Isaac
Sontag, David
Alm, Eric J.
author_facet Evans, Ethan D.
Duvallet, Claire
Chu, Nathaniel D.
Oberst, Michael K.
Murphy, Michael A.
Rockafellow, Isaac
Sontag, David
Alm, Eric J.
author_sort Evans, Ethan D.
collection PubMed
description Biofluid-based metabolomics has the potential to provide highly accurate, minimally invasive diagnostics. Metabolomics studies using mass spectrometry typically reduce the high-dimensional data to only a small number of statistically significant features, that are often chemically identified—where each feature corresponds to a mass-to-charge ratio, retention time, and intensity. This practice may remove a substantial amount of predictive signal. To test the utility of the complete feature set, we train machine learning models for health state-prediction in 35 human metabolomics studies, representing 148 individual data sets. Models trained with all features outperform those using only significant features and frequently provide high predictive performance across nine health state categories, despite disparate experimental and disease contexts. Using only non-significant features it is still often possible to train models and achieve high predictive performance, suggesting useful predictive signal. This work highlights the potential for health state diagnostics using all metabolomics features with data-driven analysis.
format Online
Article
Text
id pubmed-7572502
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-75725022020-10-21 Predicting human health from biofluid-based metabolomics using machine learning Evans, Ethan D. Duvallet, Claire Chu, Nathaniel D. Oberst, Michael K. Murphy, Michael A. Rockafellow, Isaac Sontag, David Alm, Eric J. Sci Rep Article Biofluid-based metabolomics has the potential to provide highly accurate, minimally invasive diagnostics. Metabolomics studies using mass spectrometry typically reduce the high-dimensional data to only a small number of statistically significant features, that are often chemically identified—where each feature corresponds to a mass-to-charge ratio, retention time, and intensity. This practice may remove a substantial amount of predictive signal. To test the utility of the complete feature set, we train machine learning models for health state-prediction in 35 human metabolomics studies, representing 148 individual data sets. Models trained with all features outperform those using only significant features and frequently provide high predictive performance across nine health state categories, despite disparate experimental and disease contexts. Using only non-significant features it is still often possible to train models and achieve high predictive performance, suggesting useful predictive signal. This work highlights the potential for health state diagnostics using all metabolomics features with data-driven analysis. Nature Publishing Group UK 2020-10-19 /pmc/articles/PMC7572502/ /pubmed/33077825 http://dx.doi.org/10.1038/s41598-020-74823-1 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Evans, Ethan D.
Duvallet, Claire
Chu, Nathaniel D.
Oberst, Michael K.
Murphy, Michael A.
Rockafellow, Isaac
Sontag, David
Alm, Eric J.
Predicting human health from biofluid-based metabolomics using machine learning
title Predicting human health from biofluid-based metabolomics using machine learning
title_full Predicting human health from biofluid-based metabolomics using machine learning
title_fullStr Predicting human health from biofluid-based metabolomics using machine learning
title_full_unstemmed Predicting human health from biofluid-based metabolomics using machine learning
title_short Predicting human health from biofluid-based metabolomics using machine learning
title_sort predicting human health from biofluid-based metabolomics using machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7572502/
https://www.ncbi.nlm.nih.gov/pubmed/33077825
http://dx.doi.org/10.1038/s41598-020-74823-1
work_keys_str_mv AT evansethand predictinghumanhealthfrombiofluidbasedmetabolomicsusingmachinelearning
AT duvalletclaire predictinghumanhealthfrombiofluidbasedmetabolomicsusingmachinelearning
AT chunathanield predictinghumanhealthfrombiofluidbasedmetabolomicsusingmachinelearning
AT oberstmichaelk predictinghumanhealthfrombiofluidbasedmetabolomicsusingmachinelearning
AT murphymichaela predictinghumanhealthfrombiofluidbasedmetabolomicsusingmachinelearning
AT rockafellowisaac predictinghumanhealthfrombiofluidbasedmetabolomicsusingmachinelearning
AT sontagdavid predictinghumanhealthfrombiofluidbasedmetabolomicsusingmachinelearning
AT almericj predictinghumanhealthfrombiofluidbasedmetabolomicsusingmachinelearning