Cargando…
Predicting human health from biofluid-based metabolomics using machine learning
Biofluid-based metabolomics has the potential to provide highly accurate, minimally invasive diagnostics. Metabolomics studies using mass spectrometry typically reduce the high-dimensional data to only a small number of statistically significant features, that are often chemically identified—where e...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7572502/ https://www.ncbi.nlm.nih.gov/pubmed/33077825 http://dx.doi.org/10.1038/s41598-020-74823-1 |
_version_ | 1783597346139930624 |
---|---|
author | Evans, Ethan D. Duvallet, Claire Chu, Nathaniel D. Oberst, Michael K. Murphy, Michael A. Rockafellow, Isaac Sontag, David Alm, Eric J. |
author_facet | Evans, Ethan D. Duvallet, Claire Chu, Nathaniel D. Oberst, Michael K. Murphy, Michael A. Rockafellow, Isaac Sontag, David Alm, Eric J. |
author_sort | Evans, Ethan D. |
collection | PubMed |
description | Biofluid-based metabolomics has the potential to provide highly accurate, minimally invasive diagnostics. Metabolomics studies using mass spectrometry typically reduce the high-dimensional data to only a small number of statistically significant features, that are often chemically identified—where each feature corresponds to a mass-to-charge ratio, retention time, and intensity. This practice may remove a substantial amount of predictive signal. To test the utility of the complete feature set, we train machine learning models for health state-prediction in 35 human metabolomics studies, representing 148 individual data sets. Models trained with all features outperform those using only significant features and frequently provide high predictive performance across nine health state categories, despite disparate experimental and disease contexts. Using only non-significant features it is still often possible to train models and achieve high predictive performance, suggesting useful predictive signal. This work highlights the potential for health state diagnostics using all metabolomics features with data-driven analysis. |
format | Online Article Text |
id | pubmed-7572502 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-75725022020-10-21 Predicting human health from biofluid-based metabolomics using machine learning Evans, Ethan D. Duvallet, Claire Chu, Nathaniel D. Oberst, Michael K. Murphy, Michael A. Rockafellow, Isaac Sontag, David Alm, Eric J. Sci Rep Article Biofluid-based metabolomics has the potential to provide highly accurate, minimally invasive diagnostics. Metabolomics studies using mass spectrometry typically reduce the high-dimensional data to only a small number of statistically significant features, that are often chemically identified—where each feature corresponds to a mass-to-charge ratio, retention time, and intensity. This practice may remove a substantial amount of predictive signal. To test the utility of the complete feature set, we train machine learning models for health state-prediction in 35 human metabolomics studies, representing 148 individual data sets. Models trained with all features outperform those using only significant features and frequently provide high predictive performance across nine health state categories, despite disparate experimental and disease contexts. Using only non-significant features it is still often possible to train models and achieve high predictive performance, suggesting useful predictive signal. This work highlights the potential for health state diagnostics using all metabolomics features with data-driven analysis. Nature Publishing Group UK 2020-10-19 /pmc/articles/PMC7572502/ /pubmed/33077825 http://dx.doi.org/10.1038/s41598-020-74823-1 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Evans, Ethan D. Duvallet, Claire Chu, Nathaniel D. Oberst, Michael K. Murphy, Michael A. Rockafellow, Isaac Sontag, David Alm, Eric J. Predicting human health from biofluid-based metabolomics using machine learning |
title | Predicting human health from biofluid-based metabolomics using machine learning |
title_full | Predicting human health from biofluid-based metabolomics using machine learning |
title_fullStr | Predicting human health from biofluid-based metabolomics using machine learning |
title_full_unstemmed | Predicting human health from biofluid-based metabolomics using machine learning |
title_short | Predicting human health from biofluid-based metabolomics using machine learning |
title_sort | predicting human health from biofluid-based metabolomics using machine learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7572502/ https://www.ncbi.nlm.nih.gov/pubmed/33077825 http://dx.doi.org/10.1038/s41598-020-74823-1 |
work_keys_str_mv | AT evansethand predictinghumanhealthfrombiofluidbasedmetabolomicsusingmachinelearning AT duvalletclaire predictinghumanhealthfrombiofluidbasedmetabolomicsusingmachinelearning AT chunathanield predictinghumanhealthfrombiofluidbasedmetabolomicsusingmachinelearning AT oberstmichaelk predictinghumanhealthfrombiofluidbasedmetabolomicsusingmachinelearning AT murphymichaela predictinghumanhealthfrombiofluidbasedmetabolomicsusingmachinelearning AT rockafellowisaac predictinghumanhealthfrombiofluidbasedmetabolomicsusingmachinelearning AT sontagdavid predictinghumanhealthfrombiofluidbasedmetabolomicsusingmachinelearning AT almericj predictinghumanhealthfrombiofluidbasedmetabolomicsusingmachinelearning |