Cargando…

Interpretable Machine Learning on Metabolomics Data Reveals Biomarkers for Parkinson’s Disease

[Image: see text] The use of machine learning (ML) with metabolomics provides opportunities for the early diagnosis of disease. However, the accuracy of ML and extent of information obtained from metabolomics can be limited owing to challenges associated with interpreting disease prediction models a...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, J. Diana, Xue, Chonghua, Kolachalama, Vijaya B., Donald, William A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10214508/
https://www.ncbi.nlm.nih.gov/pubmed/37252351
http://dx.doi.org/10.1021/acscentsci.2c01468
_version_ 1785047855658958848
author Zhang, J. Diana
Xue, Chonghua
Kolachalama, Vijaya B.
Donald, William A.
author_facet Zhang, J. Diana
Xue, Chonghua
Kolachalama, Vijaya B.
Donald, William A.
author_sort Zhang, J. Diana
collection PubMed
description [Image: see text] The use of machine learning (ML) with metabolomics provides opportunities for the early diagnosis of disease. However, the accuracy of ML and extent of information obtained from metabolomics can be limited owing to challenges associated with interpreting disease prediction models and analyzing many chemical features with abundances that are correlated and “noisy”. Here, we report an interpretable neural network (NN) framework to accurately predict disease and identify significant biomarkers using whole metabolomics data sets without a priori feature selection. The performance of the NN approach for predicting Parkinson’s disease (PD) from blood plasma metabolomics data is significantly higher than other ML methods with a mean area under the curve of >0.995. PD-specific markers that predate clinical PD diagnosis and contribute significantly to early disease prediction were identified including an exogenous polyfluoroalkyl substance. It is anticipated that this accurate and interpretable NN-based approach can improve diagnostic performance for many diseases using metabolomics and other untargeted ‘omics methods.
format Online
Article
Text
id pubmed-10214508
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-102145082023-05-27 Interpretable Machine Learning on Metabolomics Data Reveals Biomarkers for Parkinson’s Disease Zhang, J. Diana Xue, Chonghua Kolachalama, Vijaya B. Donald, William A. ACS Cent Sci [Image: see text] The use of machine learning (ML) with metabolomics provides opportunities for the early diagnosis of disease. However, the accuracy of ML and extent of information obtained from metabolomics can be limited owing to challenges associated with interpreting disease prediction models and analyzing many chemical features with abundances that are correlated and “noisy”. Here, we report an interpretable neural network (NN) framework to accurately predict disease and identify significant biomarkers using whole metabolomics data sets without a priori feature selection. The performance of the NN approach for predicting Parkinson’s disease (PD) from blood plasma metabolomics data is significantly higher than other ML methods with a mean area under the curve of >0.995. PD-specific markers that predate clinical PD diagnosis and contribute significantly to early disease prediction were identified including an exogenous polyfluoroalkyl substance. It is anticipated that this accurate and interpretable NN-based approach can improve diagnostic performance for many diseases using metabolomics and other untargeted ‘omics methods. American Chemical Society 2023-05-09 /pmc/articles/PMC10214508/ /pubmed/37252351 http://dx.doi.org/10.1021/acscentsci.2c01468 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Zhang, J. Diana
Xue, Chonghua
Kolachalama, Vijaya B.
Donald, William A.
Interpretable Machine Learning on Metabolomics Data Reveals Biomarkers for Parkinson’s Disease
title Interpretable Machine Learning on Metabolomics Data Reveals Biomarkers for Parkinson’s Disease
title_full Interpretable Machine Learning on Metabolomics Data Reveals Biomarkers for Parkinson’s Disease
title_fullStr Interpretable Machine Learning on Metabolomics Data Reveals Biomarkers for Parkinson’s Disease
title_full_unstemmed Interpretable Machine Learning on Metabolomics Data Reveals Biomarkers for Parkinson’s Disease
title_short Interpretable Machine Learning on Metabolomics Data Reveals Biomarkers for Parkinson’s Disease
title_sort interpretable machine learning on metabolomics data reveals biomarkers for parkinson’s disease
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10214508/
https://www.ncbi.nlm.nih.gov/pubmed/37252351
http://dx.doi.org/10.1021/acscentsci.2c01468
work_keys_str_mv AT zhangjdiana interpretablemachinelearningonmetabolomicsdatarevealsbiomarkersforparkinsonsdisease
AT xuechonghua interpretablemachinelearningonmetabolomicsdatarevealsbiomarkersforparkinsonsdisease
AT kolachalamavijayab interpretablemachinelearningonmetabolomicsdatarevealsbiomarkersforparkinsonsdisease
AT donaldwilliama interpretablemachinelearningonmetabolomicsdatarevealsbiomarkersforparkinsonsdisease