Cargando…

Machine learning based on metabolomics reveals potential targets and biomarkers for primary Sjogren’s syndrome

Background: Using machine learning based on metabolomics, this study aimed to construct an effective primary Sjogren’s syndrome (pSS) diagnostics model and reveal the potential targets and biomarkers of pSS. Methods: From a total of 39 patients with pSS and 38 healthy controls (HCs), serum specimens...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Kai, Li, Ju, Meng, Deqian, Zhang, Zhongyuan, Liu, Shanshan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9483105/
https://www.ncbi.nlm.nih.gov/pubmed/36133908
http://dx.doi.org/10.3389/fmolb.2022.913325
_version_ 1784791601915101184
author Wang, Kai
Li, Ju
Meng, Deqian
Zhang, Zhongyuan
Liu, Shanshan
author_facet Wang, Kai
Li, Ju
Meng, Deqian
Zhang, Zhongyuan
Liu, Shanshan
author_sort Wang, Kai
collection PubMed
description Background: Using machine learning based on metabolomics, this study aimed to construct an effective primary Sjogren’s syndrome (pSS) diagnostics model and reveal the potential targets and biomarkers of pSS. Methods: From a total of 39 patients with pSS and 38 healthy controls (HCs), serum specimens were collected. The samples were analyzed by ultra-high-performance liquid chromatography coupled with high-resolution mass spectrometry. Three machine learning algorithms, including the least absolute shrinkage and selection operator (LASSO), random forest (RF), and extreme gradient boosting (XGBoost), were used to build the pSS diagnosis models. Afterward, four machine learning methods were used to reduce the dimensionality of the metabolomics data. Finally, metabolites with significant differences were screened and pathway analysis was conducted. Results: The area under the curve (AUC), sensitivity, and specificity of LASSO, RF and XGBoost test set all reached 1.00. Orthogonal partial least squares discriminant analysis was used to classify the metabolomics data. By combining the results of the univariate false discovery rate and the importance of the variable in projection, we identified 21 significantly different metabolites. Using these 21 metabolites for diagnostic modeling, the AUC, sensitivity, and specificity of LASSO, RF, and XGBoost all reached 1.00. Metabolic pathway analysis revealed that these 21 metabolites are highly correlated with amino acid and lipid metabolisms. On the basis of 21 metabolites, we screened the important variables in the models. Further, five common variables were obtained by intersecting the important variables of three models. Based on these five common variables, the AUC, sensitivity, and specificity of LASSO, RF, and XGBoost all reached 1.00.2-Hydroxypalmitic acid, L-carnitine and cyclic AMP were found to be potential targets and specific biomarkers for pSS. Conclusion: The combination of machine learning and metabolomics can accurately distinguish between patients with pSS and HCs. 2-Hydroxypalmitic acid, L-carnitine and cyclic AMP were potential targets and biomarkers for pSS.
format Online
Article
Text
id pubmed-9483105
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-94831052022-09-20 Machine learning based on metabolomics reveals potential targets and biomarkers for primary Sjogren’s syndrome Wang, Kai Li, Ju Meng, Deqian Zhang, Zhongyuan Liu, Shanshan Front Mol Biosci Molecular Biosciences Background: Using machine learning based on metabolomics, this study aimed to construct an effective primary Sjogren’s syndrome (pSS) diagnostics model and reveal the potential targets and biomarkers of pSS. Methods: From a total of 39 patients with pSS and 38 healthy controls (HCs), serum specimens were collected. The samples were analyzed by ultra-high-performance liquid chromatography coupled with high-resolution mass spectrometry. Three machine learning algorithms, including the least absolute shrinkage and selection operator (LASSO), random forest (RF), and extreme gradient boosting (XGBoost), were used to build the pSS diagnosis models. Afterward, four machine learning methods were used to reduce the dimensionality of the metabolomics data. Finally, metabolites with significant differences were screened and pathway analysis was conducted. Results: The area under the curve (AUC), sensitivity, and specificity of LASSO, RF and XGBoost test set all reached 1.00. Orthogonal partial least squares discriminant analysis was used to classify the metabolomics data. By combining the results of the univariate false discovery rate and the importance of the variable in projection, we identified 21 significantly different metabolites. Using these 21 metabolites for diagnostic modeling, the AUC, sensitivity, and specificity of LASSO, RF, and XGBoost all reached 1.00. Metabolic pathway analysis revealed that these 21 metabolites are highly correlated with amino acid and lipid metabolisms. On the basis of 21 metabolites, we screened the important variables in the models. Further, five common variables were obtained by intersecting the important variables of three models. Based on these five common variables, the AUC, sensitivity, and specificity of LASSO, RF, and XGBoost all reached 1.00.2-Hydroxypalmitic acid, L-carnitine and cyclic AMP were found to be potential targets and specific biomarkers for pSS. Conclusion: The combination of machine learning and metabolomics can accurately distinguish between patients with pSS and HCs. 2-Hydroxypalmitic acid, L-carnitine and cyclic AMP were potential targets and biomarkers for pSS. Frontiers Media S.A. 2022-09-05 /pmc/articles/PMC9483105/ /pubmed/36133908 http://dx.doi.org/10.3389/fmolb.2022.913325 Text en Copyright © 2022 Wang, Li, Meng, Zhang and Liu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Molecular Biosciences
Wang, Kai
Li, Ju
Meng, Deqian
Zhang, Zhongyuan
Liu, Shanshan
Machine learning based on metabolomics reveals potential targets and biomarkers for primary Sjogren’s syndrome
title Machine learning based on metabolomics reveals potential targets and biomarkers for primary Sjogren’s syndrome
title_full Machine learning based on metabolomics reveals potential targets and biomarkers for primary Sjogren’s syndrome
title_fullStr Machine learning based on metabolomics reveals potential targets and biomarkers for primary Sjogren’s syndrome
title_full_unstemmed Machine learning based on metabolomics reveals potential targets and biomarkers for primary Sjogren’s syndrome
title_short Machine learning based on metabolomics reveals potential targets and biomarkers for primary Sjogren’s syndrome
title_sort machine learning based on metabolomics reveals potential targets and biomarkers for primary sjogren’s syndrome
topic Molecular Biosciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9483105/
https://www.ncbi.nlm.nih.gov/pubmed/36133908
http://dx.doi.org/10.3389/fmolb.2022.913325
work_keys_str_mv AT wangkai machinelearningbasedonmetabolomicsrevealspotentialtargetsandbiomarkersforprimarysjogrenssyndrome
AT liju machinelearningbasedonmetabolomicsrevealspotentialtargetsandbiomarkersforprimarysjogrenssyndrome
AT mengdeqian machinelearningbasedonmetabolomicsrevealspotentialtargetsandbiomarkersforprimarysjogrenssyndrome
AT zhangzhongyuan machinelearningbasedonmetabolomicsrevealspotentialtargetsandbiomarkersforprimarysjogrenssyndrome
AT liushanshan machinelearningbasedonmetabolomicsrevealspotentialtargetsandbiomarkersforprimarysjogrenssyndrome