Cargando…
Machine learning based on metabolomics reveals potential targets and biomarkers for primary Sjogren’s syndrome
Background: Using machine learning based on metabolomics, this study aimed to construct an effective primary Sjogren’s syndrome (pSS) diagnostics model and reveal the potential targets and biomarkers of pSS. Methods: From a total of 39 patients with pSS and 38 healthy controls (HCs), serum specimens...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9483105/ https://www.ncbi.nlm.nih.gov/pubmed/36133908 http://dx.doi.org/10.3389/fmolb.2022.913325 |
_version_ | 1784791601915101184 |
---|---|
author | Wang, Kai Li, Ju Meng, Deqian Zhang, Zhongyuan Liu, Shanshan |
author_facet | Wang, Kai Li, Ju Meng, Deqian Zhang, Zhongyuan Liu, Shanshan |
author_sort | Wang, Kai |
collection | PubMed |
description | Background: Using machine learning based on metabolomics, this study aimed to construct an effective primary Sjogren’s syndrome (pSS) diagnostics model and reveal the potential targets and biomarkers of pSS. Methods: From a total of 39 patients with pSS and 38 healthy controls (HCs), serum specimens were collected. The samples were analyzed by ultra-high-performance liquid chromatography coupled with high-resolution mass spectrometry. Three machine learning algorithms, including the least absolute shrinkage and selection operator (LASSO), random forest (RF), and extreme gradient boosting (XGBoost), were used to build the pSS diagnosis models. Afterward, four machine learning methods were used to reduce the dimensionality of the metabolomics data. Finally, metabolites with significant differences were screened and pathway analysis was conducted. Results: The area under the curve (AUC), sensitivity, and specificity of LASSO, RF and XGBoost test set all reached 1.00. Orthogonal partial least squares discriminant analysis was used to classify the metabolomics data. By combining the results of the univariate false discovery rate and the importance of the variable in projection, we identified 21 significantly different metabolites. Using these 21 metabolites for diagnostic modeling, the AUC, sensitivity, and specificity of LASSO, RF, and XGBoost all reached 1.00. Metabolic pathway analysis revealed that these 21 metabolites are highly correlated with amino acid and lipid metabolisms. On the basis of 21 metabolites, we screened the important variables in the models. Further, five common variables were obtained by intersecting the important variables of three models. Based on these five common variables, the AUC, sensitivity, and specificity of LASSO, RF, and XGBoost all reached 1.00.2-Hydroxypalmitic acid, L-carnitine and cyclic AMP were found to be potential targets and specific biomarkers for pSS. Conclusion: The combination of machine learning and metabolomics can accurately distinguish between patients with pSS and HCs. 2-Hydroxypalmitic acid, L-carnitine and cyclic AMP were potential targets and biomarkers for pSS. |
format | Online Article Text |
id | pubmed-9483105 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-94831052022-09-20 Machine learning based on metabolomics reveals potential targets and biomarkers for primary Sjogren’s syndrome Wang, Kai Li, Ju Meng, Deqian Zhang, Zhongyuan Liu, Shanshan Front Mol Biosci Molecular Biosciences Background: Using machine learning based on metabolomics, this study aimed to construct an effective primary Sjogren’s syndrome (pSS) diagnostics model and reveal the potential targets and biomarkers of pSS. Methods: From a total of 39 patients with pSS and 38 healthy controls (HCs), serum specimens were collected. The samples were analyzed by ultra-high-performance liquid chromatography coupled with high-resolution mass spectrometry. Three machine learning algorithms, including the least absolute shrinkage and selection operator (LASSO), random forest (RF), and extreme gradient boosting (XGBoost), were used to build the pSS diagnosis models. Afterward, four machine learning methods were used to reduce the dimensionality of the metabolomics data. Finally, metabolites with significant differences were screened and pathway analysis was conducted. Results: The area under the curve (AUC), sensitivity, and specificity of LASSO, RF and XGBoost test set all reached 1.00. Orthogonal partial least squares discriminant analysis was used to classify the metabolomics data. By combining the results of the univariate false discovery rate and the importance of the variable in projection, we identified 21 significantly different metabolites. Using these 21 metabolites for diagnostic modeling, the AUC, sensitivity, and specificity of LASSO, RF, and XGBoost all reached 1.00. Metabolic pathway analysis revealed that these 21 metabolites are highly correlated with amino acid and lipid metabolisms. On the basis of 21 metabolites, we screened the important variables in the models. Further, five common variables were obtained by intersecting the important variables of three models. Based on these five common variables, the AUC, sensitivity, and specificity of LASSO, RF, and XGBoost all reached 1.00.2-Hydroxypalmitic acid, L-carnitine and cyclic AMP were found to be potential targets and specific biomarkers for pSS. Conclusion: The combination of machine learning and metabolomics can accurately distinguish between patients with pSS and HCs. 2-Hydroxypalmitic acid, L-carnitine and cyclic AMP were potential targets and biomarkers for pSS. Frontiers Media S.A. 2022-09-05 /pmc/articles/PMC9483105/ /pubmed/36133908 http://dx.doi.org/10.3389/fmolb.2022.913325 Text en Copyright © 2022 Wang, Li, Meng, Zhang and Liu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Molecular Biosciences Wang, Kai Li, Ju Meng, Deqian Zhang, Zhongyuan Liu, Shanshan Machine learning based on metabolomics reveals potential targets and biomarkers for primary Sjogren’s syndrome |
title | Machine learning based on metabolomics reveals potential targets and biomarkers for primary Sjogren’s syndrome |
title_full | Machine learning based on metabolomics reveals potential targets and biomarkers for primary Sjogren’s syndrome |
title_fullStr | Machine learning based on metabolomics reveals potential targets and biomarkers for primary Sjogren’s syndrome |
title_full_unstemmed | Machine learning based on metabolomics reveals potential targets and biomarkers for primary Sjogren’s syndrome |
title_short | Machine learning based on metabolomics reveals potential targets and biomarkers for primary Sjogren’s syndrome |
title_sort | machine learning based on metabolomics reveals potential targets and biomarkers for primary sjogren’s syndrome |
topic | Molecular Biosciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9483105/ https://www.ncbi.nlm.nih.gov/pubmed/36133908 http://dx.doi.org/10.3389/fmolb.2022.913325 |
work_keys_str_mv | AT wangkai machinelearningbasedonmetabolomicsrevealspotentialtargetsandbiomarkersforprimarysjogrenssyndrome AT liju machinelearningbasedonmetabolomicsrevealspotentialtargetsandbiomarkersforprimarysjogrenssyndrome AT mengdeqian machinelearningbasedonmetabolomicsrevealspotentialtargetsandbiomarkersforprimarysjogrenssyndrome AT zhangzhongyuan machinelearningbasedonmetabolomicsrevealspotentialtargetsandbiomarkersforprimarysjogrenssyndrome AT liushanshan machinelearningbasedonmetabolomicsrevealspotentialtargetsandbiomarkersforprimarysjogrenssyndrome |