Cargando…

Development and verification of a combined diagnostic model for primary Sjögren's syndrome by integrated bioinformatics analysis and machine learning

Primary Sjögren’s syndrome (pSS) is a chronic, systemic autoimmune disease mostly affecting the exocrine glands. This debilitating condition is complex and specific treatments remain unavailable. There is a need for the development of novel diagnostic models for early screening. Four gene profiling...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Kun, Wang, Qi, Wu, Li, Gao, Qi-Chao, Tang, Shan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10224947/
https://www.ncbi.nlm.nih.gov/pubmed/37244954
http://dx.doi.org/10.1038/s41598-023-35864-4
_version_ 1785050300748398592
author Yang, Kun
Wang, Qi
Wu, Li
Gao, Qi-Chao
Tang, Shan
author_facet Yang, Kun
Wang, Qi
Wu, Li
Gao, Qi-Chao
Tang, Shan
author_sort Yang, Kun
collection PubMed
description Primary Sjögren’s syndrome (pSS) is a chronic, systemic autoimmune disease mostly affecting the exocrine glands. This debilitating condition is complex and specific treatments remain unavailable. There is a need for the development of novel diagnostic models for early screening. Four gene profiling datasets were downloaded from the Gene Expression Omnibus database. The ‘limma’ software package was used to identify differentially expressed genes (DEGs). A random forest-supervised classification algorithm was used to screen disease-specific genes, and three machine learning algorithms, including artificial neural networks (ANN), random forest (RF), and support vector machines (SVM), were used to build a pSS diagnostic model. The performance of the model was measured using its area under the receiver operating characteristic curve. Immune cell infiltration was investigated using the CIBERSORT algorithm. A total of 96 DEGs were identified. By utilizing a RF classifier, a set of 14 signature genes that are pivotal in transcription regulation and disease progression in pSS were identified. Through the utilization of training and testing datasets, diagnostic models for pSS were successfully designed using ANN, RF, and SVM, resulting in AUCs of 0.972, 1.00, and 0.9742, respectively. The validation set yielded AUCs of 0.766, 0.8321, and 0.8223. It was the RF model that produced the best prediction performance out of the three models tested. As a result, an early predictive model for pSS was successfully developed with high diagnostic performance, providing a valuable resource for the screening and early diagnosis of pSS.
format Online
Article
Text
id pubmed-10224947
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-102249472023-05-29 Development and verification of a combined diagnostic model for primary Sjögren's syndrome by integrated bioinformatics analysis and machine learning Yang, Kun Wang, Qi Wu, Li Gao, Qi-Chao Tang, Shan Sci Rep Article Primary Sjögren’s syndrome (pSS) is a chronic, systemic autoimmune disease mostly affecting the exocrine glands. This debilitating condition is complex and specific treatments remain unavailable. There is a need for the development of novel diagnostic models for early screening. Four gene profiling datasets were downloaded from the Gene Expression Omnibus database. The ‘limma’ software package was used to identify differentially expressed genes (DEGs). A random forest-supervised classification algorithm was used to screen disease-specific genes, and three machine learning algorithms, including artificial neural networks (ANN), random forest (RF), and support vector machines (SVM), were used to build a pSS diagnostic model. The performance of the model was measured using its area under the receiver operating characteristic curve. Immune cell infiltration was investigated using the CIBERSORT algorithm. A total of 96 DEGs were identified. By utilizing a RF classifier, a set of 14 signature genes that are pivotal in transcription regulation and disease progression in pSS were identified. Through the utilization of training and testing datasets, diagnostic models for pSS were successfully designed using ANN, RF, and SVM, resulting in AUCs of 0.972, 1.00, and 0.9742, respectively. The validation set yielded AUCs of 0.766, 0.8321, and 0.8223. It was the RF model that produced the best prediction performance out of the three models tested. As a result, an early predictive model for pSS was successfully developed with high diagnostic performance, providing a valuable resource for the screening and early diagnosis of pSS. Nature Publishing Group UK 2023-05-27 /pmc/articles/PMC10224947/ /pubmed/37244954 http://dx.doi.org/10.1038/s41598-023-35864-4 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Yang, Kun
Wang, Qi
Wu, Li
Gao, Qi-Chao
Tang, Shan
Development and verification of a combined diagnostic model for primary Sjögren's syndrome by integrated bioinformatics analysis and machine learning
title Development and verification of a combined diagnostic model for primary Sjögren's syndrome by integrated bioinformatics analysis and machine learning
title_full Development and verification of a combined diagnostic model for primary Sjögren's syndrome by integrated bioinformatics analysis and machine learning
title_fullStr Development and verification of a combined diagnostic model for primary Sjögren's syndrome by integrated bioinformatics analysis and machine learning
title_full_unstemmed Development and verification of a combined diagnostic model for primary Sjögren's syndrome by integrated bioinformatics analysis and machine learning
title_short Development and verification of a combined diagnostic model for primary Sjögren's syndrome by integrated bioinformatics analysis and machine learning
title_sort development and verification of a combined diagnostic model for primary sjögren's syndrome by integrated bioinformatics analysis and machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10224947/
https://www.ncbi.nlm.nih.gov/pubmed/37244954
http://dx.doi.org/10.1038/s41598-023-35864-4
work_keys_str_mv AT yangkun developmentandverificationofacombineddiagnosticmodelforprimarysjogrenssyndromebyintegratedbioinformaticsanalysisandmachinelearning
AT wangqi developmentandverificationofacombineddiagnosticmodelforprimarysjogrenssyndromebyintegratedbioinformaticsanalysisandmachinelearning
AT wuli developmentandverificationofacombineddiagnosticmodelforprimarysjogrenssyndromebyintegratedbioinformaticsanalysisandmachinelearning
AT gaoqichao developmentandverificationofacombineddiagnosticmodelforprimarysjogrenssyndromebyintegratedbioinformaticsanalysisandmachinelearning
AT tangshan developmentandverificationofacombineddiagnosticmodelforprimarysjogrenssyndromebyintegratedbioinformaticsanalysisandmachinelearning