Cargando…

A signal recognition particle-related joint model of LASSO regression, SVM-RFE and artificial neural network for the diagnosis of systemic sclerosis-associated pulmonary hypertension

Background: Systemic sclerosis-associated pulmonary hypertension (SSc-PH) is one of the most common causes of death in patients with systemic sclerosis (SSc). The complexity of SSc-PH and the heterogeneity of clinical features in SSc-PH patients contribute to the difficulty of diagnosis. Therefore,...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Jingxi, Liang, Chaoyang, Li, Jiangtao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9742487/
https://www.ncbi.nlm.nih.gov/pubmed/36518216
http://dx.doi.org/10.3389/fgene.2022.1078200
_version_ 1784848530817417216
author Xu, Jingxi
Liang, Chaoyang
Li, Jiangtao
author_facet Xu, Jingxi
Liang, Chaoyang
Li, Jiangtao
author_sort Xu, Jingxi
collection PubMed
description Background: Systemic sclerosis-associated pulmonary hypertension (SSc-PH) is one of the most common causes of death in patients with systemic sclerosis (SSc). The complexity of SSc-PH and the heterogeneity of clinical features in SSc-PH patients contribute to the difficulty of diagnosis. Therefore, there is a pressing need to develop and optimize models for the diagnosis of SSc-PH. Signal recognition particle (SRP) deficiency has been found to promote the progression of multiple cancers, but the relationship between SRP and SSc-PH has not been explored. Methods: First, we obtained the GSE19617 and GSE33463 datasets from the Gene Expression Omnibus (GEO) database as the training set, GSE22356 as the test set, and the SRP-related gene set from the MSigDB database. Next, we identified differentially expressed SRP-related genes (DE-SRPGs) and performed unsupervised clustering and gene enrichment analyses. Then, we used least absolute shrinkage and selection operator (LASSO) regression and support vector machine-recursive feature elimination (SVM-RFE) to identify SRP-related diagnostic genes (SRP-DGs). We constructed an SRP scoring system and a nomogram model based on the SRP-DGs and established an artificial neural network (ANN) for diagnosis. We used receiver operating characteristic (ROC) curves to identify the SRP-related signature in the training and test sets. Finally, we analyzed immune features, signaling pathways, and drugs associated with SRP and investigated SRP-DGs’ functions using single gene batch correlation analysis-based GSEA. Results: We obtained 30 DE-SRPGs and found that they were enriched in functions and pathways such as “protein targeting to ER,” “cytosolic ribosome,” and “coronavirus disease—COVID-19”. Subsequently, we identified seven SRP-DGs whose expression levels and diagnostic efficacy were validated in the test set. As one signature, the area under the ROC curve (AUC) values for seven SRP-DGs were 0.769 and 1.000 in the training and test sets, respectively. Predictions made using the nomogram model are likely beneficial for SSc-PH patients. The AUC values of the ANN were 0.999 and 0.860 in the training and test sets, respectively. Finally, we discovered that some immune cells and pathways, such as activated dendritic cells, complement activation, and heme metabolism, were significantly associated with SRP-DGs and identified ten drugs targeting SRP-DGs. Conclusion: We constructed a reliable SRP-related ANN model for the diagnosis of SSc-PH and investigated the possible role of SRP in the etiopathogenesis of SSc-PH by bioinformatics methods to provide a basis for precision and personalized medicine.
format Online
Article
Text
id pubmed-9742487
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-97424872022-12-13 A signal recognition particle-related joint model of LASSO regression, SVM-RFE and artificial neural network for the diagnosis of systemic sclerosis-associated pulmonary hypertension Xu, Jingxi Liang, Chaoyang Li, Jiangtao Front Genet Genetics Background: Systemic sclerosis-associated pulmonary hypertension (SSc-PH) is one of the most common causes of death in patients with systemic sclerosis (SSc). The complexity of SSc-PH and the heterogeneity of clinical features in SSc-PH patients contribute to the difficulty of diagnosis. Therefore, there is a pressing need to develop and optimize models for the diagnosis of SSc-PH. Signal recognition particle (SRP) deficiency has been found to promote the progression of multiple cancers, but the relationship between SRP and SSc-PH has not been explored. Methods: First, we obtained the GSE19617 and GSE33463 datasets from the Gene Expression Omnibus (GEO) database as the training set, GSE22356 as the test set, and the SRP-related gene set from the MSigDB database. Next, we identified differentially expressed SRP-related genes (DE-SRPGs) and performed unsupervised clustering and gene enrichment analyses. Then, we used least absolute shrinkage and selection operator (LASSO) regression and support vector machine-recursive feature elimination (SVM-RFE) to identify SRP-related diagnostic genes (SRP-DGs). We constructed an SRP scoring system and a nomogram model based on the SRP-DGs and established an artificial neural network (ANN) for diagnosis. We used receiver operating characteristic (ROC) curves to identify the SRP-related signature in the training and test sets. Finally, we analyzed immune features, signaling pathways, and drugs associated with SRP and investigated SRP-DGs’ functions using single gene batch correlation analysis-based GSEA. Results: We obtained 30 DE-SRPGs and found that they were enriched in functions and pathways such as “protein targeting to ER,” “cytosolic ribosome,” and “coronavirus disease—COVID-19”. Subsequently, we identified seven SRP-DGs whose expression levels and diagnostic efficacy were validated in the test set. As one signature, the area under the ROC curve (AUC) values for seven SRP-DGs were 0.769 and 1.000 in the training and test sets, respectively. Predictions made using the nomogram model are likely beneficial for SSc-PH patients. The AUC values of the ANN were 0.999 and 0.860 in the training and test sets, respectively. Finally, we discovered that some immune cells and pathways, such as activated dendritic cells, complement activation, and heme metabolism, were significantly associated with SRP-DGs and identified ten drugs targeting SRP-DGs. Conclusion: We constructed a reliable SRP-related ANN model for the diagnosis of SSc-PH and investigated the possible role of SRP in the etiopathogenesis of SSc-PH by bioinformatics methods to provide a basis for precision and personalized medicine. Frontiers Media S.A. 2022-11-28 /pmc/articles/PMC9742487/ /pubmed/36518216 http://dx.doi.org/10.3389/fgene.2022.1078200 Text en Copyright © 2022 Xu, Liang and Li. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Xu, Jingxi
Liang, Chaoyang
Li, Jiangtao
A signal recognition particle-related joint model of LASSO regression, SVM-RFE and artificial neural network for the diagnosis of systemic sclerosis-associated pulmonary hypertension
title A signal recognition particle-related joint model of LASSO regression, SVM-RFE and artificial neural network for the diagnosis of systemic sclerosis-associated pulmonary hypertension
title_full A signal recognition particle-related joint model of LASSO regression, SVM-RFE and artificial neural network for the diagnosis of systemic sclerosis-associated pulmonary hypertension
title_fullStr A signal recognition particle-related joint model of LASSO regression, SVM-RFE and artificial neural network for the diagnosis of systemic sclerosis-associated pulmonary hypertension
title_full_unstemmed A signal recognition particle-related joint model of LASSO regression, SVM-RFE and artificial neural network for the diagnosis of systemic sclerosis-associated pulmonary hypertension
title_short A signal recognition particle-related joint model of LASSO regression, SVM-RFE and artificial neural network for the diagnosis of systemic sclerosis-associated pulmonary hypertension
title_sort signal recognition particle-related joint model of lasso regression, svm-rfe and artificial neural network for the diagnosis of systemic sclerosis-associated pulmonary hypertension
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9742487/
https://www.ncbi.nlm.nih.gov/pubmed/36518216
http://dx.doi.org/10.3389/fgene.2022.1078200
work_keys_str_mv AT xujingxi asignalrecognitionparticlerelatedjointmodeloflassoregressionsvmrfeandartificialneuralnetworkforthediagnosisofsystemicsclerosisassociatedpulmonaryhypertension
AT liangchaoyang asignalrecognitionparticlerelatedjointmodeloflassoregressionsvmrfeandartificialneuralnetworkforthediagnosisofsystemicsclerosisassociatedpulmonaryhypertension
AT lijiangtao asignalrecognitionparticlerelatedjointmodeloflassoregressionsvmrfeandartificialneuralnetworkforthediagnosisofsystemicsclerosisassociatedpulmonaryhypertension
AT xujingxi signalrecognitionparticlerelatedjointmodeloflassoregressionsvmrfeandartificialneuralnetworkforthediagnosisofsystemicsclerosisassociatedpulmonaryhypertension
AT liangchaoyang signalrecognitionparticlerelatedjointmodeloflassoregressionsvmrfeandartificialneuralnetworkforthediagnosisofsystemicsclerosisassociatedpulmonaryhypertension
AT lijiangtao signalrecognitionparticlerelatedjointmodeloflassoregressionsvmrfeandartificialneuralnetworkforthediagnosisofsystemicsclerosisassociatedpulmonaryhypertension