Cargando…

Machine learning-based prediction of candidate gene biomarkers correlated with immune infiltration in patients with idiopathic pulmonary fibrosis

OBJECTIVE: This study aimed to identify candidate gene biomarkers associated with immune infiltration in idiopathic pulmonary fibrosis (IPF) based on machine learning algorithms. METHODS: Microarray datasets of IPF were extracted from the Gene Expression Omnibus (GEO) database to screen for differen...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yufeng, Wang, Cong, Xia, Qingqing, Jiang, Weilong, Zhang, Huizhe, Amiri-Ardekani, Ehsan, Hua, Haibing, Cheng, Yi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9968813/
https://www.ncbi.nlm.nih.gov/pubmed/36860337
http://dx.doi.org/10.3389/fmed.2023.1001813
_version_ 1784897579698356224
author Zhang, Yufeng
Wang, Cong
Xia, Qingqing
Jiang, Weilong
Zhang, Huizhe
Amiri-Ardekani, Ehsan
Hua, Haibing
Cheng, Yi
author_facet Zhang, Yufeng
Wang, Cong
Xia, Qingqing
Jiang, Weilong
Zhang, Huizhe
Amiri-Ardekani, Ehsan
Hua, Haibing
Cheng, Yi
author_sort Zhang, Yufeng
collection PubMed
description OBJECTIVE: This study aimed to identify candidate gene biomarkers associated with immune infiltration in idiopathic pulmonary fibrosis (IPF) based on machine learning algorithms. METHODS: Microarray datasets of IPF were extracted from the Gene Expression Omnibus (GEO) database to screen for differentially expressed genes (DEGs). The DEGs were subjected to enrichment analysis, and two machine learning algorithms were used to identify candidate genes associated with IPF. These genes were verified in a validation cohort from the GEO database. Receiver operating characteristic (ROC) curves were plotted to assess the predictive value of the IPF-associated genes. The cell-type identification by estimating relative subsets of RNA transcripts (CIBERSORT) algorithm was used to evaluate the proportion of immune cells in IPF and normal tissues. Additionally, the correlation between the expression of IPF-associated genes and the infiltration levels of immune cells was examined. RESULTS: A total of 302 upregulated and 192 downregulated genes were identified. Functional annotation, pathway enrichment, Disease Ontology and gene set enrichment analyses revealed that the DEGs were related to the extracellular matrix and immune responses. COL3A1, CDH3, CEBPD, and GPIHBP1 were identified as candidate biomarkers using machine learning algorithms, and their predictive value was verified in a validation cohort. Additionally, ROC analysis revealed that the four genes had high predictive accuracy. The infiltration levels of plasma cells, M0 macrophages and resting dendritic cells were higher and those of resting natural killer (NK) cells, M1 macrophages and eosinophils were lower in the lung tissues of patients with IPF than in those of healthy individuals. The expression of the abovementioned genes was correlated with the infiltration levels of plasma cells, M0 macrophages and eosinophils. CONCLUSION: COL3A1, CDH3, CEBPD, and GPIHBP1 are candidate biomarkers of IPF. Plasma cells, M0 macrophages and eosinophils may be involved in the development of IPF and may serve as immunotherapeutic targets in IPF.
format Online
Article
Text
id pubmed-9968813
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-99688132023-02-28 Machine learning-based prediction of candidate gene biomarkers correlated with immune infiltration in patients with idiopathic pulmonary fibrosis Zhang, Yufeng Wang, Cong Xia, Qingqing Jiang, Weilong Zhang, Huizhe Amiri-Ardekani, Ehsan Hua, Haibing Cheng, Yi Front Med (Lausanne) Medicine OBJECTIVE: This study aimed to identify candidate gene biomarkers associated with immune infiltration in idiopathic pulmonary fibrosis (IPF) based on machine learning algorithms. METHODS: Microarray datasets of IPF were extracted from the Gene Expression Omnibus (GEO) database to screen for differentially expressed genes (DEGs). The DEGs were subjected to enrichment analysis, and two machine learning algorithms were used to identify candidate genes associated with IPF. These genes were verified in a validation cohort from the GEO database. Receiver operating characteristic (ROC) curves were plotted to assess the predictive value of the IPF-associated genes. The cell-type identification by estimating relative subsets of RNA transcripts (CIBERSORT) algorithm was used to evaluate the proportion of immune cells in IPF and normal tissues. Additionally, the correlation between the expression of IPF-associated genes and the infiltration levels of immune cells was examined. RESULTS: A total of 302 upregulated and 192 downregulated genes were identified. Functional annotation, pathway enrichment, Disease Ontology and gene set enrichment analyses revealed that the DEGs were related to the extracellular matrix and immune responses. COL3A1, CDH3, CEBPD, and GPIHBP1 were identified as candidate biomarkers using machine learning algorithms, and their predictive value was verified in a validation cohort. Additionally, ROC analysis revealed that the four genes had high predictive accuracy. The infiltration levels of plasma cells, M0 macrophages and resting dendritic cells were higher and those of resting natural killer (NK) cells, M1 macrophages and eosinophils were lower in the lung tissues of patients with IPF than in those of healthy individuals. The expression of the abovementioned genes was correlated with the infiltration levels of plasma cells, M0 macrophages and eosinophils. CONCLUSION: COL3A1, CDH3, CEBPD, and GPIHBP1 are candidate biomarkers of IPF. Plasma cells, M0 macrophages and eosinophils may be involved in the development of IPF and may serve as immunotherapeutic targets in IPF. Frontiers Media S.A. 2023-02-13 /pmc/articles/PMC9968813/ /pubmed/36860337 http://dx.doi.org/10.3389/fmed.2023.1001813 Text en Copyright © 2023 Zhang, Wang, Xia, Jiang, Zhang, Amiri-Ardekani, Hua and Cheng. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Medicine
Zhang, Yufeng
Wang, Cong
Xia, Qingqing
Jiang, Weilong
Zhang, Huizhe
Amiri-Ardekani, Ehsan
Hua, Haibing
Cheng, Yi
Machine learning-based prediction of candidate gene biomarkers correlated with immune infiltration in patients with idiopathic pulmonary fibrosis
title Machine learning-based prediction of candidate gene biomarkers correlated with immune infiltration in patients with idiopathic pulmonary fibrosis
title_full Machine learning-based prediction of candidate gene biomarkers correlated with immune infiltration in patients with idiopathic pulmonary fibrosis
title_fullStr Machine learning-based prediction of candidate gene biomarkers correlated with immune infiltration in patients with idiopathic pulmonary fibrosis
title_full_unstemmed Machine learning-based prediction of candidate gene biomarkers correlated with immune infiltration in patients with idiopathic pulmonary fibrosis
title_short Machine learning-based prediction of candidate gene biomarkers correlated with immune infiltration in patients with idiopathic pulmonary fibrosis
title_sort machine learning-based prediction of candidate gene biomarkers correlated with immune infiltration in patients with idiopathic pulmonary fibrosis
topic Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9968813/
https://www.ncbi.nlm.nih.gov/pubmed/36860337
http://dx.doi.org/10.3389/fmed.2023.1001813
work_keys_str_mv AT zhangyufeng machinelearningbasedpredictionofcandidategenebiomarkerscorrelatedwithimmuneinfiltrationinpatientswithidiopathicpulmonaryfibrosis
AT wangcong machinelearningbasedpredictionofcandidategenebiomarkerscorrelatedwithimmuneinfiltrationinpatientswithidiopathicpulmonaryfibrosis
AT xiaqingqing machinelearningbasedpredictionofcandidategenebiomarkerscorrelatedwithimmuneinfiltrationinpatientswithidiopathicpulmonaryfibrosis
AT jiangweilong machinelearningbasedpredictionofcandidategenebiomarkerscorrelatedwithimmuneinfiltrationinpatientswithidiopathicpulmonaryfibrosis
AT zhanghuizhe machinelearningbasedpredictionofcandidategenebiomarkerscorrelatedwithimmuneinfiltrationinpatientswithidiopathicpulmonaryfibrosis
AT amiriardekaniehsan machinelearningbasedpredictionofcandidategenebiomarkerscorrelatedwithimmuneinfiltrationinpatientswithidiopathicpulmonaryfibrosis
AT huahaibing machinelearningbasedpredictionofcandidategenebiomarkerscorrelatedwithimmuneinfiltrationinpatientswithidiopathicpulmonaryfibrosis
AT chengyi machinelearningbasedpredictionofcandidategenebiomarkerscorrelatedwithimmuneinfiltrationinpatientswithidiopathicpulmonaryfibrosis