Cargando…
Machine learning-based screening of the diagnostic genes and their relationship with immune-cell infiltration in patients with lung adenocarcinoma
BACKGROUND: Lung adenocarcinoma (LUAD) is the most common type of lung cancer, and has a dismal mortality rate of 80%, mainly due to diagnosis at an advanced stage. Biomarkers with high specificity and sensitivity for the early diagnosis of LUAD are sparse. This study aimed to identify markers for t...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
AME Publishing Company
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8987822/ https://www.ncbi.nlm.nih.gov/pubmed/35399247 http://dx.doi.org/10.21037/jtd-22-206 |
_version_ | 1784682827693948928 |
---|---|
author | Wang, Shuying Wang, Qiong Fan, Bin Gong, Jiao Sun, Liping Hu, Bo Wang, Deqing |
author_facet | Wang, Shuying Wang, Qiong Fan, Bin Gong, Jiao Sun, Liping Hu, Bo Wang, Deqing |
author_sort | Wang, Shuying |
collection | PubMed |
description | BACKGROUND: Lung adenocarcinoma (LUAD) is the most common type of lung cancer, and has a dismal mortality rate of 80%, mainly due to diagnosis at an advanced stage. Biomarkers with high specificity and sensitivity for the early diagnosis of LUAD are sparse. This study aimed to identify markers for the early diagnosis of LUAD. METHODS: The GSE32863 and GSE75037 data sets were standardized and merged to screen for differentially expressed genes (DEGs). Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses were conducted. The intersected DEGs from the least absolute shrinkage and selection operator (LASSO) and support vector machine (SVM) regression analyses were considered the hub genes. Then the diagnostic ability and expression of hub genes was tested in GSE63459 data set, Finally, CIBERSORT was used to analyze the correlation between the immune-infiltrating cells and hub genes. RESULTS: The following 7 DEGs were intersected by the LASSO and SVM regression analyses: Locus 401286 (LOC401286), flavin-containing monooxygenase 2 (FMO2), XLKD1, Ras homolog family member J (RHOJ), scavenger receptor Class A member 5 (SCARA5), heat shock protein beta-2 (HSPB2), and serine incorporator 2 (SERINC2). The area under the receiver operating characteristic curve (AUC) of LOC401286, FMO2, XLKD1, RHOJ, SCARA5, HSPB2, and SERINC2 was 0.99, 1.00, 0.99, 1.00, 0.99, 0.99, and 0.98, respectively in the training groups. The AUC of LOC401286, FMO2, XLKD1, RHOJ, SCARA5, HSPB2, and SERINC2 was 0.97, 0.96, 0.94, 0.88, 0.85, 0.94 and 0.89, respectively in the validation group. The immune-cell infiltrations of naive B cells, memory B cells, plasma cells, naive cluster of differentiation (CD) 4 T cells, T follicular helper cells, regulatory T cells, gamma delta T cells, monocytes, M0 macrophages, M1 macrophages, resting mast cells, activated mast cells, and neutrophils were different between the normal and tumor tissues. Notably, these immune cells were correlated with the above-mentioned 7 diagnostic genes. CONCLUSIONS: We identified 7 DEGs in LUAD tissue that can be considered diagnostic genes based on 2 machine-learning regression methods, which could be very helpful for the early diagnosis of LUAD in clinical practice. |
format | Online Article Text |
id | pubmed-8987822 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | AME Publishing Company |
record_format | MEDLINE/PubMed |
spelling | pubmed-89878222022-04-08 Machine learning-based screening of the diagnostic genes and their relationship with immune-cell infiltration in patients with lung adenocarcinoma Wang, Shuying Wang, Qiong Fan, Bin Gong, Jiao Sun, Liping Hu, Bo Wang, Deqing J Thorac Dis Original Article BACKGROUND: Lung adenocarcinoma (LUAD) is the most common type of lung cancer, and has a dismal mortality rate of 80%, mainly due to diagnosis at an advanced stage. Biomarkers with high specificity and sensitivity for the early diagnosis of LUAD are sparse. This study aimed to identify markers for the early diagnosis of LUAD. METHODS: The GSE32863 and GSE75037 data sets were standardized and merged to screen for differentially expressed genes (DEGs). Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses were conducted. The intersected DEGs from the least absolute shrinkage and selection operator (LASSO) and support vector machine (SVM) regression analyses were considered the hub genes. Then the diagnostic ability and expression of hub genes was tested in GSE63459 data set, Finally, CIBERSORT was used to analyze the correlation between the immune-infiltrating cells and hub genes. RESULTS: The following 7 DEGs were intersected by the LASSO and SVM regression analyses: Locus 401286 (LOC401286), flavin-containing monooxygenase 2 (FMO2), XLKD1, Ras homolog family member J (RHOJ), scavenger receptor Class A member 5 (SCARA5), heat shock protein beta-2 (HSPB2), and serine incorporator 2 (SERINC2). The area under the receiver operating characteristic curve (AUC) of LOC401286, FMO2, XLKD1, RHOJ, SCARA5, HSPB2, and SERINC2 was 0.99, 1.00, 0.99, 1.00, 0.99, 0.99, and 0.98, respectively in the training groups. The AUC of LOC401286, FMO2, XLKD1, RHOJ, SCARA5, HSPB2, and SERINC2 was 0.97, 0.96, 0.94, 0.88, 0.85, 0.94 and 0.89, respectively in the validation group. The immune-cell infiltrations of naive B cells, memory B cells, plasma cells, naive cluster of differentiation (CD) 4 T cells, T follicular helper cells, regulatory T cells, gamma delta T cells, monocytes, M0 macrophages, M1 macrophages, resting mast cells, activated mast cells, and neutrophils were different between the normal and tumor tissues. Notably, these immune cells were correlated with the above-mentioned 7 diagnostic genes. CONCLUSIONS: We identified 7 DEGs in LUAD tissue that can be considered diagnostic genes based on 2 machine-learning regression methods, which could be very helpful for the early diagnosis of LUAD in clinical practice. AME Publishing Company 2022-03 /pmc/articles/PMC8987822/ /pubmed/35399247 http://dx.doi.org/10.21037/jtd-22-206 Text en 2022 Journal of Thoracic Disease. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Original Article Wang, Shuying Wang, Qiong Fan, Bin Gong, Jiao Sun, Liping Hu, Bo Wang, Deqing Machine learning-based screening of the diagnostic genes and their relationship with immune-cell infiltration in patients with lung adenocarcinoma |
title | Machine learning-based screening of the diagnostic genes and their relationship with immune-cell infiltration in patients with lung adenocarcinoma |
title_full | Machine learning-based screening of the diagnostic genes and their relationship with immune-cell infiltration in patients with lung adenocarcinoma |
title_fullStr | Machine learning-based screening of the diagnostic genes and their relationship with immune-cell infiltration in patients with lung adenocarcinoma |
title_full_unstemmed | Machine learning-based screening of the diagnostic genes and their relationship with immune-cell infiltration in patients with lung adenocarcinoma |
title_short | Machine learning-based screening of the diagnostic genes and their relationship with immune-cell infiltration in patients with lung adenocarcinoma |
title_sort | machine learning-based screening of the diagnostic genes and their relationship with immune-cell infiltration in patients with lung adenocarcinoma |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8987822/ https://www.ncbi.nlm.nih.gov/pubmed/35399247 http://dx.doi.org/10.21037/jtd-22-206 |
work_keys_str_mv | AT wangshuying machinelearningbasedscreeningofthediagnosticgenesandtheirrelationshipwithimmunecellinfiltrationinpatientswithlungadenocarcinoma AT wangqiong machinelearningbasedscreeningofthediagnosticgenesandtheirrelationshipwithimmunecellinfiltrationinpatientswithlungadenocarcinoma AT fanbin machinelearningbasedscreeningofthediagnosticgenesandtheirrelationshipwithimmunecellinfiltrationinpatientswithlungadenocarcinoma AT gongjiao machinelearningbasedscreeningofthediagnosticgenesandtheirrelationshipwithimmunecellinfiltrationinpatientswithlungadenocarcinoma AT sunliping machinelearningbasedscreeningofthediagnosticgenesandtheirrelationshipwithimmunecellinfiltrationinpatientswithlungadenocarcinoma AT hubo machinelearningbasedscreeningofthediagnosticgenesandtheirrelationshipwithimmunecellinfiltrationinpatientswithlungadenocarcinoma AT wangdeqing machinelearningbasedscreeningofthediagnosticgenesandtheirrelationshipwithimmunecellinfiltrationinpatientswithlungadenocarcinoma |