Cargando…
Molecular clustering based on gene set expression and its relationship with prognosis in patients with lung adenocarcinoma
BACKGROUND: Lung adenocarcinoma (LUAD) is a subtype of lung cancer with high morbidity and mortality. While genotyping is an important determinant for the prognosis of LUAD patients, there is a paucity of studies on gene set-based expression (GSE) typing for LUAD. This current study used GSE methodo...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
AME Publishing Company
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9186218/ https://www.ncbi.nlm.nih.gov/pubmed/35693605 http://dx.doi.org/10.21037/jtd-22-557 |
_version_ | 1784724882606522368 |
---|---|
author | Xing, Baobao Shi, Lei Bao, Zhiguo Liang, Ying Liu, Bo Liu, Ruihan |
author_facet | Xing, Baobao Shi, Lei Bao, Zhiguo Liang, Ying Liu, Bo Liu, Ruihan |
author_sort | Xing, Baobao |
collection | PubMed |
description | BACKGROUND: Lung adenocarcinoma (LUAD) is a subtype of lung cancer with high morbidity and mortality. While genotyping is an important determinant for the prognosis of LUAD patients, there is a paucity of studies on gene set-based expression (GSE) typing for LUAD. This current study used GSE methodology to perform gene typing of LUAD patients. METHODS: Clinical and genomic information of the LUAD patients were downloaded from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases. Patients with LUAD were clustered into different molecular subtypes depending on the clinical and gene set expression characteristics. The survival rate and silhouette widths were compared between each molecular subtype. Differences in survival rate between gene sets were analyzed using Kaplan-Meier survival curves. Cox regression and Lasso regression were used to establish the prognostic gene set model based on the TCGA database, and the results were validated using the GEO dataset. RESULTS: A total of 10 hub genes were finally identified and clustered into 3 subtypes with a mean contour width of 0.96. There were significant differences in survival rates among the 3 subtypes (P<0.05). Gene Ontology (GO) analysis indicated that the related biological processes (BP) were mainly involved in regulation of cell cycle, mitotic cell cycle phase transition, and proteasome-mediated ubiquitin-dependent protein catabolic process. The cellular components (CC) were related to the spindle, chromosomal region, and midbody. Molecular function (MF) mainly focused on ubiquitin-like protein ligase binding, translation regulator activity, and oxidation activity. Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis showed that the main pathways included the Epstein Barr virus infection pathway of neurogeneration, the p53 signaling pathway, and the proteome pathways. In addition, the protein-protein interaction network was analyzed using the STRING and Cytospace software, and the top 9 hub genes identified were KIF2C, DLGAP5, KIF20A, PSMC1, PSMD1, PSMB7, SNAI2, FGF13, and BMP2. CONCLUSIONS: Patients with LUAD can be clustered into three subtypes based on the expression of gene sets. These findings contribute to understanding the pathogenesis and molecular mechanisms in LUAD, and may lead to potential individualized pharmacogenetic therapy for patients with LUAD. |
format | Online Article Text |
id | pubmed-9186218 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | AME Publishing Company |
record_format | MEDLINE/PubMed |
spelling | pubmed-91862182022-06-11 Molecular clustering based on gene set expression and its relationship with prognosis in patients with lung adenocarcinoma Xing, Baobao Shi, Lei Bao, Zhiguo Liang, Ying Liu, Bo Liu, Ruihan J Thorac Dis Original Article BACKGROUND: Lung adenocarcinoma (LUAD) is a subtype of lung cancer with high morbidity and mortality. While genotyping is an important determinant for the prognosis of LUAD patients, there is a paucity of studies on gene set-based expression (GSE) typing for LUAD. This current study used GSE methodology to perform gene typing of LUAD patients. METHODS: Clinical and genomic information of the LUAD patients were downloaded from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases. Patients with LUAD were clustered into different molecular subtypes depending on the clinical and gene set expression characteristics. The survival rate and silhouette widths were compared between each molecular subtype. Differences in survival rate between gene sets were analyzed using Kaplan-Meier survival curves. Cox regression and Lasso regression were used to establish the prognostic gene set model based on the TCGA database, and the results were validated using the GEO dataset. RESULTS: A total of 10 hub genes were finally identified and clustered into 3 subtypes with a mean contour width of 0.96. There were significant differences in survival rates among the 3 subtypes (P<0.05). Gene Ontology (GO) analysis indicated that the related biological processes (BP) were mainly involved in regulation of cell cycle, mitotic cell cycle phase transition, and proteasome-mediated ubiquitin-dependent protein catabolic process. The cellular components (CC) were related to the spindle, chromosomal region, and midbody. Molecular function (MF) mainly focused on ubiquitin-like protein ligase binding, translation regulator activity, and oxidation activity. Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis showed that the main pathways included the Epstein Barr virus infection pathway of neurogeneration, the p53 signaling pathway, and the proteome pathways. In addition, the protein-protein interaction network was analyzed using the STRING and Cytospace software, and the top 9 hub genes identified were KIF2C, DLGAP5, KIF20A, PSMC1, PSMD1, PSMB7, SNAI2, FGF13, and BMP2. CONCLUSIONS: Patients with LUAD can be clustered into three subtypes based on the expression of gene sets. These findings contribute to understanding the pathogenesis and molecular mechanisms in LUAD, and may lead to potential individualized pharmacogenetic therapy for patients with LUAD. AME Publishing Company 2022-05 /pmc/articles/PMC9186218/ /pubmed/35693605 http://dx.doi.org/10.21037/jtd-22-557 Text en 2022 Journal of Thoracic Disease. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Original Article Xing, Baobao Shi, Lei Bao, Zhiguo Liang, Ying Liu, Bo Liu, Ruihan Molecular clustering based on gene set expression and its relationship with prognosis in patients with lung adenocarcinoma |
title | Molecular clustering based on gene set expression and its relationship with prognosis in patients with lung adenocarcinoma |
title_full | Molecular clustering based on gene set expression and its relationship with prognosis in patients with lung adenocarcinoma |
title_fullStr | Molecular clustering based on gene set expression and its relationship with prognosis in patients with lung adenocarcinoma |
title_full_unstemmed | Molecular clustering based on gene set expression and its relationship with prognosis in patients with lung adenocarcinoma |
title_short | Molecular clustering based on gene set expression and its relationship with prognosis in patients with lung adenocarcinoma |
title_sort | molecular clustering based on gene set expression and its relationship with prognosis in patients with lung adenocarcinoma |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9186218/ https://www.ncbi.nlm.nih.gov/pubmed/35693605 http://dx.doi.org/10.21037/jtd-22-557 |
work_keys_str_mv | AT xingbaobao molecularclusteringbasedongenesetexpressionanditsrelationshipwithprognosisinpatientswithlungadenocarcinoma AT shilei molecularclusteringbasedongenesetexpressionanditsrelationshipwithprognosisinpatientswithlungadenocarcinoma AT baozhiguo molecularclusteringbasedongenesetexpressionanditsrelationshipwithprognosisinpatientswithlungadenocarcinoma AT liangying molecularclusteringbasedongenesetexpressionanditsrelationshipwithprognosisinpatientswithlungadenocarcinoma AT liubo molecularclusteringbasedongenesetexpressionanditsrelationshipwithprognosisinpatientswithlungadenocarcinoma AT liuruihan molecularclusteringbasedongenesetexpressionanditsrelationshipwithprognosisinpatientswithlungadenocarcinoma |