Cargando…

Molecular clustering based on gene set expression and its relationship with prognosis in patients with lung adenocarcinoma

BACKGROUND: Lung adenocarcinoma (LUAD) is a subtype of lung cancer with high morbidity and mortality. While genotyping is an important determinant for the prognosis of LUAD patients, there is a paucity of studies on gene set-based expression (GSE) typing for LUAD. This current study used GSE methodo...

Descripción completa

Detalles Bibliográficos
Autores principales: Xing, Baobao, Shi, Lei, Bao, Zhiguo, Liang, Ying, Liu, Bo, Liu, Ruihan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: AME Publishing Company 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9186218/
https://www.ncbi.nlm.nih.gov/pubmed/35693605
http://dx.doi.org/10.21037/jtd-22-557
_version_ 1784724882606522368
author Xing, Baobao
Shi, Lei
Bao, Zhiguo
Liang, Ying
Liu, Bo
Liu, Ruihan
author_facet Xing, Baobao
Shi, Lei
Bao, Zhiguo
Liang, Ying
Liu, Bo
Liu, Ruihan
author_sort Xing, Baobao
collection PubMed
description BACKGROUND: Lung adenocarcinoma (LUAD) is a subtype of lung cancer with high morbidity and mortality. While genotyping is an important determinant for the prognosis of LUAD patients, there is a paucity of studies on gene set-based expression (GSE) typing for LUAD. This current study used GSE methodology to perform gene typing of LUAD patients. METHODS: Clinical and genomic information of the LUAD patients were downloaded from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases. Patients with LUAD were clustered into different molecular subtypes depending on the clinical and gene set expression characteristics. The survival rate and silhouette widths were compared between each molecular subtype. Differences in survival rate between gene sets were analyzed using Kaplan-Meier survival curves. Cox regression and Lasso regression were used to establish the prognostic gene set model based on the TCGA database, and the results were validated using the GEO dataset. RESULTS: A total of 10 hub genes were finally identified and clustered into 3 subtypes with a mean contour width of 0.96. There were significant differences in survival rates among the 3 subtypes (P<0.05). Gene Ontology (GO) analysis indicated that the related biological processes (BP) were mainly involved in regulation of cell cycle, mitotic cell cycle phase transition, and proteasome-mediated ubiquitin-dependent protein catabolic process. The cellular components (CC) were related to the spindle, chromosomal region, and midbody. Molecular function (MF) mainly focused on ubiquitin-like protein ligase binding, translation regulator activity, and oxidation activity. Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis showed that the main pathways included the Epstein Barr virus infection pathway of neurogeneration, the p53 signaling pathway, and the proteome pathways. In addition, the protein-protein interaction network was analyzed using the STRING and Cytospace software, and the top 9 hub genes identified were KIF2C, DLGAP5, KIF20A, PSMC1, PSMD1, PSMB7, SNAI2, FGF13, and BMP2. CONCLUSIONS: Patients with LUAD can be clustered into three subtypes based on the expression of gene sets. These findings contribute to understanding the pathogenesis and molecular mechanisms in LUAD, and may lead to potential individualized pharmacogenetic therapy for patients with LUAD.
format Online
Article
Text
id pubmed-9186218
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher AME Publishing Company
record_format MEDLINE/PubMed
spelling pubmed-91862182022-06-11 Molecular clustering based on gene set expression and its relationship with prognosis in patients with lung adenocarcinoma Xing, Baobao Shi, Lei Bao, Zhiguo Liang, Ying Liu, Bo Liu, Ruihan J Thorac Dis Original Article BACKGROUND: Lung adenocarcinoma (LUAD) is a subtype of lung cancer with high morbidity and mortality. While genotyping is an important determinant for the prognosis of LUAD patients, there is a paucity of studies on gene set-based expression (GSE) typing for LUAD. This current study used GSE methodology to perform gene typing of LUAD patients. METHODS: Clinical and genomic information of the LUAD patients were downloaded from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases. Patients with LUAD were clustered into different molecular subtypes depending on the clinical and gene set expression characteristics. The survival rate and silhouette widths were compared between each molecular subtype. Differences in survival rate between gene sets were analyzed using Kaplan-Meier survival curves. Cox regression and Lasso regression were used to establish the prognostic gene set model based on the TCGA database, and the results were validated using the GEO dataset. RESULTS: A total of 10 hub genes were finally identified and clustered into 3 subtypes with a mean contour width of 0.96. There were significant differences in survival rates among the 3 subtypes (P<0.05). Gene Ontology (GO) analysis indicated that the related biological processes (BP) were mainly involved in regulation of cell cycle, mitotic cell cycle phase transition, and proteasome-mediated ubiquitin-dependent protein catabolic process. The cellular components (CC) were related to the spindle, chromosomal region, and midbody. Molecular function (MF) mainly focused on ubiquitin-like protein ligase binding, translation regulator activity, and oxidation activity. Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis showed that the main pathways included the Epstein Barr virus infection pathway of neurogeneration, the p53 signaling pathway, and the proteome pathways. In addition, the protein-protein interaction network was analyzed using the STRING and Cytospace software, and the top 9 hub genes identified were KIF2C, DLGAP5, KIF20A, PSMC1, PSMD1, PSMB7, SNAI2, FGF13, and BMP2. CONCLUSIONS: Patients with LUAD can be clustered into three subtypes based on the expression of gene sets. These findings contribute to understanding the pathogenesis and molecular mechanisms in LUAD, and may lead to potential individualized pharmacogenetic therapy for patients with LUAD. AME Publishing Company 2022-05 /pmc/articles/PMC9186218/ /pubmed/35693605 http://dx.doi.org/10.21037/jtd-22-557 Text en 2022 Journal of Thoracic Disease. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Original Article
Xing, Baobao
Shi, Lei
Bao, Zhiguo
Liang, Ying
Liu, Bo
Liu, Ruihan
Molecular clustering based on gene set expression and its relationship with prognosis in patients with lung adenocarcinoma
title Molecular clustering based on gene set expression and its relationship with prognosis in patients with lung adenocarcinoma
title_full Molecular clustering based on gene set expression and its relationship with prognosis in patients with lung adenocarcinoma
title_fullStr Molecular clustering based on gene set expression and its relationship with prognosis in patients with lung adenocarcinoma
title_full_unstemmed Molecular clustering based on gene set expression and its relationship with prognosis in patients with lung adenocarcinoma
title_short Molecular clustering based on gene set expression and its relationship with prognosis in patients with lung adenocarcinoma
title_sort molecular clustering based on gene set expression and its relationship with prognosis in patients with lung adenocarcinoma
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9186218/
https://www.ncbi.nlm.nih.gov/pubmed/35693605
http://dx.doi.org/10.21037/jtd-22-557
work_keys_str_mv AT xingbaobao molecularclusteringbasedongenesetexpressionanditsrelationshipwithprognosisinpatientswithlungadenocarcinoma
AT shilei molecularclusteringbasedongenesetexpressionanditsrelationshipwithprognosisinpatientswithlungadenocarcinoma
AT baozhiguo molecularclusteringbasedongenesetexpressionanditsrelationshipwithprognosisinpatientswithlungadenocarcinoma
AT liangying molecularclusteringbasedongenesetexpressionanditsrelationshipwithprognosisinpatientswithlungadenocarcinoma
AT liubo molecularclusteringbasedongenesetexpressionanditsrelationshipwithprognosisinpatientswithlungadenocarcinoma
AT liuruihan molecularclusteringbasedongenesetexpressionanditsrelationshipwithprognosisinpatientswithlungadenocarcinoma