Cargando…

Predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model

Lung adenocarcinoma (LAD) is difficult to diagnose as it tends to be small in size and metastasize early. The aim of the present study was to investigate prognostic factors for patients with LAD and establish a prognosis risk model. A training set consisting of clinical and RNA sequencing data from...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Wei, Shen, Yang, Feng, Ganzhu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: D.A. Spandidos 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6539490/
https://www.ncbi.nlm.nih.gov/pubmed/31289525
http://dx.doi.org/10.3892/ol.2019.10366
_version_ 1783422401142325248
author Zhang, Wei
Shen, Yang
Feng, Ganzhu
author_facet Zhang, Wei
Shen, Yang
Feng, Ganzhu
author_sort Zhang, Wei
collection PubMed
description Lung adenocarcinoma (LAD) is difficult to diagnose as it tends to be small in size and metastasize early. The aim of the present study was to investigate prognostic factors for patients with LAD and establish a prognosis risk model. A training set consisting of clinical and RNA sequencing data from 503 patients with LAD, as well as expression data from a further 59 LAD and adjacent tissues, was obtained from The Cancer Genome Atlas. Additionally, a validation dataset was acquired from the Gene Expression Omnibus database (GSE26939), which included clinical and gene expression data from 115 patients. Using the DESeq2 package to compare expression between LAD and adjacent tissues, differentially expressed genes (DEGs) were identified. On the basis of survival and the random forests for survival, regression and classification package, genes for constructing the prognosis risk model were selected. The prognosis risk model was constructed and validated using the survival package. Subsequently, high- and low-risk groups were compared using the Limma package to identify DEGs, and enrichment analysis was performed using the web-based gene set analysis toolkit. A protein-protein interaction network was visualized using Cytoscape software. There were 18,567 DEGs between the LAD samples and the adjacent tissues, and 363 DEGs between the high- and low-risk groups. Of these, four genes were selected for constructing the prognosis risk model, myosin IE (MYO1E), endoplasmic reticulum oxidoreductase 1α (ERO1L), C1q and tumor necrosis factor-related protein 6 (C1QTNF6) and family with sequence similarity 83, member A (FAM83A). The survival time of high- and low-risk groups in the validation set were significantly different. Functional enrichment revealed that the genes that interacted with MYO1E, ERO1L, C1QTNF6 and FAM83A separately were enriched in ‘cell cycle regulation’, ‘synthesis and assembly of nucleic acids’, ‘histone modification and cell cycle progression’ and ‘cell secretion process’. The four-gene prognosis risk model could potentially be used for predicting the survival of patients with LAD.
format Online
Article
Text
id pubmed-6539490
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher D.A. Spandidos
record_format MEDLINE/PubMed
spelling pubmed-65394902019-07-09 Predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model Zhang, Wei Shen, Yang Feng, Ganzhu Oncol Lett Articles Lung adenocarcinoma (LAD) is difficult to diagnose as it tends to be small in size and metastasize early. The aim of the present study was to investigate prognostic factors for patients with LAD and establish a prognosis risk model. A training set consisting of clinical and RNA sequencing data from 503 patients with LAD, as well as expression data from a further 59 LAD and adjacent tissues, was obtained from The Cancer Genome Atlas. Additionally, a validation dataset was acquired from the Gene Expression Omnibus database (GSE26939), which included clinical and gene expression data from 115 patients. Using the DESeq2 package to compare expression between LAD and adjacent tissues, differentially expressed genes (DEGs) were identified. On the basis of survival and the random forests for survival, regression and classification package, genes for constructing the prognosis risk model were selected. The prognosis risk model was constructed and validated using the survival package. Subsequently, high- and low-risk groups were compared using the Limma package to identify DEGs, and enrichment analysis was performed using the web-based gene set analysis toolkit. A protein-protein interaction network was visualized using Cytoscape software. There were 18,567 DEGs between the LAD samples and the adjacent tissues, and 363 DEGs between the high- and low-risk groups. Of these, four genes were selected for constructing the prognosis risk model, myosin IE (MYO1E), endoplasmic reticulum oxidoreductase 1α (ERO1L), C1q and tumor necrosis factor-related protein 6 (C1QTNF6) and family with sequence similarity 83, member A (FAM83A). The survival time of high- and low-risk groups in the validation set were significantly different. Functional enrichment revealed that the genes that interacted with MYO1E, ERO1L, C1QTNF6 and FAM83A separately were enriched in ‘cell cycle regulation’, ‘synthesis and assembly of nucleic acids’, ‘histone modification and cell cycle progression’ and ‘cell secretion process’. The four-gene prognosis risk model could potentially be used for predicting the survival of patients with LAD. D.A. Spandidos 2019-07 2019-05-17 /pmc/articles/PMC6539490/ /pubmed/31289525 http://dx.doi.org/10.3892/ol.2019.10366 Text en Copyright: © Zhang et al. This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.
spellingShingle Articles
Zhang, Wei
Shen, Yang
Feng, Ganzhu
Predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model
title Predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model
title_full Predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model
title_fullStr Predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model
title_full_unstemmed Predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model
title_short Predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model
title_sort predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6539490/
https://www.ncbi.nlm.nih.gov/pubmed/31289525
http://dx.doi.org/10.3892/ol.2019.10366
work_keys_str_mv AT zhangwei predictingthesurvivalofpatientswithlungadenocarcinomausingafourgeneprognosisriskmodel
AT shenyang predictingthesurvivalofpatientswithlungadenocarcinomausingafourgeneprognosisriskmodel
AT fengganzhu predictingthesurvivalofpatientswithlungadenocarcinomausingafourgeneprognosisriskmodel