Cargando…
Predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model
Lung adenocarcinoma (LAD) is difficult to diagnose as it tends to be small in size and metastasize early. The aim of the present study was to investigate prognostic factors for patients with LAD and establish a prognosis risk model. A training set consisting of clinical and RNA sequencing data from...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
D.A. Spandidos
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6539490/ https://www.ncbi.nlm.nih.gov/pubmed/31289525 http://dx.doi.org/10.3892/ol.2019.10366 |
_version_ | 1783422401142325248 |
---|---|
author | Zhang, Wei Shen, Yang Feng, Ganzhu |
author_facet | Zhang, Wei Shen, Yang Feng, Ganzhu |
author_sort | Zhang, Wei |
collection | PubMed |
description | Lung adenocarcinoma (LAD) is difficult to diagnose as it tends to be small in size and metastasize early. The aim of the present study was to investigate prognostic factors for patients with LAD and establish a prognosis risk model. A training set consisting of clinical and RNA sequencing data from 503 patients with LAD, as well as expression data from a further 59 LAD and adjacent tissues, was obtained from The Cancer Genome Atlas. Additionally, a validation dataset was acquired from the Gene Expression Omnibus database (GSE26939), which included clinical and gene expression data from 115 patients. Using the DESeq2 package to compare expression between LAD and adjacent tissues, differentially expressed genes (DEGs) were identified. On the basis of survival and the random forests for survival, regression and classification package, genes for constructing the prognosis risk model were selected. The prognosis risk model was constructed and validated using the survival package. Subsequently, high- and low-risk groups were compared using the Limma package to identify DEGs, and enrichment analysis was performed using the web-based gene set analysis toolkit. A protein-protein interaction network was visualized using Cytoscape software. There were 18,567 DEGs between the LAD samples and the adjacent tissues, and 363 DEGs between the high- and low-risk groups. Of these, four genes were selected for constructing the prognosis risk model, myosin IE (MYO1E), endoplasmic reticulum oxidoreductase 1α (ERO1L), C1q and tumor necrosis factor-related protein 6 (C1QTNF6) and family with sequence similarity 83, member A (FAM83A). The survival time of high- and low-risk groups in the validation set were significantly different. Functional enrichment revealed that the genes that interacted with MYO1E, ERO1L, C1QTNF6 and FAM83A separately were enriched in ‘cell cycle regulation’, ‘synthesis and assembly of nucleic acids’, ‘histone modification and cell cycle progression’ and ‘cell secretion process’. The four-gene prognosis risk model could potentially be used for predicting the survival of patients with LAD. |
format | Online Article Text |
id | pubmed-6539490 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | D.A. Spandidos |
record_format | MEDLINE/PubMed |
spelling | pubmed-65394902019-07-09 Predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model Zhang, Wei Shen, Yang Feng, Ganzhu Oncol Lett Articles Lung adenocarcinoma (LAD) is difficult to diagnose as it tends to be small in size and metastasize early. The aim of the present study was to investigate prognostic factors for patients with LAD and establish a prognosis risk model. A training set consisting of clinical and RNA sequencing data from 503 patients with LAD, as well as expression data from a further 59 LAD and adjacent tissues, was obtained from The Cancer Genome Atlas. Additionally, a validation dataset was acquired from the Gene Expression Omnibus database (GSE26939), which included clinical and gene expression data from 115 patients. Using the DESeq2 package to compare expression between LAD and adjacent tissues, differentially expressed genes (DEGs) were identified. On the basis of survival and the random forests for survival, regression and classification package, genes for constructing the prognosis risk model were selected. The prognosis risk model was constructed and validated using the survival package. Subsequently, high- and low-risk groups were compared using the Limma package to identify DEGs, and enrichment analysis was performed using the web-based gene set analysis toolkit. A protein-protein interaction network was visualized using Cytoscape software. There were 18,567 DEGs between the LAD samples and the adjacent tissues, and 363 DEGs between the high- and low-risk groups. Of these, four genes were selected for constructing the prognosis risk model, myosin IE (MYO1E), endoplasmic reticulum oxidoreductase 1α (ERO1L), C1q and tumor necrosis factor-related protein 6 (C1QTNF6) and family with sequence similarity 83, member A (FAM83A). The survival time of high- and low-risk groups in the validation set were significantly different. Functional enrichment revealed that the genes that interacted with MYO1E, ERO1L, C1QTNF6 and FAM83A separately were enriched in ‘cell cycle regulation’, ‘synthesis and assembly of nucleic acids’, ‘histone modification and cell cycle progression’ and ‘cell secretion process’. The four-gene prognosis risk model could potentially be used for predicting the survival of patients with LAD. D.A. Spandidos 2019-07 2019-05-17 /pmc/articles/PMC6539490/ /pubmed/31289525 http://dx.doi.org/10.3892/ol.2019.10366 Text en Copyright: © Zhang et al. This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made. |
spellingShingle | Articles Zhang, Wei Shen, Yang Feng, Ganzhu Predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model |
title | Predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model |
title_full | Predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model |
title_fullStr | Predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model |
title_full_unstemmed | Predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model |
title_short | Predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model |
title_sort | predicting the survival of patients with lung adenocarcinoma using a four-gene prognosis risk model |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6539490/ https://www.ncbi.nlm.nih.gov/pubmed/31289525 http://dx.doi.org/10.3892/ol.2019.10366 |
work_keys_str_mv | AT zhangwei predictingthesurvivalofpatientswithlungadenocarcinomausingafourgeneprognosisriskmodel AT shenyang predictingthesurvivalofpatientswithlungadenocarcinomausingafourgeneprognosisriskmodel AT fengganzhu predictingthesurvivalofpatientswithlungadenocarcinomausingafourgeneprognosisriskmodel |