Cargando…

Identification of 5 Gene Signatures in Survival Prediction for Patients with Lung Squamous Cell Carcinoma Based on Integrated Multiomics Data Analysis

BACKGROUND: Lung squamous cell carcinoma (LSCC) is a frequently diagnosed cancer worldwide, and it has a poor prognosis. The current study is aimed at developing the prediction of LSCC prognosis by integrating multiomics data including transcriptome, copy number variation data, and mutation data ana...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Hongxia, Tong, Lihong, Zhang, Qian, Chang, Wenjun, Li, Fengsen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7298313/
https://www.ncbi.nlm.nih.gov/pubmed/32596344
http://dx.doi.org/10.1155/2020/6427483
_version_ 1783547182099464192
author Ma, Hongxia
Tong, Lihong
Zhang, Qian
Chang, Wenjun
Li, Fengsen
author_facet Ma, Hongxia
Tong, Lihong
Zhang, Qian
Chang, Wenjun
Li, Fengsen
author_sort Ma, Hongxia
collection PubMed
description BACKGROUND: Lung squamous cell carcinoma (LSCC) is a frequently diagnosed cancer worldwide, and it has a poor prognosis. The current study is aimed at developing the prediction of LSCC prognosis by integrating multiomics data including transcriptome, copy number variation data, and mutation data analysis, so as to predict patients' survival and discover new therapeutic targets. METHODS: RNASeq, SNP, CNV data, and LSCC patients' clinical follow-up information were downloaded from The Cancer Genome Atlas (TCGA), and the samples were randomly divided into two groups, namely, the training set and the validation set. In the training set, the genes related to prognosis and those with different copy numbers or with different SNPs were integrated to extract features using random forests, and finally, robust biomarkers were screened. In addition, a gene-related prognostic model was established and further verified in the test set and GEO validation set. RESULTS: We obtained a total of 804 prognostic-related genes and 535 copy amplification genes, 621 copy deletions genes, and 388 significantly mutated genes in genomic variants; noticeably, these genomic variant genes were found closely related to tumor development. A total of 51 candidate genes were obtained by integrating genomic variants and prognostic genes, and 5 characteristic genes (HIST1H2BH, SERPIND1, COL22A1, LCE3C, and ADAMTS17) were screened through random forest feature selection; we found that many of those genes had been reported to be related to LSCC progression. Cox regression analysis was performed to establish 5-gene signature that could serve as an independent prognostic factor for LSCC patients and can stratify risk samples in training set, test set, and external validation set (p < 0.01), and the 5-year survival areas under the curve (AUC) of both training set and validation set were > 0.67. CONCLUSION: In the current study, 5 gene signatures were constructed as novel prognostic markers to predict the survival of LSCC patients. The present findings provide new diagnostic and prognostic biomarkers and therapeutic targets for LSCC treatment.
format Online
Article
Text
id pubmed-7298313
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-72983132020-06-26 Identification of 5 Gene Signatures in Survival Prediction for Patients with Lung Squamous Cell Carcinoma Based on Integrated Multiomics Data Analysis Ma, Hongxia Tong, Lihong Zhang, Qian Chang, Wenjun Li, Fengsen Biomed Res Int Research Article BACKGROUND: Lung squamous cell carcinoma (LSCC) is a frequently diagnosed cancer worldwide, and it has a poor prognosis. The current study is aimed at developing the prediction of LSCC prognosis by integrating multiomics data including transcriptome, copy number variation data, and mutation data analysis, so as to predict patients' survival and discover new therapeutic targets. METHODS: RNASeq, SNP, CNV data, and LSCC patients' clinical follow-up information were downloaded from The Cancer Genome Atlas (TCGA), and the samples were randomly divided into two groups, namely, the training set and the validation set. In the training set, the genes related to prognosis and those with different copy numbers or with different SNPs were integrated to extract features using random forests, and finally, robust biomarkers were screened. In addition, a gene-related prognostic model was established and further verified in the test set and GEO validation set. RESULTS: We obtained a total of 804 prognostic-related genes and 535 copy amplification genes, 621 copy deletions genes, and 388 significantly mutated genes in genomic variants; noticeably, these genomic variant genes were found closely related to tumor development. A total of 51 candidate genes were obtained by integrating genomic variants and prognostic genes, and 5 characteristic genes (HIST1H2BH, SERPIND1, COL22A1, LCE3C, and ADAMTS17) were screened through random forest feature selection; we found that many of those genes had been reported to be related to LSCC progression. Cox regression analysis was performed to establish 5-gene signature that could serve as an independent prognostic factor for LSCC patients and can stratify risk samples in training set, test set, and external validation set (p < 0.01), and the 5-year survival areas under the curve (AUC) of both training set and validation set were > 0.67. CONCLUSION: In the current study, 5 gene signatures were constructed as novel prognostic markers to predict the survival of LSCC patients. The present findings provide new diagnostic and prognostic biomarkers and therapeutic targets for LSCC treatment. Hindawi 2020-06-08 /pmc/articles/PMC7298313/ /pubmed/32596344 http://dx.doi.org/10.1155/2020/6427483 Text en Copyright © 2020 Hongxia Ma et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Ma, Hongxia
Tong, Lihong
Zhang, Qian
Chang, Wenjun
Li, Fengsen
Identification of 5 Gene Signatures in Survival Prediction for Patients with Lung Squamous Cell Carcinoma Based on Integrated Multiomics Data Analysis
title Identification of 5 Gene Signatures in Survival Prediction for Patients with Lung Squamous Cell Carcinoma Based on Integrated Multiomics Data Analysis
title_full Identification of 5 Gene Signatures in Survival Prediction for Patients with Lung Squamous Cell Carcinoma Based on Integrated Multiomics Data Analysis
title_fullStr Identification of 5 Gene Signatures in Survival Prediction for Patients with Lung Squamous Cell Carcinoma Based on Integrated Multiomics Data Analysis
title_full_unstemmed Identification of 5 Gene Signatures in Survival Prediction for Patients with Lung Squamous Cell Carcinoma Based on Integrated Multiomics Data Analysis
title_short Identification of 5 Gene Signatures in Survival Prediction for Patients with Lung Squamous Cell Carcinoma Based on Integrated Multiomics Data Analysis
title_sort identification of 5 gene signatures in survival prediction for patients with lung squamous cell carcinoma based on integrated multiomics data analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7298313/
https://www.ncbi.nlm.nih.gov/pubmed/32596344
http://dx.doi.org/10.1155/2020/6427483
work_keys_str_mv AT mahongxia identificationof5genesignaturesinsurvivalpredictionforpatientswithlungsquamouscellcarcinomabasedonintegratedmultiomicsdataanalysis
AT tonglihong identificationof5genesignaturesinsurvivalpredictionforpatientswithlungsquamouscellcarcinomabasedonintegratedmultiomicsdataanalysis
AT zhangqian identificationof5genesignaturesinsurvivalpredictionforpatientswithlungsquamouscellcarcinomabasedonintegratedmultiomicsdataanalysis
AT changwenjun identificationof5genesignaturesinsurvivalpredictionforpatientswithlungsquamouscellcarcinomabasedonintegratedmultiomicsdataanalysis
AT lifengsen identificationof5genesignaturesinsurvivalpredictionforpatientswithlungsquamouscellcarcinomabasedonintegratedmultiomicsdataanalysis