Cargando…

Identification of seven‐gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi‐omics data analysis

BACKGROUND: The mechanism of cancer occurrence and development could be understood with multi‐omics data analysis. Discovering genetic markers is highly necessary for predicting clinical outcome of lung adenocarcinoma (LUAD). METHODS: Clinical follow‐up information, copy number variation (CNV) data,...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Surong, Zeng, Xueni, Lin, Shaona, Liang, Minchao, Huang, Huaxing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8841135/
https://www.ncbi.nlm.nih.gov/pubmed/34951053
http://dx.doi.org/10.1002/jcla.24190
_version_ 1784650769023107072
author Zhang, Surong
Zeng, Xueni
Lin, Shaona
Liang, Minchao
Huang, Huaxing
author_facet Zhang, Surong
Zeng, Xueni
Lin, Shaona
Liang, Minchao
Huang, Huaxing
author_sort Zhang, Surong
collection PubMed
description BACKGROUND: The mechanism of cancer occurrence and development could be understood with multi‐omics data analysis. Discovering genetic markers is highly necessary for predicting clinical outcome of lung adenocarcinoma (LUAD). METHODS: Clinical follow‐up information, copy number variation (CNV) data, single nucleotide polymorphism (SNP), and RNA‐Seq were acquired from The Cancer Genome Atlas (TCGA). To obtain robust biomarkers, prognostic‐related genes, genes with SNP variation, and copy number differential genes in the training set were selected and further subjected to feature selection using random forests. Finally, a gene‐based prediction model for LUAD was validated in validation datasets. RESULTS: The study filtered 2071 prognostic‐related genes and 230 genomic variants, 1878 copy deletions, and 438 significant mutations. 218 candidate genes were screened through integrating genomic variation genes and prognosis‐related genes. 7 characteristic genes (RHOV, CSMD3, FBN2, MAGEL2, SMIM4, BCKDHB, and GANC) were identified by random forest feature selection, and many genes were found to be tumor progression‐related. A 7‐gene signature constructed by Cox regression analysis was an independent prognostic factor for LUAD patients, and at the same time a risk factor in the test set, external validation set, and training set. Noticeably, the 5‐year AUC of survival in the validation set and training set was all ˃ 0.67. Similar results were obtained from multi‐omics validation datasets. CONCLUSIONS: The study builds a novel 7‐gene signature as a prognostic marker for the survival prediction of patients with LUAD. The current findings provided a set of new prognostic and diagnostic biomarkers and therapeutic targets.
format Online
Article
Text
id pubmed-8841135
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-88411352022-02-22 Identification of seven‐gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi‐omics data analysis Zhang, Surong Zeng, Xueni Lin, Shaona Liang, Minchao Huang, Huaxing J Clin Lab Anal Research Articles BACKGROUND: The mechanism of cancer occurrence and development could be understood with multi‐omics data analysis. Discovering genetic markers is highly necessary for predicting clinical outcome of lung adenocarcinoma (LUAD). METHODS: Clinical follow‐up information, copy number variation (CNV) data, single nucleotide polymorphism (SNP), and RNA‐Seq were acquired from The Cancer Genome Atlas (TCGA). To obtain robust biomarkers, prognostic‐related genes, genes with SNP variation, and copy number differential genes in the training set were selected and further subjected to feature selection using random forests. Finally, a gene‐based prediction model for LUAD was validated in validation datasets. RESULTS: The study filtered 2071 prognostic‐related genes and 230 genomic variants, 1878 copy deletions, and 438 significant mutations. 218 candidate genes were screened through integrating genomic variation genes and prognosis‐related genes. 7 characteristic genes (RHOV, CSMD3, FBN2, MAGEL2, SMIM4, BCKDHB, and GANC) were identified by random forest feature selection, and many genes were found to be tumor progression‐related. A 7‐gene signature constructed by Cox regression analysis was an independent prognostic factor for LUAD patients, and at the same time a risk factor in the test set, external validation set, and training set. Noticeably, the 5‐year AUC of survival in the validation set and training set was all ˃ 0.67. Similar results were obtained from multi‐omics validation datasets. CONCLUSIONS: The study builds a novel 7‐gene signature as a prognostic marker for the survival prediction of patients with LUAD. The current findings provided a set of new prognostic and diagnostic biomarkers and therapeutic targets. John Wiley and Sons Inc. 2021-12-23 /pmc/articles/PMC8841135/ /pubmed/34951053 http://dx.doi.org/10.1002/jcla.24190 Text en © 2021 The Authors. Journal of Clinical Laboratory Analysis published by Wiley Periodicals LLC. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Zhang, Surong
Zeng, Xueni
Lin, Shaona
Liang, Minchao
Huang, Huaxing
Identification of seven‐gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi‐omics data analysis
title Identification of seven‐gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi‐omics data analysis
title_full Identification of seven‐gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi‐omics data analysis
title_fullStr Identification of seven‐gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi‐omics data analysis
title_full_unstemmed Identification of seven‐gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi‐omics data analysis
title_short Identification of seven‐gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi‐omics data analysis
title_sort identification of seven‐gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi‐omics data analysis
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8841135/
https://www.ncbi.nlm.nih.gov/pubmed/34951053
http://dx.doi.org/10.1002/jcla.24190
work_keys_str_mv AT zhangsurong identificationofsevengenemarkertopredictthesurvivalofpatientswithlungadenocarcinomausingintegratedmultiomicsdataanalysis
AT zengxueni identificationofsevengenemarkertopredictthesurvivalofpatientswithlungadenocarcinomausingintegratedmultiomicsdataanalysis
AT linshaona identificationofsevengenemarkertopredictthesurvivalofpatientswithlungadenocarcinomausingintegratedmultiomicsdataanalysis
AT liangminchao identificationofsevengenemarkertopredictthesurvivalofpatientswithlungadenocarcinomausingintegratedmultiomicsdataanalysis
AT huanghuaxing identificationofsevengenemarkertopredictthesurvivalofpatientswithlungadenocarcinomausingintegratedmultiomicsdataanalysis