Cargando…
Identification of seven‐gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi‐omics data analysis
BACKGROUND: The mechanism of cancer occurrence and development could be understood with multi‐omics data analysis. Discovering genetic markers is highly necessary for predicting clinical outcome of lung adenocarcinoma (LUAD). METHODS: Clinical follow‐up information, copy number variation (CNV) data,...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8841135/ https://www.ncbi.nlm.nih.gov/pubmed/34951053 http://dx.doi.org/10.1002/jcla.24190 |
_version_ | 1784650769023107072 |
---|---|
author | Zhang, Surong Zeng, Xueni Lin, Shaona Liang, Minchao Huang, Huaxing |
author_facet | Zhang, Surong Zeng, Xueni Lin, Shaona Liang, Minchao Huang, Huaxing |
author_sort | Zhang, Surong |
collection | PubMed |
description | BACKGROUND: The mechanism of cancer occurrence and development could be understood with multi‐omics data analysis. Discovering genetic markers is highly necessary for predicting clinical outcome of lung adenocarcinoma (LUAD). METHODS: Clinical follow‐up information, copy number variation (CNV) data, single nucleotide polymorphism (SNP), and RNA‐Seq were acquired from The Cancer Genome Atlas (TCGA). To obtain robust biomarkers, prognostic‐related genes, genes with SNP variation, and copy number differential genes in the training set were selected and further subjected to feature selection using random forests. Finally, a gene‐based prediction model for LUAD was validated in validation datasets. RESULTS: The study filtered 2071 prognostic‐related genes and 230 genomic variants, 1878 copy deletions, and 438 significant mutations. 218 candidate genes were screened through integrating genomic variation genes and prognosis‐related genes. 7 characteristic genes (RHOV, CSMD3, FBN2, MAGEL2, SMIM4, BCKDHB, and GANC) were identified by random forest feature selection, and many genes were found to be tumor progression‐related. A 7‐gene signature constructed by Cox regression analysis was an independent prognostic factor for LUAD patients, and at the same time a risk factor in the test set, external validation set, and training set. Noticeably, the 5‐year AUC of survival in the validation set and training set was all ˃ 0.67. Similar results were obtained from multi‐omics validation datasets. CONCLUSIONS: The study builds a novel 7‐gene signature as a prognostic marker for the survival prediction of patients with LUAD. The current findings provided a set of new prognostic and diagnostic biomarkers and therapeutic targets. |
format | Online Article Text |
id | pubmed-8841135 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-88411352022-02-22 Identification of seven‐gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi‐omics data analysis Zhang, Surong Zeng, Xueni Lin, Shaona Liang, Minchao Huang, Huaxing J Clin Lab Anal Research Articles BACKGROUND: The mechanism of cancer occurrence and development could be understood with multi‐omics data analysis. Discovering genetic markers is highly necessary for predicting clinical outcome of lung adenocarcinoma (LUAD). METHODS: Clinical follow‐up information, copy number variation (CNV) data, single nucleotide polymorphism (SNP), and RNA‐Seq were acquired from The Cancer Genome Atlas (TCGA). To obtain robust biomarkers, prognostic‐related genes, genes with SNP variation, and copy number differential genes in the training set were selected and further subjected to feature selection using random forests. Finally, a gene‐based prediction model for LUAD was validated in validation datasets. RESULTS: The study filtered 2071 prognostic‐related genes and 230 genomic variants, 1878 copy deletions, and 438 significant mutations. 218 candidate genes were screened through integrating genomic variation genes and prognosis‐related genes. 7 characteristic genes (RHOV, CSMD3, FBN2, MAGEL2, SMIM4, BCKDHB, and GANC) were identified by random forest feature selection, and many genes were found to be tumor progression‐related. A 7‐gene signature constructed by Cox regression analysis was an independent prognostic factor for LUAD patients, and at the same time a risk factor in the test set, external validation set, and training set. Noticeably, the 5‐year AUC of survival in the validation set and training set was all ˃ 0.67. Similar results were obtained from multi‐omics validation datasets. CONCLUSIONS: The study builds a novel 7‐gene signature as a prognostic marker for the survival prediction of patients with LUAD. The current findings provided a set of new prognostic and diagnostic biomarkers and therapeutic targets. John Wiley and Sons Inc. 2021-12-23 /pmc/articles/PMC8841135/ /pubmed/34951053 http://dx.doi.org/10.1002/jcla.24190 Text en © 2021 The Authors. Journal of Clinical Laboratory Analysis published by Wiley Periodicals LLC. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Articles Zhang, Surong Zeng, Xueni Lin, Shaona Liang, Minchao Huang, Huaxing Identification of seven‐gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi‐omics data analysis |
title | Identification of seven‐gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi‐omics data analysis |
title_full | Identification of seven‐gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi‐omics data analysis |
title_fullStr | Identification of seven‐gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi‐omics data analysis |
title_full_unstemmed | Identification of seven‐gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi‐omics data analysis |
title_short | Identification of seven‐gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi‐omics data analysis |
title_sort | identification of seven‐gene marker to predict the survival of patients with lung adenocarcinoma using integrated multi‐omics data analysis |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8841135/ https://www.ncbi.nlm.nih.gov/pubmed/34951053 http://dx.doi.org/10.1002/jcla.24190 |
work_keys_str_mv | AT zhangsurong identificationofsevengenemarkertopredictthesurvivalofpatientswithlungadenocarcinomausingintegratedmultiomicsdataanalysis AT zengxueni identificationofsevengenemarkertopredictthesurvivalofpatientswithlungadenocarcinomausingintegratedmultiomicsdataanalysis AT linshaona identificationofsevengenemarkertopredictthesurvivalofpatientswithlungadenocarcinomausingintegratedmultiomicsdataanalysis AT liangminchao identificationofsevengenemarkertopredictthesurvivalofpatientswithlungadenocarcinomausingintegratedmultiomicsdataanalysis AT huanghuaxing identificationofsevengenemarkertopredictthesurvivalofpatientswithlungadenocarcinomausingintegratedmultiomicsdataanalysis |