Cargando…

Nonnegative matrix factorization‐based bioinformatics analysis reveals that TPX2 and SELENBP1 are two predictors of the inner sub‐consensuses of lung adenocarcinoma

BACKGROUND: Lung adenocarcinoma (LUAD) is a heterogeneous disease. However the inner sub‐groups of LUAD have not been fully studied. Markers predicted the sub‐groups and prognosis of LUAD are badly needed. AIMS: To identify biomarkers associated with the sub‐groups and prognosis of LUAD. MATERIALS A...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Haiwei, Wang, Xinrui, Xu, Liangpu, Cao, Hua, Zhang, Ji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8683537/
https://www.ncbi.nlm.nih.gov/pubmed/34734491
http://dx.doi.org/10.1002/cam4.4386
_version_ 1784617442198159360
author Wang, Haiwei
Wang, Xinrui
Xu, Liangpu
Cao, Hua
Zhang, Ji
author_facet Wang, Haiwei
Wang, Xinrui
Xu, Liangpu
Cao, Hua
Zhang, Ji
author_sort Wang, Haiwei
collection PubMed
description BACKGROUND: Lung adenocarcinoma (LUAD) is a heterogeneous disease. However the inner sub‐groups of LUAD have not been fully studied. Markers predicted the sub‐groups and prognosis of LUAD are badly needed. AIMS: To identify biomarkers associated with the sub‐groups and prognosis of LUAD. MATERIALS AND METHODS: Using nonnegative matrix factorization (NMF) clustering, LUAD patients from The Cancer Genome Atlas (TCGA), Gene Expression Omnibus (GEO) datasets and LUAD cell lines from Genomics of Drug Sensitivity in Cancer (GDSC) dataset were divided into different sub‐consensuses based on the gene expression profiling. The overall survival of LUAD patients in each sub‐consensus was determined by Kaplan‐Meier survival analysis. The common genes which were differentially expressed in each sub‐consensus of LUAD patients and LUAD cell lines were identified using TBtools. The predictive accuracy of TPX2 and SELENBP1 for theinner sub‐consensuses of LUAD was determined by Receiver operator characteristic (ROC) analysis. The Kaplan‐Meier survival analysis was also used to test the prognostic significance of TPX2 and SELENBP1 in LUAD patients. RESULTS: Using nonnegative matrix factorization clustering, LUAD patients in The Cancer Genome Atlas (TCGA), GSE30219, GSE42127, GSE50081, GSE68465, and GSE72094 datasets were divided into three sub‐consensuses. Sub‐consensus3 LUAD patients were with low overall survival and were with high TP53 mutations. Similarly, LUAD cell lines were also divided into three sub‐consensuses by NMF method, and sub‐consensus2 cell lines were resistant to EGFR inhibitors. Identification of the common genes which were differentially expressed in different sub‐consensuses of LUAD patients and LUAD cell lines revealed that TPX2 was highly expressed in sub‐consensus3 LUAD patients and sub‐consensus2 LUAD cell lines. On the contrary, SELENBP1 was highly expressed in sub‐consensus1 LUAD patients and sub‐consensus1 LUAD cell lines. The expression levels of TPX2 and SELENBP1 could distinguish sub‐consensus3 LUAD patients or sub‐consensus2 LUAD cell lines from other sub‐consensuses of LUAD patients or cell lines. Moreover, compared with normal lung tissues, TPX2 was highly expressed, while, SELENBP1 was lowly expressed in LUAD tissues. Furthermore, the higher expression levels of TPX2 were associated with the lower relapse‐free survival and the lower overall survival of LUAD patients. While, the higher expression levels of SELENBP1 were associated with the higher relapse‐free survival and higher overall survival. At last, we showed that TP53 mutant LUAD patients were with higher TPX2 and lower SELENBP1 expressions. DISCUSSION: Both iCluster and NMF method are proved to be robust LUAD classification systems. However, the LUAD patients in different iclusters had no significant clinical overall survival, while, sub‐consensus3 LUAD patients from NMF classification were with lower overall survival than other sub‐consensuses. CONCLUSIONS: By integrated analysis of 1765 LUAD patients and 64 LUAD cell lines, we showed that NMF was a robust inner sub‐consensuses classification method of LUAD. TPX2 and SELENBP1 were differentially expressed in different LUAD sub‐ consensuses, and predicted the inner sub‐consensuses of LUAD with high accuracy. TPX2 was an unfavorable prognostic biomarker of LUAD which was up‐regulated in LUAD tissues and associated with the low overall survival of LUAD. SELENBP1 was a favorable prognostic biomarker of LUAD which was down‐regulated in LUAD tissues and associated with the prolonged overall survival of LUAD.
format Online
Article
Text
id pubmed-8683537
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-86835372021-12-30 Nonnegative matrix factorization‐based bioinformatics analysis reveals that TPX2 and SELENBP1 are two predictors of the inner sub‐consensuses of lung adenocarcinoma Wang, Haiwei Wang, Xinrui Xu, Liangpu Cao, Hua Zhang, Ji Cancer Med Bioinformatics BACKGROUND: Lung adenocarcinoma (LUAD) is a heterogeneous disease. However the inner sub‐groups of LUAD have not been fully studied. Markers predicted the sub‐groups and prognosis of LUAD are badly needed. AIMS: To identify biomarkers associated with the sub‐groups and prognosis of LUAD. MATERIALS AND METHODS: Using nonnegative matrix factorization (NMF) clustering, LUAD patients from The Cancer Genome Atlas (TCGA), Gene Expression Omnibus (GEO) datasets and LUAD cell lines from Genomics of Drug Sensitivity in Cancer (GDSC) dataset were divided into different sub‐consensuses based on the gene expression profiling. The overall survival of LUAD patients in each sub‐consensus was determined by Kaplan‐Meier survival analysis. The common genes which were differentially expressed in each sub‐consensus of LUAD patients and LUAD cell lines were identified using TBtools. The predictive accuracy of TPX2 and SELENBP1 for theinner sub‐consensuses of LUAD was determined by Receiver operator characteristic (ROC) analysis. The Kaplan‐Meier survival analysis was also used to test the prognostic significance of TPX2 and SELENBP1 in LUAD patients. RESULTS: Using nonnegative matrix factorization clustering, LUAD patients in The Cancer Genome Atlas (TCGA), GSE30219, GSE42127, GSE50081, GSE68465, and GSE72094 datasets were divided into three sub‐consensuses. Sub‐consensus3 LUAD patients were with low overall survival and were with high TP53 mutations. Similarly, LUAD cell lines were also divided into three sub‐consensuses by NMF method, and sub‐consensus2 cell lines were resistant to EGFR inhibitors. Identification of the common genes which were differentially expressed in different sub‐consensuses of LUAD patients and LUAD cell lines revealed that TPX2 was highly expressed in sub‐consensus3 LUAD patients and sub‐consensus2 LUAD cell lines. On the contrary, SELENBP1 was highly expressed in sub‐consensus1 LUAD patients and sub‐consensus1 LUAD cell lines. The expression levels of TPX2 and SELENBP1 could distinguish sub‐consensus3 LUAD patients or sub‐consensus2 LUAD cell lines from other sub‐consensuses of LUAD patients or cell lines. Moreover, compared with normal lung tissues, TPX2 was highly expressed, while, SELENBP1 was lowly expressed in LUAD tissues. Furthermore, the higher expression levels of TPX2 were associated with the lower relapse‐free survival and the lower overall survival of LUAD patients. While, the higher expression levels of SELENBP1 were associated with the higher relapse‐free survival and higher overall survival. At last, we showed that TP53 mutant LUAD patients were with higher TPX2 and lower SELENBP1 expressions. DISCUSSION: Both iCluster and NMF method are proved to be robust LUAD classification systems. However, the LUAD patients in different iclusters had no significant clinical overall survival, while, sub‐consensus3 LUAD patients from NMF classification were with lower overall survival than other sub‐consensuses. CONCLUSIONS: By integrated analysis of 1765 LUAD patients and 64 LUAD cell lines, we showed that NMF was a robust inner sub‐consensuses classification method of LUAD. TPX2 and SELENBP1 were differentially expressed in different LUAD sub‐ consensuses, and predicted the inner sub‐consensuses of LUAD with high accuracy. TPX2 was an unfavorable prognostic biomarker of LUAD which was up‐regulated in LUAD tissues and associated with the low overall survival of LUAD. SELENBP1 was a favorable prognostic biomarker of LUAD which was down‐regulated in LUAD tissues and associated with the prolonged overall survival of LUAD. John Wiley and Sons Inc. 2021-11-03 /pmc/articles/PMC8683537/ /pubmed/34734491 http://dx.doi.org/10.1002/cam4.4386 Text en © 2021 The Authors. Cancer Medicine published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Bioinformatics
Wang, Haiwei
Wang, Xinrui
Xu, Liangpu
Cao, Hua
Zhang, Ji
Nonnegative matrix factorization‐based bioinformatics analysis reveals that TPX2 and SELENBP1 are two predictors of the inner sub‐consensuses of lung adenocarcinoma
title Nonnegative matrix factorization‐based bioinformatics analysis reveals that TPX2 and SELENBP1 are two predictors of the inner sub‐consensuses of lung adenocarcinoma
title_full Nonnegative matrix factorization‐based bioinformatics analysis reveals that TPX2 and SELENBP1 are two predictors of the inner sub‐consensuses of lung adenocarcinoma
title_fullStr Nonnegative matrix factorization‐based bioinformatics analysis reveals that TPX2 and SELENBP1 are two predictors of the inner sub‐consensuses of lung adenocarcinoma
title_full_unstemmed Nonnegative matrix factorization‐based bioinformatics analysis reveals that TPX2 and SELENBP1 are two predictors of the inner sub‐consensuses of lung adenocarcinoma
title_short Nonnegative matrix factorization‐based bioinformatics analysis reveals that TPX2 and SELENBP1 are two predictors of the inner sub‐consensuses of lung adenocarcinoma
title_sort nonnegative matrix factorization‐based bioinformatics analysis reveals that tpx2 and selenbp1 are two predictors of the inner sub‐consensuses of lung adenocarcinoma
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8683537/
https://www.ncbi.nlm.nih.gov/pubmed/34734491
http://dx.doi.org/10.1002/cam4.4386
work_keys_str_mv AT wanghaiwei nonnegativematrixfactorizationbasedbioinformaticsanalysisrevealsthattpx2andselenbp1aretwopredictorsoftheinnersubconsensusesoflungadenocarcinoma
AT wangxinrui nonnegativematrixfactorizationbasedbioinformaticsanalysisrevealsthattpx2andselenbp1aretwopredictorsoftheinnersubconsensusesoflungadenocarcinoma
AT xuliangpu nonnegativematrixfactorizationbasedbioinformaticsanalysisrevealsthattpx2andselenbp1aretwopredictorsoftheinnersubconsensusesoflungadenocarcinoma
AT caohua nonnegativematrixfactorizationbasedbioinformaticsanalysisrevealsthattpx2andselenbp1aretwopredictorsoftheinnersubconsensusesoflungadenocarcinoma
AT zhangji nonnegativematrixfactorizationbasedbioinformaticsanalysisrevealsthattpx2andselenbp1aretwopredictorsoftheinnersubconsensusesoflungadenocarcinoma