Cargando…

The landscape of gene co-expression modules correlating with prognostic genetic abnormalities in AML

BACKGROUND: The heterogenous cytogenetic and molecular variations were harbored by AML patients, some of which are related with AML pathogenesis and clinical outcomes. We aimed to uncover the intrinsic expression profiles correlating with prognostic genetic abnormalities by WGCNA. METHODS: We downlo...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Chao, Gao, Ya-yue, Ju, Qian-qian, Zhang, Chun-xia, Gong, Ming, Li, Zhen-ling
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8164775/
https://www.ncbi.nlm.nih.gov/pubmed/34051812
http://dx.doi.org/10.1186/s12967-021-02914-2
_version_ 1783701187657203712
author Guo, Chao
Gao, Ya-yue
Ju, Qian-qian
Zhang, Chun-xia
Gong, Ming
Li, Zhen-ling
author_facet Guo, Chao
Gao, Ya-yue
Ju, Qian-qian
Zhang, Chun-xia
Gong, Ming
Li, Zhen-ling
author_sort Guo, Chao
collection PubMed
description BACKGROUND: The heterogenous cytogenetic and molecular variations were harbored by AML patients, some of which are related with AML pathogenesis and clinical outcomes. We aimed to uncover the intrinsic expression profiles correlating with prognostic genetic abnormalities by WGCNA. METHODS: We downloaded the clinical and expression dataset from BeatAML, TCGA and GEO database. Using R (version 4.0.2) and ‘WGCNA’ package, the co-expression modules correlating with the ELN2017 prognostic markers were identified (R(2) ≥ 0.4, p < 0.01). ORA detected the enriched pathways for the key co-expression modules. The patients in TCGA cohort were randomly assigned into the training set (50%) and testing set (50%). The LASSO penalized regression analysis was employed to build the prediction model, fitting OS to the expression level of hub genes by ‘glmnet’ package. Then the testing and 2 independent validation sets (GSE12417 and GSE37642) were used to validate the diagnostic utility and accuracy of the model. RESULTS: A total of 37 gene co-expression modules and 973 hub genes were identified for the BeatAML cohort. We found that 3 modules were significantly correlated with genetic markers (the ‘lightyellow’ module for NPM1 mutation, the ‘saddlebrown’ module for RUNX1 mutation, the ‘lightgreen’ module for TP53 mutation). ORA revealed that the ‘lightyellow’ module was mainly enriched in DNA-binding transcription factor activity and activation of HOX genes. The ‘saddlebrown’ module was enriched in immune response process. And the ‘lightgreen’ module was predominantly enriched in mitosis cell cycle process. The LASSO- regression analysis identified 6 genes (NFKB2, NEK9, HOXA7, APRC5L, FAM30A and LOC105371592) with non-zero coefficients. The risk score generated from the 6-gene model, was associated with ELN2017 risk stratification, relapsed disease, and prior MDS history. The 5-year AUC for the model was 0.822 and 0.824 in the training and testing sets, respectively. Moreover, the diagnostic utility of the model was robust when it was employed in 2 validation sets (5-year AUC 0.743–0.79). CONCLUSIONS: We established the co-expression network signature correlated with the ELN2017 recommended prognostic genetic abnormalities in AML. The 6-gene prediction model for AML survival was developed and validated by multiple datasets. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12967-021-02914-2.
format Online
Article
Text
id pubmed-8164775
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-81647752021-06-01 The landscape of gene co-expression modules correlating with prognostic genetic abnormalities in AML Guo, Chao Gao, Ya-yue Ju, Qian-qian Zhang, Chun-xia Gong, Ming Li, Zhen-ling J Transl Med Research BACKGROUND: The heterogenous cytogenetic and molecular variations were harbored by AML patients, some of which are related with AML pathogenesis and clinical outcomes. We aimed to uncover the intrinsic expression profiles correlating with prognostic genetic abnormalities by WGCNA. METHODS: We downloaded the clinical and expression dataset from BeatAML, TCGA and GEO database. Using R (version 4.0.2) and ‘WGCNA’ package, the co-expression modules correlating with the ELN2017 prognostic markers were identified (R(2) ≥ 0.4, p < 0.01). ORA detected the enriched pathways for the key co-expression modules. The patients in TCGA cohort were randomly assigned into the training set (50%) and testing set (50%). The LASSO penalized regression analysis was employed to build the prediction model, fitting OS to the expression level of hub genes by ‘glmnet’ package. Then the testing and 2 independent validation sets (GSE12417 and GSE37642) were used to validate the diagnostic utility and accuracy of the model. RESULTS: A total of 37 gene co-expression modules and 973 hub genes were identified for the BeatAML cohort. We found that 3 modules were significantly correlated with genetic markers (the ‘lightyellow’ module for NPM1 mutation, the ‘saddlebrown’ module for RUNX1 mutation, the ‘lightgreen’ module for TP53 mutation). ORA revealed that the ‘lightyellow’ module was mainly enriched in DNA-binding transcription factor activity and activation of HOX genes. The ‘saddlebrown’ module was enriched in immune response process. And the ‘lightgreen’ module was predominantly enriched in mitosis cell cycle process. The LASSO- regression analysis identified 6 genes (NFKB2, NEK9, HOXA7, APRC5L, FAM30A and LOC105371592) with non-zero coefficients. The risk score generated from the 6-gene model, was associated with ELN2017 risk stratification, relapsed disease, and prior MDS history. The 5-year AUC for the model was 0.822 and 0.824 in the training and testing sets, respectively. Moreover, the diagnostic utility of the model was robust when it was employed in 2 validation sets (5-year AUC 0.743–0.79). CONCLUSIONS: We established the co-expression network signature correlated with the ELN2017 recommended prognostic genetic abnormalities in AML. The 6-gene prediction model for AML survival was developed and validated by multiple datasets. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12967-021-02914-2. BioMed Central 2021-05-29 /pmc/articles/PMC8164775/ /pubmed/34051812 http://dx.doi.org/10.1186/s12967-021-02914-2 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Guo, Chao
Gao, Ya-yue
Ju, Qian-qian
Zhang, Chun-xia
Gong, Ming
Li, Zhen-ling
The landscape of gene co-expression modules correlating with prognostic genetic abnormalities in AML
title The landscape of gene co-expression modules correlating with prognostic genetic abnormalities in AML
title_full The landscape of gene co-expression modules correlating with prognostic genetic abnormalities in AML
title_fullStr The landscape of gene co-expression modules correlating with prognostic genetic abnormalities in AML
title_full_unstemmed The landscape of gene co-expression modules correlating with prognostic genetic abnormalities in AML
title_short The landscape of gene co-expression modules correlating with prognostic genetic abnormalities in AML
title_sort landscape of gene co-expression modules correlating with prognostic genetic abnormalities in aml
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8164775/
https://www.ncbi.nlm.nih.gov/pubmed/34051812
http://dx.doi.org/10.1186/s12967-021-02914-2
work_keys_str_mv AT guochao thelandscapeofgenecoexpressionmodulescorrelatingwithprognosticgeneticabnormalitiesinaml
AT gaoyayue thelandscapeofgenecoexpressionmodulescorrelatingwithprognosticgeneticabnormalitiesinaml
AT juqianqian thelandscapeofgenecoexpressionmodulescorrelatingwithprognosticgeneticabnormalitiesinaml
AT zhangchunxia thelandscapeofgenecoexpressionmodulescorrelatingwithprognosticgeneticabnormalitiesinaml
AT gongming thelandscapeofgenecoexpressionmodulescorrelatingwithprognosticgeneticabnormalitiesinaml
AT lizhenling thelandscapeofgenecoexpressionmodulescorrelatingwithprognosticgeneticabnormalitiesinaml
AT guochao landscapeofgenecoexpressionmodulescorrelatingwithprognosticgeneticabnormalitiesinaml
AT gaoyayue landscapeofgenecoexpressionmodulescorrelatingwithprognosticgeneticabnormalitiesinaml
AT juqianqian landscapeofgenecoexpressionmodulescorrelatingwithprognosticgeneticabnormalitiesinaml
AT zhangchunxia landscapeofgenecoexpressionmodulescorrelatingwithprognosticgeneticabnormalitiesinaml
AT gongming landscapeofgenecoexpressionmodulescorrelatingwithprognosticgeneticabnormalitiesinaml
AT lizhenling landscapeofgenecoexpressionmodulescorrelatingwithprognosticgeneticabnormalitiesinaml