Cargando…

Searching for a signature involving 10 genes to predict the survival of patients with acute myelocytic leukemia through a combined multi-omics analysis

BACKGROUND: Currently, acute myelocytic leukemia (AML) still has a poor prognosis. As a result, gene markers for predicting AML prognosis must be identified through systemic analysis of multi-omics data. METHODS: First of all, the copy number variation (CNV), mutation, RNA-Seq, and single nucleotide...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhuang, Haifeng, Chen, Yu, Sheng, Xianfu, Hong, Lili, Gao, Ruilan, Zhuang, Xiaofen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7321666/
https://www.ncbi.nlm.nih.gov/pubmed/32617195
http://dx.doi.org/10.7717/peerj.9437
_version_ 1783551520864731136
author Zhuang, Haifeng
Chen, Yu
Sheng, Xianfu
Hong, Lili
Gao, Ruilan
Zhuang, Xiaofen
author_facet Zhuang, Haifeng
Chen, Yu
Sheng, Xianfu
Hong, Lili
Gao, Ruilan
Zhuang, Xiaofen
author_sort Zhuang, Haifeng
collection PubMed
description BACKGROUND: Currently, acute myelocytic leukemia (AML) still has a poor prognosis. As a result, gene markers for predicting AML prognosis must be identified through systemic analysis of multi-omics data. METHODS: First of all, the copy number variation (CNV), mutation, RNA-Seq, and single nucleotide polymorphism (SNP) data, as well as those clinical follow-up data, were obtained based on The Cancer Genome Atlas (TCGA) database. Thereafter, all samples (n = 229) were randomized as test set and training set, respectively. Of them, the training set was used to screen for genes related to prognosis, and genes with mutation, SNP or CNV. Then, shrinkage estimate was used for feature selection of all the as-screened genes, to select those stable biomarkers. Eventually, a prognosis model related to those genes was established, and validated within the GEO verification (n = 124 and 72) and test set (n = 127). Moreover, it was compared with the AML prognosis prediction model reported in literature. RESULTS: Altogether 832 genes related to prognosis, 23 related to copy amplification, 774 associated with copy deletion, and 189 with significant genomic variations were acquired in this study. Later, genes with genomic variations and those related to prognosis were integrated to obtain 38 candidate genes; eventually, a shrinkage estimate was adopted to obtain 10 feature genes (including FAT2, CAMK2A, TCERG1, GDF9, PTGIS, DOC2B, DNTTIP1, PREX1, CRISPLD1 and C22orf42). Further, a signature was established using these 10 genes based on Cox regression analysis, and it served as an independent factor to predict AML prognosis. More importantly, it was able to stratify those external verification, test and training set samples with regard to the risk (P < 0.01). Compared with the prognosis prediction model reported in literature, the model established in this study was advantageous in terms of the prediction performance. CONCLUSION: The signature based on 10 genes had been established in this study, which is promising to be used to be a new marker for predicting AML prognosis.
format Online
Article
Text
id pubmed-7321666
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-73216662020-07-01 Searching for a signature involving 10 genes to predict the survival of patients with acute myelocytic leukemia through a combined multi-omics analysis Zhuang, Haifeng Chen, Yu Sheng, Xianfu Hong, Lili Gao, Ruilan Zhuang, Xiaofen PeerJ Bioinformatics BACKGROUND: Currently, acute myelocytic leukemia (AML) still has a poor prognosis. As a result, gene markers for predicting AML prognosis must be identified through systemic analysis of multi-omics data. METHODS: First of all, the copy number variation (CNV), mutation, RNA-Seq, and single nucleotide polymorphism (SNP) data, as well as those clinical follow-up data, were obtained based on The Cancer Genome Atlas (TCGA) database. Thereafter, all samples (n = 229) were randomized as test set and training set, respectively. Of them, the training set was used to screen for genes related to prognosis, and genes with mutation, SNP or CNV. Then, shrinkage estimate was used for feature selection of all the as-screened genes, to select those stable biomarkers. Eventually, a prognosis model related to those genes was established, and validated within the GEO verification (n = 124 and 72) and test set (n = 127). Moreover, it was compared with the AML prognosis prediction model reported in literature. RESULTS: Altogether 832 genes related to prognosis, 23 related to copy amplification, 774 associated with copy deletion, and 189 with significant genomic variations were acquired in this study. Later, genes with genomic variations and those related to prognosis were integrated to obtain 38 candidate genes; eventually, a shrinkage estimate was adopted to obtain 10 feature genes (including FAT2, CAMK2A, TCERG1, GDF9, PTGIS, DOC2B, DNTTIP1, PREX1, CRISPLD1 and C22orf42). Further, a signature was established using these 10 genes based on Cox regression analysis, and it served as an independent factor to predict AML prognosis. More importantly, it was able to stratify those external verification, test and training set samples with regard to the risk (P < 0.01). Compared with the prognosis prediction model reported in literature, the model established in this study was advantageous in terms of the prediction performance. CONCLUSION: The signature based on 10 genes had been established in this study, which is promising to be used to be a new marker for predicting AML prognosis. PeerJ Inc. 2020-06-25 /pmc/articles/PMC7321666/ /pubmed/32617195 http://dx.doi.org/10.7717/peerj.9437 Text en ©2020 Zhuang et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Zhuang, Haifeng
Chen, Yu
Sheng, Xianfu
Hong, Lili
Gao, Ruilan
Zhuang, Xiaofen
Searching for a signature involving 10 genes to predict the survival of patients with acute myelocytic leukemia through a combined multi-omics analysis
title Searching for a signature involving 10 genes to predict the survival of patients with acute myelocytic leukemia through a combined multi-omics analysis
title_full Searching for a signature involving 10 genes to predict the survival of patients with acute myelocytic leukemia through a combined multi-omics analysis
title_fullStr Searching for a signature involving 10 genes to predict the survival of patients with acute myelocytic leukemia through a combined multi-omics analysis
title_full_unstemmed Searching for a signature involving 10 genes to predict the survival of patients with acute myelocytic leukemia through a combined multi-omics analysis
title_short Searching for a signature involving 10 genes to predict the survival of patients with acute myelocytic leukemia through a combined multi-omics analysis
title_sort searching for a signature involving 10 genes to predict the survival of patients with acute myelocytic leukemia through a combined multi-omics analysis
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7321666/
https://www.ncbi.nlm.nih.gov/pubmed/32617195
http://dx.doi.org/10.7717/peerj.9437
work_keys_str_mv AT zhuanghaifeng searchingforasignatureinvolving10genestopredictthesurvivalofpatientswithacutemyelocyticleukemiathroughacombinedmultiomicsanalysis
AT chenyu searchingforasignatureinvolving10genestopredictthesurvivalofpatientswithacutemyelocyticleukemiathroughacombinedmultiomicsanalysis
AT shengxianfu searchingforasignatureinvolving10genestopredictthesurvivalofpatientswithacutemyelocyticleukemiathroughacombinedmultiomicsanalysis
AT honglili searchingforasignatureinvolving10genestopredictthesurvivalofpatientswithacutemyelocyticleukemiathroughacombinedmultiomicsanalysis
AT gaoruilan searchingforasignatureinvolving10genestopredictthesurvivalofpatientswithacutemyelocyticleukemiathroughacombinedmultiomicsanalysis
AT zhuangxiaofen searchingforasignatureinvolving10genestopredictthesurvivalofpatientswithacutemyelocyticleukemiathroughacombinedmultiomicsanalysis