Cargando…
A novel machine learning derived RNA-binding protein gene–based score system predicts prognosis of hepatocellular carcinoma patients
BACKGROUND: Although the expression of RNA-binding protein (RBP) genes in hepatocellular carcinoma (HCC) varies and is associated with tumor progression, there has been no overview study with multiple cohorts and large samples. The HCC-associated RBP genes need to be more accurately identified, and...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8697767/ https://www.ncbi.nlm.nih.gov/pubmed/35036125 http://dx.doi.org/10.7717/peerj.12572 |
_version_ | 1784620116871217152 |
---|---|
author | Zhang, Qiangnu Zhang, Yusen Guo, Yusheng Tang, Honggui Li, Mingyue Liu, Liping |
author_facet | Zhang, Qiangnu Zhang, Yusen Guo, Yusheng Tang, Honggui Li, Mingyue Liu, Liping |
author_sort | Zhang, Qiangnu |
collection | PubMed |
description | BACKGROUND: Although the expression of RNA-binding protein (RBP) genes in hepatocellular carcinoma (HCC) varies and is associated with tumor progression, there has been no overview study with multiple cohorts and large samples. The HCC-associated RBP genes need to be more accurately identified, and their clinical application value needs to be further explored. METHODS: First, we used the robust rank aggregation (RRA) algorithm to extract HCC-associated RBP genes from nine HCC microarray datasets and verified them in The Cancer Genome Atlas Liver Hepatocellular Carcinoma (TCGA-LIHC) cohort and International Cancer Genome Consortium (ICGC) Japanese liver cancer (ICGC-LIRI-JP) cohort. In addition, the copy number variation (CNV), single-nucleotide variant (SNV), and promoter-region methylation data of HCC-associated RBP genes were analyzed. Using the random forest algorithm, we constructed an RBP gene–based prognostic score system (RBP-score). We then evaluated the ability of RBP-score to predict the prognosis of patients. The relationships between RBP-score and other clinical characteristics of patients were analyzed. RESULTS: The RRA algorithm identified 30 RBP mRNAs with consistent expression patterns across the nine HCC microarray datasets. These 30 RBP genes were defined as HCC-associated RBP genes. Their mRNA expression patterns were further verified in the TCGA-LIHC and ICGC-LIRI-JP cohorts. Among these 30 RBP genes, some showed significant copy number gain or loss, while others showed differences in the methylation levels of their promoter regions. Some RBP genes were risk factors or protective factors for the prognosis of patients. We extracted 10 key HCC-associated RBP genes using the random forest algorithm and constructed an RBP-score system. RBP-score effectively predicted the overall survival (OS) and disease-free survival (DFS) of HCC patients and was associated with the tumor, node, metastasis (TNM) stage, α-fetoprotein (AFP), and metastasis risk. The clinical value of RBP-score was validated in datasets from different platforms. Cox analysis suggested that a high RBP-score was an independent risk factor for poor prognosis in HCC patients. We also successfully established a combined RBP-score+TNM LASSO-Cox model that more accurately predicted the prognosis. CONCLUSION: The RBP-score system constructed based on HCC-associated RBP genes is a simple and highly effective prognostic evaluation tool. It is suitable for different subgroups of HCC patients and has cross-platform characteristics. Combining RBP-score with the TNM staging system or other clinical parameters can lead to an even greater clinical benefit. In addition, the identified HCC-associated RBP genes may serve as novel targets for HCC treatment. |
format | Online Article Text |
id | pubmed-8697767 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-86977672022-01-14 A novel machine learning derived RNA-binding protein gene–based score system predicts prognosis of hepatocellular carcinoma patients Zhang, Qiangnu Zhang, Yusen Guo, Yusheng Tang, Honggui Li, Mingyue Liu, Liping PeerJ Bioinformatics BACKGROUND: Although the expression of RNA-binding protein (RBP) genes in hepatocellular carcinoma (HCC) varies and is associated with tumor progression, there has been no overview study with multiple cohorts and large samples. The HCC-associated RBP genes need to be more accurately identified, and their clinical application value needs to be further explored. METHODS: First, we used the robust rank aggregation (RRA) algorithm to extract HCC-associated RBP genes from nine HCC microarray datasets and verified them in The Cancer Genome Atlas Liver Hepatocellular Carcinoma (TCGA-LIHC) cohort and International Cancer Genome Consortium (ICGC) Japanese liver cancer (ICGC-LIRI-JP) cohort. In addition, the copy number variation (CNV), single-nucleotide variant (SNV), and promoter-region methylation data of HCC-associated RBP genes were analyzed. Using the random forest algorithm, we constructed an RBP gene–based prognostic score system (RBP-score). We then evaluated the ability of RBP-score to predict the prognosis of patients. The relationships between RBP-score and other clinical characteristics of patients were analyzed. RESULTS: The RRA algorithm identified 30 RBP mRNAs with consistent expression patterns across the nine HCC microarray datasets. These 30 RBP genes were defined as HCC-associated RBP genes. Their mRNA expression patterns were further verified in the TCGA-LIHC and ICGC-LIRI-JP cohorts. Among these 30 RBP genes, some showed significant copy number gain or loss, while others showed differences in the methylation levels of their promoter regions. Some RBP genes were risk factors or protective factors for the prognosis of patients. We extracted 10 key HCC-associated RBP genes using the random forest algorithm and constructed an RBP-score system. RBP-score effectively predicted the overall survival (OS) and disease-free survival (DFS) of HCC patients and was associated with the tumor, node, metastasis (TNM) stage, α-fetoprotein (AFP), and metastasis risk. The clinical value of RBP-score was validated in datasets from different platforms. Cox analysis suggested that a high RBP-score was an independent risk factor for poor prognosis in HCC patients. We also successfully established a combined RBP-score+TNM LASSO-Cox model that more accurately predicted the prognosis. CONCLUSION: The RBP-score system constructed based on HCC-associated RBP genes is a simple and highly effective prognostic evaluation tool. It is suitable for different subgroups of HCC patients and has cross-platform characteristics. Combining RBP-score with the TNM staging system or other clinical parameters can lead to an even greater clinical benefit. In addition, the identified HCC-associated RBP genes may serve as novel targets for HCC treatment. PeerJ Inc. 2021-12-20 /pmc/articles/PMC8697767/ /pubmed/35036125 http://dx.doi.org/10.7717/peerj.12572 Text en © 2021 Zhang et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Zhang, Qiangnu Zhang, Yusen Guo, Yusheng Tang, Honggui Li, Mingyue Liu, Liping A novel machine learning derived RNA-binding protein gene–based score system predicts prognosis of hepatocellular carcinoma patients |
title | A novel machine learning derived RNA-binding protein gene–based score system predicts prognosis of hepatocellular carcinoma patients |
title_full | A novel machine learning derived RNA-binding protein gene–based score system predicts prognosis of hepatocellular carcinoma patients |
title_fullStr | A novel machine learning derived RNA-binding protein gene–based score system predicts prognosis of hepatocellular carcinoma patients |
title_full_unstemmed | A novel machine learning derived RNA-binding protein gene–based score system predicts prognosis of hepatocellular carcinoma patients |
title_short | A novel machine learning derived RNA-binding protein gene–based score system predicts prognosis of hepatocellular carcinoma patients |
title_sort | novel machine learning derived rna-binding protein gene–based score system predicts prognosis of hepatocellular carcinoma patients |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8697767/ https://www.ncbi.nlm.nih.gov/pubmed/35036125 http://dx.doi.org/10.7717/peerj.12572 |
work_keys_str_mv | AT zhangqiangnu anovelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients AT zhangyusen anovelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients AT guoyusheng anovelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients AT tanghonggui anovelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients AT limingyue anovelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients AT liuliping anovelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients AT zhangqiangnu novelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients AT zhangyusen novelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients AT guoyusheng novelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients AT tanghonggui novelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients AT limingyue novelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients AT liuliping novelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients |