Cargando…

A novel machine learning derived RNA-binding protein gene–based score system predicts prognosis of hepatocellular carcinoma patients

BACKGROUND: Although the expression of RNA-binding protein (RBP) genes in hepatocellular carcinoma (HCC) varies and is associated with tumor progression, there has been no overview study with multiple cohorts and large samples. The HCC-associated RBP genes need to be more accurately identified, and...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Qiangnu, Zhang, Yusen, Guo, Yusheng, Tang, Honggui, Li, Mingyue, Liu, Liping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8697767/
https://www.ncbi.nlm.nih.gov/pubmed/35036125
http://dx.doi.org/10.7717/peerj.12572
_version_ 1784620116871217152
author Zhang, Qiangnu
Zhang, Yusen
Guo, Yusheng
Tang, Honggui
Li, Mingyue
Liu, Liping
author_facet Zhang, Qiangnu
Zhang, Yusen
Guo, Yusheng
Tang, Honggui
Li, Mingyue
Liu, Liping
author_sort Zhang, Qiangnu
collection PubMed
description BACKGROUND: Although the expression of RNA-binding protein (RBP) genes in hepatocellular carcinoma (HCC) varies and is associated with tumor progression, there has been no overview study with multiple cohorts and large samples. The HCC-associated RBP genes need to be more accurately identified, and their clinical application value needs to be further explored. METHODS: First, we used the robust rank aggregation (RRA) algorithm to extract HCC-associated RBP genes from nine HCC microarray datasets and verified them in The Cancer Genome Atlas Liver Hepatocellular Carcinoma (TCGA-LIHC) cohort and International Cancer Genome Consortium (ICGC) Japanese liver cancer (ICGC-LIRI-JP) cohort. In addition, the copy number variation (CNV), single-nucleotide variant (SNV), and promoter-region methylation data of HCC-associated RBP genes were analyzed. Using the random forest algorithm, we constructed an RBP gene–based prognostic score system (RBP-score). We then evaluated the ability of RBP-score to predict the prognosis of patients. The relationships between RBP-score and other clinical characteristics of patients were analyzed. RESULTS: The RRA algorithm identified 30 RBP mRNAs with consistent expression patterns across the nine HCC microarray datasets. These 30 RBP genes were defined as HCC-associated RBP genes. Their mRNA expression patterns were further verified in the TCGA-LIHC and ICGC-LIRI-JP cohorts. Among these 30 RBP genes, some showed significant copy number gain or loss, while others showed differences in the methylation levels of their promoter regions. Some RBP genes were risk factors or protective factors for the prognosis of patients. We extracted 10 key HCC-associated RBP genes using the random forest algorithm and constructed an RBP-score system. RBP-score effectively predicted the overall survival (OS) and disease-free survival (DFS) of HCC patients and was associated with the tumor, node, metastasis (TNM) stage, α-fetoprotein (AFP), and metastasis risk. The clinical value of RBP-score was validated in datasets from different platforms. Cox analysis suggested that a high RBP-score was an independent risk factor for poor prognosis in HCC patients. We also successfully established a combined RBP-score+TNM LASSO-Cox model that more accurately predicted the prognosis. CONCLUSION: The RBP-score system constructed based on HCC-associated RBP genes is a simple and highly effective prognostic evaluation tool. It is suitable for different subgroups of HCC patients and has cross-platform characteristics. Combining RBP-score with the TNM staging system or other clinical parameters can lead to an even greater clinical benefit. In addition, the identified HCC-associated RBP genes may serve as novel targets for HCC treatment.
format Online
Article
Text
id pubmed-8697767
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-86977672022-01-14 A novel machine learning derived RNA-binding protein gene–based score system predicts prognosis of hepatocellular carcinoma patients Zhang, Qiangnu Zhang, Yusen Guo, Yusheng Tang, Honggui Li, Mingyue Liu, Liping PeerJ Bioinformatics BACKGROUND: Although the expression of RNA-binding protein (RBP) genes in hepatocellular carcinoma (HCC) varies and is associated with tumor progression, there has been no overview study with multiple cohorts and large samples. The HCC-associated RBP genes need to be more accurately identified, and their clinical application value needs to be further explored. METHODS: First, we used the robust rank aggregation (RRA) algorithm to extract HCC-associated RBP genes from nine HCC microarray datasets and verified them in The Cancer Genome Atlas Liver Hepatocellular Carcinoma (TCGA-LIHC) cohort and International Cancer Genome Consortium (ICGC) Japanese liver cancer (ICGC-LIRI-JP) cohort. In addition, the copy number variation (CNV), single-nucleotide variant (SNV), and promoter-region methylation data of HCC-associated RBP genes were analyzed. Using the random forest algorithm, we constructed an RBP gene–based prognostic score system (RBP-score). We then evaluated the ability of RBP-score to predict the prognosis of patients. The relationships between RBP-score and other clinical characteristics of patients were analyzed. RESULTS: The RRA algorithm identified 30 RBP mRNAs with consistent expression patterns across the nine HCC microarray datasets. These 30 RBP genes were defined as HCC-associated RBP genes. Their mRNA expression patterns were further verified in the TCGA-LIHC and ICGC-LIRI-JP cohorts. Among these 30 RBP genes, some showed significant copy number gain or loss, while others showed differences in the methylation levels of their promoter regions. Some RBP genes were risk factors or protective factors for the prognosis of patients. We extracted 10 key HCC-associated RBP genes using the random forest algorithm and constructed an RBP-score system. RBP-score effectively predicted the overall survival (OS) and disease-free survival (DFS) of HCC patients and was associated with the tumor, node, metastasis (TNM) stage, α-fetoprotein (AFP), and metastasis risk. The clinical value of RBP-score was validated in datasets from different platforms. Cox analysis suggested that a high RBP-score was an independent risk factor for poor prognosis in HCC patients. We also successfully established a combined RBP-score+TNM LASSO-Cox model that more accurately predicted the prognosis. CONCLUSION: The RBP-score system constructed based on HCC-associated RBP genes is a simple and highly effective prognostic evaluation tool. It is suitable for different subgroups of HCC patients and has cross-platform characteristics. Combining RBP-score with the TNM staging system or other clinical parameters can lead to an even greater clinical benefit. In addition, the identified HCC-associated RBP genes may serve as novel targets for HCC treatment. PeerJ Inc. 2021-12-20 /pmc/articles/PMC8697767/ /pubmed/35036125 http://dx.doi.org/10.7717/peerj.12572 Text en © 2021 Zhang et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Zhang, Qiangnu
Zhang, Yusen
Guo, Yusheng
Tang, Honggui
Li, Mingyue
Liu, Liping
A novel machine learning derived RNA-binding protein gene–based score system predicts prognosis of hepatocellular carcinoma patients
title A novel machine learning derived RNA-binding protein gene–based score system predicts prognosis of hepatocellular carcinoma patients
title_full A novel machine learning derived RNA-binding protein gene–based score system predicts prognosis of hepatocellular carcinoma patients
title_fullStr A novel machine learning derived RNA-binding protein gene–based score system predicts prognosis of hepatocellular carcinoma patients
title_full_unstemmed A novel machine learning derived RNA-binding protein gene–based score system predicts prognosis of hepatocellular carcinoma patients
title_short A novel machine learning derived RNA-binding protein gene–based score system predicts prognosis of hepatocellular carcinoma patients
title_sort novel machine learning derived rna-binding protein gene–based score system predicts prognosis of hepatocellular carcinoma patients
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8697767/
https://www.ncbi.nlm.nih.gov/pubmed/35036125
http://dx.doi.org/10.7717/peerj.12572
work_keys_str_mv AT zhangqiangnu anovelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients
AT zhangyusen anovelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients
AT guoyusheng anovelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients
AT tanghonggui anovelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients
AT limingyue anovelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients
AT liuliping anovelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients
AT zhangqiangnu novelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients
AT zhangyusen novelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients
AT guoyusheng novelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients
AT tanghonggui novelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients
AT limingyue novelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients
AT liuliping novelmachinelearningderivedrnabindingproteingenebasedscoresystempredictsprognosisofhepatocellularcarcinomapatients