Cargando…

Random-forest algorithm based biomarkers in predicting prognosis in the patients with hepatocellular carcinoma

BACKGROUND: Hepatocellular carcinoma (HCC) one of the most common digestive system tumors, threatens the tens of thousands of people with high morbidity and mortality world widely. The purpose of our study was to investigate the related genes of HCC and discover their potential abilities to predict...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Lingyun, Wang, Zhenjiang, Du, Yuanyuan, Mao, Jie, Zhang, Junqiang, Yu, Zeyuan, Guo, Jiwu, Zhao, Jun, Zhou, Huinian, Wang, Haitao, Gu, Yanmei, Li, Yumin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7302385/
https://www.ncbi.nlm.nih.gov/pubmed/32565735
http://dx.doi.org/10.1186/s12935-020-01274-z
_version_ 1783547835489189888
author Guo, Lingyun
Wang, Zhenjiang
Du, Yuanyuan
Mao, Jie
Zhang, Junqiang
Yu, Zeyuan
Guo, Jiwu
Zhao, Jun
Zhou, Huinian
Wang, Haitao
Gu, Yanmei
Li, Yumin
author_facet Guo, Lingyun
Wang, Zhenjiang
Du, Yuanyuan
Mao, Jie
Zhang, Junqiang
Yu, Zeyuan
Guo, Jiwu
Zhao, Jun
Zhou, Huinian
Wang, Haitao
Gu, Yanmei
Li, Yumin
author_sort Guo, Lingyun
collection PubMed
description BACKGROUND: Hepatocellular carcinoma (HCC) one of the most common digestive system tumors, threatens the tens of thousands of people with high morbidity and mortality world widely. The purpose of our study was to investigate the related genes of HCC and discover their potential abilities to predict the prognosis of the patients. METHODS: We obtained RNA sequencing data of HCC from The Cancer Genome Atlas (TCGA) database and performed analysis on protein coding genes. Differentially expressed genes (DEGs) were selected. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment were conducted to discover biological functions of DEGs. Protein and protein interaction (PPI) was performed to investigate hub genes. In addition, a method of supervised machine learning, recursive feature elimination (RFE) based on random forest (RF) classifier, was used to screen for significant biomarkers. And the basic experiment was conducted by lab, we constructe a clinical patients’ database, and obtained the data and results of immunohistochemistry. RESULTS: We identified five biomarkers with significantly high expression to predict survival risk of the HCC patients. These prognostic biomarkers included SPC25, NUF2, MCM2, BLM and AURKA. We also defined a risk score model with these biomarkers to identify the patients who is in high risk. In our single-center experiment, 95 pairs of clinical samples were used to explore the expression levels of NUF2 and BLM in HCC. Immunohistochemical staining results showed that NUF2 and BLM were significantly up-regulated in immunohistochemical staining. High expression levels of NUF2 and BLM indicated poor prognosis. CONCLUSION: Our investigation provided novel prognostic biomarkers and model in HCC and aimed to improve the understanding of HCC. In the results obtained, we also conducted a part of experiments to verify the theory described earlier, The experimental results did verify our theory.
format Online
Article
Text
id pubmed-7302385
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-73023852020-06-19 Random-forest algorithm based biomarkers in predicting prognosis in the patients with hepatocellular carcinoma Guo, Lingyun Wang, Zhenjiang Du, Yuanyuan Mao, Jie Zhang, Junqiang Yu, Zeyuan Guo, Jiwu Zhao, Jun Zhou, Huinian Wang, Haitao Gu, Yanmei Li, Yumin Cancer Cell Int Primary Research BACKGROUND: Hepatocellular carcinoma (HCC) one of the most common digestive system tumors, threatens the tens of thousands of people with high morbidity and mortality world widely. The purpose of our study was to investigate the related genes of HCC and discover their potential abilities to predict the prognosis of the patients. METHODS: We obtained RNA sequencing data of HCC from The Cancer Genome Atlas (TCGA) database and performed analysis on protein coding genes. Differentially expressed genes (DEGs) were selected. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment were conducted to discover biological functions of DEGs. Protein and protein interaction (PPI) was performed to investigate hub genes. In addition, a method of supervised machine learning, recursive feature elimination (RFE) based on random forest (RF) classifier, was used to screen for significant biomarkers. And the basic experiment was conducted by lab, we constructe a clinical patients’ database, and obtained the data and results of immunohistochemistry. RESULTS: We identified five biomarkers with significantly high expression to predict survival risk of the HCC patients. These prognostic biomarkers included SPC25, NUF2, MCM2, BLM and AURKA. We also defined a risk score model with these biomarkers to identify the patients who is in high risk. In our single-center experiment, 95 pairs of clinical samples were used to explore the expression levels of NUF2 and BLM in HCC. Immunohistochemical staining results showed that NUF2 and BLM were significantly up-regulated in immunohistochemical staining. High expression levels of NUF2 and BLM indicated poor prognosis. CONCLUSION: Our investigation provided novel prognostic biomarkers and model in HCC and aimed to improve the understanding of HCC. In the results obtained, we also conducted a part of experiments to verify the theory described earlier, The experimental results did verify our theory. BioMed Central 2020-06-17 /pmc/articles/PMC7302385/ /pubmed/32565735 http://dx.doi.org/10.1186/s12935-020-01274-z Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Primary Research
Guo, Lingyun
Wang, Zhenjiang
Du, Yuanyuan
Mao, Jie
Zhang, Junqiang
Yu, Zeyuan
Guo, Jiwu
Zhao, Jun
Zhou, Huinian
Wang, Haitao
Gu, Yanmei
Li, Yumin
Random-forest algorithm based biomarkers in predicting prognosis in the patients with hepatocellular carcinoma
title Random-forest algorithm based biomarkers in predicting prognosis in the patients with hepatocellular carcinoma
title_full Random-forest algorithm based biomarkers in predicting prognosis in the patients with hepatocellular carcinoma
title_fullStr Random-forest algorithm based biomarkers in predicting prognosis in the patients with hepatocellular carcinoma
title_full_unstemmed Random-forest algorithm based biomarkers in predicting prognosis in the patients with hepatocellular carcinoma
title_short Random-forest algorithm based biomarkers in predicting prognosis in the patients with hepatocellular carcinoma
title_sort random-forest algorithm based biomarkers in predicting prognosis in the patients with hepatocellular carcinoma
topic Primary Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7302385/
https://www.ncbi.nlm.nih.gov/pubmed/32565735
http://dx.doi.org/10.1186/s12935-020-01274-z
work_keys_str_mv AT guolingyun randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma
AT wangzhenjiang randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma
AT duyuanyuan randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma
AT maojie randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma
AT zhangjunqiang randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma
AT yuzeyuan randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma
AT guojiwu randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma
AT zhaojun randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma
AT zhouhuinian randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma
AT wanghaitao randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma
AT guyanmei randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma
AT liyumin randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma