Cargando…
Random-forest algorithm based biomarkers in predicting prognosis in the patients with hepatocellular carcinoma
BACKGROUND: Hepatocellular carcinoma (HCC) one of the most common digestive system tumors, threatens the tens of thousands of people with high morbidity and mortality world widely. The purpose of our study was to investigate the related genes of HCC and discover their potential abilities to predict...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7302385/ https://www.ncbi.nlm.nih.gov/pubmed/32565735 http://dx.doi.org/10.1186/s12935-020-01274-z |
_version_ | 1783547835489189888 |
---|---|
author | Guo, Lingyun Wang, Zhenjiang Du, Yuanyuan Mao, Jie Zhang, Junqiang Yu, Zeyuan Guo, Jiwu Zhao, Jun Zhou, Huinian Wang, Haitao Gu, Yanmei Li, Yumin |
author_facet | Guo, Lingyun Wang, Zhenjiang Du, Yuanyuan Mao, Jie Zhang, Junqiang Yu, Zeyuan Guo, Jiwu Zhao, Jun Zhou, Huinian Wang, Haitao Gu, Yanmei Li, Yumin |
author_sort | Guo, Lingyun |
collection | PubMed |
description | BACKGROUND: Hepatocellular carcinoma (HCC) one of the most common digestive system tumors, threatens the tens of thousands of people with high morbidity and mortality world widely. The purpose of our study was to investigate the related genes of HCC and discover their potential abilities to predict the prognosis of the patients. METHODS: We obtained RNA sequencing data of HCC from The Cancer Genome Atlas (TCGA) database and performed analysis on protein coding genes. Differentially expressed genes (DEGs) were selected. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment were conducted to discover biological functions of DEGs. Protein and protein interaction (PPI) was performed to investigate hub genes. In addition, a method of supervised machine learning, recursive feature elimination (RFE) based on random forest (RF) classifier, was used to screen for significant biomarkers. And the basic experiment was conducted by lab, we constructe a clinical patients’ database, and obtained the data and results of immunohistochemistry. RESULTS: We identified five biomarkers with significantly high expression to predict survival risk of the HCC patients. These prognostic biomarkers included SPC25, NUF2, MCM2, BLM and AURKA. We also defined a risk score model with these biomarkers to identify the patients who is in high risk. In our single-center experiment, 95 pairs of clinical samples were used to explore the expression levels of NUF2 and BLM in HCC. Immunohistochemical staining results showed that NUF2 and BLM were significantly up-regulated in immunohistochemical staining. High expression levels of NUF2 and BLM indicated poor prognosis. CONCLUSION: Our investigation provided novel prognostic biomarkers and model in HCC and aimed to improve the understanding of HCC. In the results obtained, we also conducted a part of experiments to verify the theory described earlier, The experimental results did verify our theory. |
format | Online Article Text |
id | pubmed-7302385 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-73023852020-06-19 Random-forest algorithm based biomarkers in predicting prognosis in the patients with hepatocellular carcinoma Guo, Lingyun Wang, Zhenjiang Du, Yuanyuan Mao, Jie Zhang, Junqiang Yu, Zeyuan Guo, Jiwu Zhao, Jun Zhou, Huinian Wang, Haitao Gu, Yanmei Li, Yumin Cancer Cell Int Primary Research BACKGROUND: Hepatocellular carcinoma (HCC) one of the most common digestive system tumors, threatens the tens of thousands of people with high morbidity and mortality world widely. The purpose of our study was to investigate the related genes of HCC and discover their potential abilities to predict the prognosis of the patients. METHODS: We obtained RNA sequencing data of HCC from The Cancer Genome Atlas (TCGA) database and performed analysis on protein coding genes. Differentially expressed genes (DEGs) were selected. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment were conducted to discover biological functions of DEGs. Protein and protein interaction (PPI) was performed to investigate hub genes. In addition, a method of supervised machine learning, recursive feature elimination (RFE) based on random forest (RF) classifier, was used to screen for significant biomarkers. And the basic experiment was conducted by lab, we constructe a clinical patients’ database, and obtained the data and results of immunohistochemistry. RESULTS: We identified five biomarkers with significantly high expression to predict survival risk of the HCC patients. These prognostic biomarkers included SPC25, NUF2, MCM2, BLM and AURKA. We also defined a risk score model with these biomarkers to identify the patients who is in high risk. In our single-center experiment, 95 pairs of clinical samples were used to explore the expression levels of NUF2 and BLM in HCC. Immunohistochemical staining results showed that NUF2 and BLM were significantly up-regulated in immunohistochemical staining. High expression levels of NUF2 and BLM indicated poor prognosis. CONCLUSION: Our investigation provided novel prognostic biomarkers and model in HCC and aimed to improve the understanding of HCC. In the results obtained, we also conducted a part of experiments to verify the theory described earlier, The experimental results did verify our theory. BioMed Central 2020-06-17 /pmc/articles/PMC7302385/ /pubmed/32565735 http://dx.doi.org/10.1186/s12935-020-01274-z Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Primary Research Guo, Lingyun Wang, Zhenjiang Du, Yuanyuan Mao, Jie Zhang, Junqiang Yu, Zeyuan Guo, Jiwu Zhao, Jun Zhou, Huinian Wang, Haitao Gu, Yanmei Li, Yumin Random-forest algorithm based biomarkers in predicting prognosis in the patients with hepatocellular carcinoma |
title | Random-forest algorithm based biomarkers in predicting prognosis in the patients with hepatocellular carcinoma |
title_full | Random-forest algorithm based biomarkers in predicting prognosis in the patients with hepatocellular carcinoma |
title_fullStr | Random-forest algorithm based biomarkers in predicting prognosis in the patients with hepatocellular carcinoma |
title_full_unstemmed | Random-forest algorithm based biomarkers in predicting prognosis in the patients with hepatocellular carcinoma |
title_short | Random-forest algorithm based biomarkers in predicting prognosis in the patients with hepatocellular carcinoma |
title_sort | random-forest algorithm based biomarkers in predicting prognosis in the patients with hepatocellular carcinoma |
topic | Primary Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7302385/ https://www.ncbi.nlm.nih.gov/pubmed/32565735 http://dx.doi.org/10.1186/s12935-020-01274-z |
work_keys_str_mv | AT guolingyun randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma AT wangzhenjiang randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma AT duyuanyuan randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma AT maojie randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma AT zhangjunqiang randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma AT yuzeyuan randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma AT guojiwu randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma AT zhaojun randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma AT zhouhuinian randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma AT wanghaitao randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma AT guyanmei randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma AT liyumin randomforestalgorithmbasedbiomarkersinpredictingprognosisinthepatientswithhepatocellularcarcinoma |