Cargando…

The Use of Machine Learning to Create a Risk Score to Predict Survival in Patients with Hepatocellular Carcinoma: A TCGA Cohort Analysis

INTRODUCTION: Hepatocellular carcinoma (HCC) accounts for approximately 90% of primary liver malignancies and is currently the fourth most common cause of cancer-related death worldwide. Due to varying underlying etiologies, the prognosis of HCC differs greatly among patients. It is important to dev...

Descripción completa

Detalles Bibliográficos
Autores principales: Tohme, Samer, Yazdani, Hamza O, Rahman, Amaan, Handu, Sanah, Khan, Sidrah, Wilson, Tanner, Geller, David A, Simmons, Richard L, Molinari, Michele, Kaltenmeier, Christof
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8651371/
https://www.ncbi.nlm.nih.gov/pubmed/34888264
http://dx.doi.org/10.1155/2021/5212953
_version_ 1784611381568339968
author Tohme, Samer
Yazdani, Hamza O
Rahman, Amaan
Handu, Sanah
Khan, Sidrah
Wilson, Tanner
Geller, David A
Simmons, Richard L
Molinari, Michele
Kaltenmeier, Christof
author_facet Tohme, Samer
Yazdani, Hamza O
Rahman, Amaan
Handu, Sanah
Khan, Sidrah
Wilson, Tanner
Geller, David A
Simmons, Richard L
Molinari, Michele
Kaltenmeier, Christof
author_sort Tohme, Samer
collection PubMed
description INTRODUCTION: Hepatocellular carcinoma (HCC) accounts for approximately 90% of primary liver malignancies and is currently the fourth most common cause of cancer-related death worldwide. Due to varying underlying etiologies, the prognosis of HCC differs greatly among patients. It is important to develop ways to help stratify patients upon initial diagnosis to provide optimal treatment modalities and follow-up plans. The current study uses Artificial Neural Network (ANN) and Classification Tree Analysis (CTA) to create a gene signature score that can help predict survival in patients with HCC. METHODS: The Cancer Genome Atlas (TCGA-LIHC) was analyzed for differentially expressed genes. Clinicopathological data were obtained from cBioPortal. ANN analysis of the 75 most significant genes predicting disease-free survival (DFS) was performed. Next, CTA results were used for creation of the scoring system. Cox regression was performed to identify the prognostic value of the scoring system. RESULTS: 363 patients diagnosed with HCC were analyzed in this study. ANN provided 15 genes with normalized importance >50%. CTA resulted in a set of three genes (NRM, STAG3, and SNHG20). Patients were then divided in to 4 groups based on the CTA tree cutoff values. The Kaplan–Meier analysis showed significantly reduced DFS in groups 1, 2, and 3 (median DFS: 29.7 months, 16.1 months, and 11.7 months, p < 0.01) compared to group 0 (median not reached). Similar results were observed when overall survival (OS) was analyzed. On multivariate Cox regression, higher scores were associated with significantly shorter DFS (1 point: HR 2.57 (1.38–4.80), 2 points: 3.91 (2.11–7.24), and 3 points: 5.09 (2.70–9.58), p < 0.01). CONCLUSION: Long-term outcomes of patients with HCC can be predicted using a simplified scoring system based on tumor mRNA gene expression levels. This tool could assist clinicians and researchers in identifying patients at increased risks for recurrence to tailor specific treatment and follow-up strategies for individual patients.
format Online
Article
Text
id pubmed-8651371
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-86513712021-12-08 The Use of Machine Learning to Create a Risk Score to Predict Survival in Patients with Hepatocellular Carcinoma: A TCGA Cohort Analysis Tohme, Samer Yazdani, Hamza O Rahman, Amaan Handu, Sanah Khan, Sidrah Wilson, Tanner Geller, David A Simmons, Richard L Molinari, Michele Kaltenmeier, Christof Can J Gastroenterol Hepatol Research Article INTRODUCTION: Hepatocellular carcinoma (HCC) accounts for approximately 90% of primary liver malignancies and is currently the fourth most common cause of cancer-related death worldwide. Due to varying underlying etiologies, the prognosis of HCC differs greatly among patients. It is important to develop ways to help stratify patients upon initial diagnosis to provide optimal treatment modalities and follow-up plans. The current study uses Artificial Neural Network (ANN) and Classification Tree Analysis (CTA) to create a gene signature score that can help predict survival in patients with HCC. METHODS: The Cancer Genome Atlas (TCGA-LIHC) was analyzed for differentially expressed genes. Clinicopathological data were obtained from cBioPortal. ANN analysis of the 75 most significant genes predicting disease-free survival (DFS) was performed. Next, CTA results were used for creation of the scoring system. Cox regression was performed to identify the prognostic value of the scoring system. RESULTS: 363 patients diagnosed with HCC were analyzed in this study. ANN provided 15 genes with normalized importance >50%. CTA resulted in a set of three genes (NRM, STAG3, and SNHG20). Patients were then divided in to 4 groups based on the CTA tree cutoff values. The Kaplan–Meier analysis showed significantly reduced DFS in groups 1, 2, and 3 (median DFS: 29.7 months, 16.1 months, and 11.7 months, p < 0.01) compared to group 0 (median not reached). Similar results were observed when overall survival (OS) was analyzed. On multivariate Cox regression, higher scores were associated with significantly shorter DFS (1 point: HR 2.57 (1.38–4.80), 2 points: 3.91 (2.11–7.24), and 3 points: 5.09 (2.70–9.58), p < 0.01). CONCLUSION: Long-term outcomes of patients with HCC can be predicted using a simplified scoring system based on tumor mRNA gene expression levels. This tool could assist clinicians and researchers in identifying patients at increased risks for recurrence to tailor specific treatment and follow-up strategies for individual patients. Hindawi 2021-11-30 /pmc/articles/PMC8651371/ /pubmed/34888264 http://dx.doi.org/10.1155/2021/5212953 Text en Copyright © 2021 Samer Tohme et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Tohme, Samer
Yazdani, Hamza O
Rahman, Amaan
Handu, Sanah
Khan, Sidrah
Wilson, Tanner
Geller, David A
Simmons, Richard L
Molinari, Michele
Kaltenmeier, Christof
The Use of Machine Learning to Create a Risk Score to Predict Survival in Patients with Hepatocellular Carcinoma: A TCGA Cohort Analysis
title The Use of Machine Learning to Create a Risk Score to Predict Survival in Patients with Hepatocellular Carcinoma: A TCGA Cohort Analysis
title_full The Use of Machine Learning to Create a Risk Score to Predict Survival in Patients with Hepatocellular Carcinoma: A TCGA Cohort Analysis
title_fullStr The Use of Machine Learning to Create a Risk Score to Predict Survival in Patients with Hepatocellular Carcinoma: A TCGA Cohort Analysis
title_full_unstemmed The Use of Machine Learning to Create a Risk Score to Predict Survival in Patients with Hepatocellular Carcinoma: A TCGA Cohort Analysis
title_short The Use of Machine Learning to Create a Risk Score to Predict Survival in Patients with Hepatocellular Carcinoma: A TCGA Cohort Analysis
title_sort use of machine learning to create a risk score to predict survival in patients with hepatocellular carcinoma: a tcga cohort analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8651371/
https://www.ncbi.nlm.nih.gov/pubmed/34888264
http://dx.doi.org/10.1155/2021/5212953
work_keys_str_mv AT tohmesamer theuseofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT yazdanihamzao theuseofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT rahmanamaan theuseofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT handusanah theuseofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT khansidrah theuseofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT wilsontanner theuseofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT gellerdavida theuseofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT simmonsrichardl theuseofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT molinarimichele theuseofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT kaltenmeierchristof theuseofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT tohmesamer useofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT yazdanihamzao useofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT rahmanamaan useofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT handusanah useofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT khansidrah useofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT wilsontanner useofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT gellerdavida useofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT simmonsrichardl useofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT molinarimichele useofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis
AT kaltenmeierchristof useofmachinelearningtocreateariskscoretopredictsurvivalinpatientswithhepatocellularcarcinomaatcgacohortanalysis