Cargando…

Identification of a novel gene signature for the prediction of recurrence in HCC patients by machine learning of genome-wide databases

Hepatocellular carcinoma (HCC) is a common malignant tumor in China. In the present study, we aimed to construct and verify a prediction model of recurrence in HCC patients using databases (TCGA, AMC and Inserm) and machine learning methods and obtain the gene signature that could predict early rela...

Descripción completa

Detalles Bibliográficos
Autores principales: Shen, Jie, Qi, Liang, Zou, Zhengyun, Du, Juan, Kong, Weiwei, Zhao, Lianjun, Wei, Jia, Lin, Ling, Ren, Min, Liu, Baorui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7064516/
https://www.ncbi.nlm.nih.gov/pubmed/32157118
http://dx.doi.org/10.1038/s41598-020-61298-3
_version_ 1783504885873901568
author Shen, Jie
Qi, Liang
Zou, Zhengyun
Du, Juan
Kong, Weiwei
Zhao, Lianjun
Wei, Jia
Lin, Ling
Ren, Min
Liu, Baorui
author_facet Shen, Jie
Qi, Liang
Zou, Zhengyun
Du, Juan
Kong, Weiwei
Zhao, Lianjun
Wei, Jia
Lin, Ling
Ren, Min
Liu, Baorui
author_sort Shen, Jie
collection PubMed
description Hepatocellular carcinoma (HCC) is a common malignant tumor in China. In the present study, we aimed to construct and verify a prediction model of recurrence in HCC patients using databases (TCGA, AMC and Inserm) and machine learning methods and obtain the gene signature that could predict early relapse of HCC. Statistical methods, such as feature selection, survival analysis and Chi-Square test in R software, were used to analyze and select mutant genes related to disease free survival (DFS), race and vascular invasion. In addition, whole-exome sequencing was performed on 10 HCC patients recruited from our center, and the sequencing results were compared with the databases. Using the databases and machine learning methods, the prediction model of recurrence was constructed and optimized, and the selected mutant genes were verified in the test group. The accuracy of prediction was 74.19%. Moreover, these 10 patients from our center were used to verify these mutant genes and the prediction model, and a success rate of 80% was achieved. Collectively, we discovered recurrence-related genes and established recurrence prediction model of recurrence for HCC patients, which could provide significant guidance for clinical prediction of recurrence.
format Online
Article
Text
id pubmed-7064516
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-70645162020-03-18 Identification of a novel gene signature for the prediction of recurrence in HCC patients by machine learning of genome-wide databases Shen, Jie Qi, Liang Zou, Zhengyun Du, Juan Kong, Weiwei Zhao, Lianjun Wei, Jia Lin, Ling Ren, Min Liu, Baorui Sci Rep Article Hepatocellular carcinoma (HCC) is a common malignant tumor in China. In the present study, we aimed to construct and verify a prediction model of recurrence in HCC patients using databases (TCGA, AMC and Inserm) and machine learning methods and obtain the gene signature that could predict early relapse of HCC. Statistical methods, such as feature selection, survival analysis and Chi-Square test in R software, were used to analyze and select mutant genes related to disease free survival (DFS), race and vascular invasion. In addition, whole-exome sequencing was performed on 10 HCC patients recruited from our center, and the sequencing results were compared with the databases. Using the databases and machine learning methods, the prediction model of recurrence was constructed and optimized, and the selected mutant genes were verified in the test group. The accuracy of prediction was 74.19%. Moreover, these 10 patients from our center were used to verify these mutant genes and the prediction model, and a success rate of 80% was achieved. Collectively, we discovered recurrence-related genes and established recurrence prediction model of recurrence for HCC patients, which could provide significant guidance for clinical prediction of recurrence. Nature Publishing Group UK 2020-03-10 /pmc/articles/PMC7064516/ /pubmed/32157118 http://dx.doi.org/10.1038/s41598-020-61298-3 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Shen, Jie
Qi, Liang
Zou, Zhengyun
Du, Juan
Kong, Weiwei
Zhao, Lianjun
Wei, Jia
Lin, Ling
Ren, Min
Liu, Baorui
Identification of a novel gene signature for the prediction of recurrence in HCC patients by machine learning of genome-wide databases
title Identification of a novel gene signature for the prediction of recurrence in HCC patients by machine learning of genome-wide databases
title_full Identification of a novel gene signature for the prediction of recurrence in HCC patients by machine learning of genome-wide databases
title_fullStr Identification of a novel gene signature for the prediction of recurrence in HCC patients by machine learning of genome-wide databases
title_full_unstemmed Identification of a novel gene signature for the prediction of recurrence in HCC patients by machine learning of genome-wide databases
title_short Identification of a novel gene signature for the prediction of recurrence in HCC patients by machine learning of genome-wide databases
title_sort identification of a novel gene signature for the prediction of recurrence in hcc patients by machine learning of genome-wide databases
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7064516/
https://www.ncbi.nlm.nih.gov/pubmed/32157118
http://dx.doi.org/10.1038/s41598-020-61298-3
work_keys_str_mv AT shenjie identificationofanovelgenesignatureforthepredictionofrecurrenceinhccpatientsbymachinelearningofgenomewidedatabases
AT qiliang identificationofanovelgenesignatureforthepredictionofrecurrenceinhccpatientsbymachinelearningofgenomewidedatabases
AT zouzhengyun identificationofanovelgenesignatureforthepredictionofrecurrenceinhccpatientsbymachinelearningofgenomewidedatabases
AT dujuan identificationofanovelgenesignatureforthepredictionofrecurrenceinhccpatientsbymachinelearningofgenomewidedatabases
AT kongweiwei identificationofanovelgenesignatureforthepredictionofrecurrenceinhccpatientsbymachinelearningofgenomewidedatabases
AT zhaolianjun identificationofanovelgenesignatureforthepredictionofrecurrenceinhccpatientsbymachinelearningofgenomewidedatabases
AT weijia identificationofanovelgenesignatureforthepredictionofrecurrenceinhccpatientsbymachinelearningofgenomewidedatabases
AT linling identificationofanovelgenesignatureforthepredictionofrecurrenceinhccpatientsbymachinelearningofgenomewidedatabases
AT renmin identificationofanovelgenesignatureforthepredictionofrecurrenceinhccpatientsbymachinelearningofgenomewidedatabases
AT liubaorui identificationofanovelgenesignatureforthepredictionofrecurrenceinhccpatientsbymachinelearningofgenomewidedatabases