Cargando…

Predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using Random Forest

Traditional clinical features are not sufficient to accurately judge the prognosis of endometrioid endometrial adenocarcinoma (EEA). Molecular biological characteristics and traditional clinical features are particularly important in the prognosis of EEA. The aim of the present study was to establis...

Descripción completa

Detalles Bibliográficos
Autores principales: Yin, Fufen, Shao, Xingyang, Zhao, Lijun, Li, Xiaoping, Zhou, Jingyi, Cheng, Yuan, He, Xiangjun, Lei, Shu, Li, Jiangeng, Wang, Jianliu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: D.A. Spandidos 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6607378/
https://www.ncbi.nlm.nih.gov/pubmed/31423227
http://dx.doi.org/10.3892/ol.2019.10504
_version_ 1783432085542797312
author Yin, Fufen
Shao, Xingyang
Zhao, Lijun
Li, Xiaoping
Zhou, Jingyi
Cheng, Yuan
He, Xiangjun
Lei, Shu
Li, Jiangeng
Wang, Jianliu
author_facet Yin, Fufen
Shao, Xingyang
Zhao, Lijun
Li, Xiaoping
Zhou, Jingyi
Cheng, Yuan
He, Xiangjun
Lei, Shu
Li, Jiangeng
Wang, Jianliu
author_sort Yin, Fufen
collection PubMed
description Traditional clinical features are not sufficient to accurately judge the prognosis of endometrioid endometrial adenocarcinoma (EEA). Molecular biological characteristics and traditional clinical features are particularly important in the prognosis of EEA. The aim of the present study was to establish a predictive model that considers genes and clinical features for the prognosis of EEA. The clinical and RNA sequencing expression data of EEA were derived from samples from The Cancer Genome Atlas (TCGA) and Peking University People's Hospital (PKUPH; Beijing, China). Samples from TCGA were used as the training set, and samples from the PKUPH were used as the testing set. Variable selection using Random Forests (VSURF) was used to select the genes and clinical features on the basis of TCGA samples. The RF classification method was used to establish the prediction model. Kaplan-Meier curves were tested with the log-rank test. The results from this study demonstrated that on the basis of TCGA samples, 11 genes and the grade were selected as the input features. In the training set, the out-of-bag (OOB) error of RF model-1, which was established using the ‘11 genes’, was 0.15; the OOB error of RF model-2, which was established using the ‘grade’, was 0.39; and the OOB error of RF model-3, established using the ‘11 genes and grade’, was 0.15. In the testing set, the classification accuracy of RF model-1, model-2 and model-3 was 71.43, 66.67 and 80.95%, respectively. In conclusion, to the best of our knowledge, the VSURF was used to select features relevant to EEA prognosis, and an EEA predictive model combining genes and traditional features was established for the first time in the present study. The prediction accuracy of the RF model on the basis of the 11 genes and grade was markedly higher than that of the RF models established by either the 11 genes or grade alone.
format Online
Article
Text
id pubmed-6607378
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher D.A. Spandidos
record_format MEDLINE/PubMed
spelling pubmed-66073782019-08-18 Predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using Random Forest Yin, Fufen Shao, Xingyang Zhao, Lijun Li, Xiaoping Zhou, Jingyi Cheng, Yuan He, Xiangjun Lei, Shu Li, Jiangeng Wang, Jianliu Oncol Lett Articles Traditional clinical features are not sufficient to accurately judge the prognosis of endometrioid endometrial adenocarcinoma (EEA). Molecular biological characteristics and traditional clinical features are particularly important in the prognosis of EEA. The aim of the present study was to establish a predictive model that considers genes and clinical features for the prognosis of EEA. The clinical and RNA sequencing expression data of EEA were derived from samples from The Cancer Genome Atlas (TCGA) and Peking University People's Hospital (PKUPH; Beijing, China). Samples from TCGA were used as the training set, and samples from the PKUPH were used as the testing set. Variable selection using Random Forests (VSURF) was used to select the genes and clinical features on the basis of TCGA samples. The RF classification method was used to establish the prediction model. Kaplan-Meier curves were tested with the log-rank test. The results from this study demonstrated that on the basis of TCGA samples, 11 genes and the grade were selected as the input features. In the training set, the out-of-bag (OOB) error of RF model-1, which was established using the ‘11 genes’, was 0.15; the OOB error of RF model-2, which was established using the ‘grade’, was 0.39; and the OOB error of RF model-3, established using the ‘11 genes and grade’, was 0.15. In the testing set, the classification accuracy of RF model-1, model-2 and model-3 was 71.43, 66.67 and 80.95%, respectively. In conclusion, to the best of our knowledge, the VSURF was used to select features relevant to EEA prognosis, and an EEA predictive model combining genes and traditional features was established for the first time in the present study. The prediction accuracy of the RF model on the basis of the 11 genes and grade was markedly higher than that of the RF models established by either the 11 genes or grade alone. D.A. Spandidos 2019-08 2019-06-20 /pmc/articles/PMC6607378/ /pubmed/31423227 http://dx.doi.org/10.3892/ol.2019.10504 Text en Copyright: © Yin et al. This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.
spellingShingle Articles
Yin, Fufen
Shao, Xingyang
Zhao, Lijun
Li, Xiaoping
Zhou, Jingyi
Cheng, Yuan
He, Xiangjun
Lei, Shu
Li, Jiangeng
Wang, Jianliu
Predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using Random Forest
title Predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using Random Forest
title_full Predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using Random Forest
title_fullStr Predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using Random Forest
title_full_unstemmed Predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using Random Forest
title_short Predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using Random Forest
title_sort predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using random forest
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6607378/
https://www.ncbi.nlm.nih.gov/pubmed/31423227
http://dx.doi.org/10.3892/ol.2019.10504
work_keys_str_mv AT yinfufen predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest
AT shaoxingyang predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest
AT zhaolijun predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest
AT lixiaoping predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest
AT zhoujingyi predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest
AT chengyuan predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest
AT hexiangjun predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest
AT leishu predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest
AT lijiangeng predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest
AT wangjianliu predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest