Cargando…
Predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using Random Forest
Traditional clinical features are not sufficient to accurately judge the prognosis of endometrioid endometrial adenocarcinoma (EEA). Molecular biological characteristics and traditional clinical features are particularly important in the prognosis of EEA. The aim of the present study was to establis...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
D.A. Spandidos
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6607378/ https://www.ncbi.nlm.nih.gov/pubmed/31423227 http://dx.doi.org/10.3892/ol.2019.10504 |
_version_ | 1783432085542797312 |
---|---|
author | Yin, Fufen Shao, Xingyang Zhao, Lijun Li, Xiaoping Zhou, Jingyi Cheng, Yuan He, Xiangjun Lei, Shu Li, Jiangeng Wang, Jianliu |
author_facet | Yin, Fufen Shao, Xingyang Zhao, Lijun Li, Xiaoping Zhou, Jingyi Cheng, Yuan He, Xiangjun Lei, Shu Li, Jiangeng Wang, Jianliu |
author_sort | Yin, Fufen |
collection | PubMed |
description | Traditional clinical features are not sufficient to accurately judge the prognosis of endometrioid endometrial adenocarcinoma (EEA). Molecular biological characteristics and traditional clinical features are particularly important in the prognosis of EEA. The aim of the present study was to establish a predictive model that considers genes and clinical features for the prognosis of EEA. The clinical and RNA sequencing expression data of EEA were derived from samples from The Cancer Genome Atlas (TCGA) and Peking University People's Hospital (PKUPH; Beijing, China). Samples from TCGA were used as the training set, and samples from the PKUPH were used as the testing set. Variable selection using Random Forests (VSURF) was used to select the genes and clinical features on the basis of TCGA samples. The RF classification method was used to establish the prediction model. Kaplan-Meier curves were tested with the log-rank test. The results from this study demonstrated that on the basis of TCGA samples, 11 genes and the grade were selected as the input features. In the training set, the out-of-bag (OOB) error of RF model-1, which was established using the ‘11 genes’, was 0.15; the OOB error of RF model-2, which was established using the ‘grade’, was 0.39; and the OOB error of RF model-3, established using the ‘11 genes and grade’, was 0.15. In the testing set, the classification accuracy of RF model-1, model-2 and model-3 was 71.43, 66.67 and 80.95%, respectively. In conclusion, to the best of our knowledge, the VSURF was used to select features relevant to EEA prognosis, and an EEA predictive model combining genes and traditional features was established for the first time in the present study. The prediction accuracy of the RF model on the basis of the 11 genes and grade was markedly higher than that of the RF models established by either the 11 genes or grade alone. |
format | Online Article Text |
id | pubmed-6607378 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | D.A. Spandidos |
record_format | MEDLINE/PubMed |
spelling | pubmed-66073782019-08-18 Predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using Random Forest Yin, Fufen Shao, Xingyang Zhao, Lijun Li, Xiaoping Zhou, Jingyi Cheng, Yuan He, Xiangjun Lei, Shu Li, Jiangeng Wang, Jianliu Oncol Lett Articles Traditional clinical features are not sufficient to accurately judge the prognosis of endometrioid endometrial adenocarcinoma (EEA). Molecular biological characteristics and traditional clinical features are particularly important in the prognosis of EEA. The aim of the present study was to establish a predictive model that considers genes and clinical features for the prognosis of EEA. The clinical and RNA sequencing expression data of EEA were derived from samples from The Cancer Genome Atlas (TCGA) and Peking University People's Hospital (PKUPH; Beijing, China). Samples from TCGA were used as the training set, and samples from the PKUPH were used as the testing set. Variable selection using Random Forests (VSURF) was used to select the genes and clinical features on the basis of TCGA samples. The RF classification method was used to establish the prediction model. Kaplan-Meier curves were tested with the log-rank test. The results from this study demonstrated that on the basis of TCGA samples, 11 genes and the grade were selected as the input features. In the training set, the out-of-bag (OOB) error of RF model-1, which was established using the ‘11 genes’, was 0.15; the OOB error of RF model-2, which was established using the ‘grade’, was 0.39; and the OOB error of RF model-3, established using the ‘11 genes and grade’, was 0.15. In the testing set, the classification accuracy of RF model-1, model-2 and model-3 was 71.43, 66.67 and 80.95%, respectively. In conclusion, to the best of our knowledge, the VSURF was used to select features relevant to EEA prognosis, and an EEA predictive model combining genes and traditional features was established for the first time in the present study. The prediction accuracy of the RF model on the basis of the 11 genes and grade was markedly higher than that of the RF models established by either the 11 genes or grade alone. D.A. Spandidos 2019-08 2019-06-20 /pmc/articles/PMC6607378/ /pubmed/31423227 http://dx.doi.org/10.3892/ol.2019.10504 Text en Copyright: © Yin et al. This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made. |
spellingShingle | Articles Yin, Fufen Shao, Xingyang Zhao, Lijun Li, Xiaoping Zhou, Jingyi Cheng, Yuan He, Xiangjun Lei, Shu Li, Jiangeng Wang, Jianliu Predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using Random Forest |
title | Predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using Random Forest |
title_full | Predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using Random Forest |
title_fullStr | Predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using Random Forest |
title_full_unstemmed | Predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using Random Forest |
title_short | Predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using Random Forest |
title_sort | predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using random forest |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6607378/ https://www.ncbi.nlm.nih.gov/pubmed/31423227 http://dx.doi.org/10.3892/ol.2019.10504 |
work_keys_str_mv | AT yinfufen predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest AT shaoxingyang predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest AT zhaolijun predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest AT lixiaoping predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest AT zhoujingyi predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest AT chengyuan predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest AT hexiangjun predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest AT leishu predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest AT lijiangeng predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest AT wangjianliu predictingprognosisofendometrioidendometrialadenocarcinomaonthebasisofgeneexpressionandclinicalfeaturesusingrandomforest |