Cargando…

Multiple Machine Learnings Revealed Similar Predictive Accuracy for Prognosis of PNETs from the Surveillance, Epidemiology, and End Result Database

Background: Prognosis prediction is indispensable in clinical practice and machine learning has been proved to be helpful. We expected to predict survival of pancreatic neuroendocrine tumors (PNETs) with machine learning, and compared it with the American Joint Committee on Cancer (AJCC) staging sys...

Descripción completa

Detalles Bibliográficos
Autores principales: Song, Yiyan, Gao, Shaowei, Tan, Wulin, Qiu, Zeting, Zhou, Huaqiang, Zhao, Yue
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Ivyspring International Publisher 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6218767/
https://www.ncbi.nlm.nih.gov/pubmed/30410601
http://dx.doi.org/10.7150/jca.26649
_version_ 1783368525173227520
author Song, Yiyan
Gao, Shaowei
Tan, Wulin
Qiu, Zeting
Zhou, Huaqiang
Zhao, Yue
author_facet Song, Yiyan
Gao, Shaowei
Tan, Wulin
Qiu, Zeting
Zhou, Huaqiang
Zhao, Yue
author_sort Song, Yiyan
collection PubMed
description Background: Prognosis prediction is indispensable in clinical practice and machine learning has been proved to be helpful. We expected to predict survival of pancreatic neuroendocrine tumors (PNETs) with machine learning, and compared it with the American Joint Committee on Cancer (AJCC) staging system. Methods: Data of PNETs cases were extracted from The Surveillance, Epidemiology, and End Result (SEER) database. Statistic description, multivariate survival analysis and preprocessing were done before machine learning. Four different algorithms (logistic regression (LR), support vector machines (SVM), random forest (RF) and deep learning (DL)) were used to train the model. We used proper imputations to manage missing data in the database and sensitive analysis was performed to evaluate the imputation. The model with the best predictive accuracy was compared with the AJCC staging system using the SEER cases. Results: The four models had similar predictive accuracy with no significant difference existed (p = 0.664). The DL model showed a slightly better predictive accuracy than others (81.6% (± 1.9%)), thus it was used for further comparison with the AJCC staging system and revealed a better performance for PNETs cases in SEER database (Area under receiver operating characteristic curve: 0.87 vs 0.76). The validity of missing data imputation was supported by sensitivity analysis. Conclusions: The models developed with machine learning performed well in survival prediction of PNETs, and the DL model have a better accuracy and specificity than the AJCC staging system in SEER data. The DL model has potential for clinical application but external validation is needed.
format Online
Article
Text
id pubmed-6218767
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Ivyspring International Publisher
record_format MEDLINE/PubMed
spelling pubmed-62187672018-11-08 Multiple Machine Learnings Revealed Similar Predictive Accuracy for Prognosis of PNETs from the Surveillance, Epidemiology, and End Result Database Song, Yiyan Gao, Shaowei Tan, Wulin Qiu, Zeting Zhou, Huaqiang Zhao, Yue J Cancer Research Paper Background: Prognosis prediction is indispensable in clinical practice and machine learning has been proved to be helpful. We expected to predict survival of pancreatic neuroendocrine tumors (PNETs) with machine learning, and compared it with the American Joint Committee on Cancer (AJCC) staging system. Methods: Data of PNETs cases were extracted from The Surveillance, Epidemiology, and End Result (SEER) database. Statistic description, multivariate survival analysis and preprocessing were done before machine learning. Four different algorithms (logistic regression (LR), support vector machines (SVM), random forest (RF) and deep learning (DL)) were used to train the model. We used proper imputations to manage missing data in the database and sensitive analysis was performed to evaluate the imputation. The model with the best predictive accuracy was compared with the AJCC staging system using the SEER cases. Results: The four models had similar predictive accuracy with no significant difference existed (p = 0.664). The DL model showed a slightly better predictive accuracy than others (81.6% (± 1.9%)), thus it was used for further comparison with the AJCC staging system and revealed a better performance for PNETs cases in SEER database (Area under receiver operating characteristic curve: 0.87 vs 0.76). The validity of missing data imputation was supported by sensitivity analysis. Conclusions: The models developed with machine learning performed well in survival prediction of PNETs, and the DL model have a better accuracy and specificity than the AJCC staging system in SEER data. The DL model has potential for clinical application but external validation is needed. Ivyspring International Publisher 2018-10-10 /pmc/articles/PMC6218767/ /pubmed/30410601 http://dx.doi.org/10.7150/jca.26649 Text en © Ivyspring International Publisher This is an open access article distributed under the terms of the Creative Commons Attribution (CC BY-NC) license (https://creativecommons.org/licenses/by-nc/4.0/). See http://ivyspring.com/terms for full terms and conditions.
spellingShingle Research Paper
Song, Yiyan
Gao, Shaowei
Tan, Wulin
Qiu, Zeting
Zhou, Huaqiang
Zhao, Yue
Multiple Machine Learnings Revealed Similar Predictive Accuracy for Prognosis of PNETs from the Surveillance, Epidemiology, and End Result Database
title Multiple Machine Learnings Revealed Similar Predictive Accuracy for Prognosis of PNETs from the Surveillance, Epidemiology, and End Result Database
title_full Multiple Machine Learnings Revealed Similar Predictive Accuracy for Prognosis of PNETs from the Surveillance, Epidemiology, and End Result Database
title_fullStr Multiple Machine Learnings Revealed Similar Predictive Accuracy for Prognosis of PNETs from the Surveillance, Epidemiology, and End Result Database
title_full_unstemmed Multiple Machine Learnings Revealed Similar Predictive Accuracy for Prognosis of PNETs from the Surveillance, Epidemiology, and End Result Database
title_short Multiple Machine Learnings Revealed Similar Predictive Accuracy for Prognosis of PNETs from the Surveillance, Epidemiology, and End Result Database
title_sort multiple machine learnings revealed similar predictive accuracy for prognosis of pnets from the surveillance, epidemiology, and end result database
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6218767/
https://www.ncbi.nlm.nih.gov/pubmed/30410601
http://dx.doi.org/10.7150/jca.26649
work_keys_str_mv AT songyiyan multiplemachinelearningsrevealedsimilarpredictiveaccuracyforprognosisofpnetsfromthesurveillanceepidemiologyandendresultdatabase
AT gaoshaowei multiplemachinelearningsrevealedsimilarpredictiveaccuracyforprognosisofpnetsfromthesurveillanceepidemiologyandendresultdatabase
AT tanwulin multiplemachinelearningsrevealedsimilarpredictiveaccuracyforprognosisofpnetsfromthesurveillanceepidemiologyandendresultdatabase
AT qiuzeting multiplemachinelearningsrevealedsimilarpredictiveaccuracyforprognosisofpnetsfromthesurveillanceepidemiologyandendresultdatabase
AT zhouhuaqiang multiplemachinelearningsrevealedsimilarpredictiveaccuracyforprognosisofpnetsfromthesurveillanceepidemiologyandendresultdatabase
AT zhaoyue multiplemachinelearningsrevealedsimilarpredictiveaccuracyforprognosisofpnetsfromthesurveillanceepidemiologyandendresultdatabase