Cargando…
Multiple Machine Learnings Revealed Similar Predictive Accuracy for Prognosis of PNETs from the Surveillance, Epidemiology, and End Result Database
Background: Prognosis prediction is indispensable in clinical practice and machine learning has been proved to be helpful. We expected to predict survival of pancreatic neuroendocrine tumors (PNETs) with machine learning, and compared it with the American Joint Committee on Cancer (AJCC) staging sys...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Ivyspring International Publisher
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6218767/ https://www.ncbi.nlm.nih.gov/pubmed/30410601 http://dx.doi.org/10.7150/jca.26649 |
_version_ | 1783368525173227520 |
---|---|
author | Song, Yiyan Gao, Shaowei Tan, Wulin Qiu, Zeting Zhou, Huaqiang Zhao, Yue |
author_facet | Song, Yiyan Gao, Shaowei Tan, Wulin Qiu, Zeting Zhou, Huaqiang Zhao, Yue |
author_sort | Song, Yiyan |
collection | PubMed |
description | Background: Prognosis prediction is indispensable in clinical practice and machine learning has been proved to be helpful. We expected to predict survival of pancreatic neuroendocrine tumors (PNETs) with machine learning, and compared it with the American Joint Committee on Cancer (AJCC) staging system. Methods: Data of PNETs cases were extracted from The Surveillance, Epidemiology, and End Result (SEER) database. Statistic description, multivariate survival analysis and preprocessing were done before machine learning. Four different algorithms (logistic regression (LR), support vector machines (SVM), random forest (RF) and deep learning (DL)) were used to train the model. We used proper imputations to manage missing data in the database and sensitive analysis was performed to evaluate the imputation. The model with the best predictive accuracy was compared with the AJCC staging system using the SEER cases. Results: The four models had similar predictive accuracy with no significant difference existed (p = 0.664). The DL model showed a slightly better predictive accuracy than others (81.6% (± 1.9%)), thus it was used for further comparison with the AJCC staging system and revealed a better performance for PNETs cases in SEER database (Area under receiver operating characteristic curve: 0.87 vs 0.76). The validity of missing data imputation was supported by sensitivity analysis. Conclusions: The models developed with machine learning performed well in survival prediction of PNETs, and the DL model have a better accuracy and specificity than the AJCC staging system in SEER data. The DL model has potential for clinical application but external validation is needed. |
format | Online Article Text |
id | pubmed-6218767 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Ivyspring International Publisher |
record_format | MEDLINE/PubMed |
spelling | pubmed-62187672018-11-08 Multiple Machine Learnings Revealed Similar Predictive Accuracy for Prognosis of PNETs from the Surveillance, Epidemiology, and End Result Database Song, Yiyan Gao, Shaowei Tan, Wulin Qiu, Zeting Zhou, Huaqiang Zhao, Yue J Cancer Research Paper Background: Prognosis prediction is indispensable in clinical practice and machine learning has been proved to be helpful. We expected to predict survival of pancreatic neuroendocrine tumors (PNETs) with machine learning, and compared it with the American Joint Committee on Cancer (AJCC) staging system. Methods: Data of PNETs cases were extracted from The Surveillance, Epidemiology, and End Result (SEER) database. Statistic description, multivariate survival analysis and preprocessing were done before machine learning. Four different algorithms (logistic regression (LR), support vector machines (SVM), random forest (RF) and deep learning (DL)) were used to train the model. We used proper imputations to manage missing data in the database and sensitive analysis was performed to evaluate the imputation. The model with the best predictive accuracy was compared with the AJCC staging system using the SEER cases. Results: The four models had similar predictive accuracy with no significant difference existed (p = 0.664). The DL model showed a slightly better predictive accuracy than others (81.6% (± 1.9%)), thus it was used for further comparison with the AJCC staging system and revealed a better performance for PNETs cases in SEER database (Area under receiver operating characteristic curve: 0.87 vs 0.76). The validity of missing data imputation was supported by sensitivity analysis. Conclusions: The models developed with machine learning performed well in survival prediction of PNETs, and the DL model have a better accuracy and specificity than the AJCC staging system in SEER data. The DL model has potential for clinical application but external validation is needed. Ivyspring International Publisher 2018-10-10 /pmc/articles/PMC6218767/ /pubmed/30410601 http://dx.doi.org/10.7150/jca.26649 Text en © Ivyspring International Publisher This is an open access article distributed under the terms of the Creative Commons Attribution (CC BY-NC) license (https://creativecommons.org/licenses/by-nc/4.0/). See http://ivyspring.com/terms for full terms and conditions. |
spellingShingle | Research Paper Song, Yiyan Gao, Shaowei Tan, Wulin Qiu, Zeting Zhou, Huaqiang Zhao, Yue Multiple Machine Learnings Revealed Similar Predictive Accuracy for Prognosis of PNETs from the Surveillance, Epidemiology, and End Result Database |
title | Multiple Machine Learnings Revealed Similar Predictive Accuracy for Prognosis of PNETs from the Surveillance, Epidemiology, and End Result Database |
title_full | Multiple Machine Learnings Revealed Similar Predictive Accuracy for Prognosis of PNETs from the Surveillance, Epidemiology, and End Result Database |
title_fullStr | Multiple Machine Learnings Revealed Similar Predictive Accuracy for Prognosis of PNETs from the Surveillance, Epidemiology, and End Result Database |
title_full_unstemmed | Multiple Machine Learnings Revealed Similar Predictive Accuracy for Prognosis of PNETs from the Surveillance, Epidemiology, and End Result Database |
title_short | Multiple Machine Learnings Revealed Similar Predictive Accuracy for Prognosis of PNETs from the Surveillance, Epidemiology, and End Result Database |
title_sort | multiple machine learnings revealed similar predictive accuracy for prognosis of pnets from the surveillance, epidemiology, and end result database |
topic | Research Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6218767/ https://www.ncbi.nlm.nih.gov/pubmed/30410601 http://dx.doi.org/10.7150/jca.26649 |
work_keys_str_mv | AT songyiyan multiplemachinelearningsrevealedsimilarpredictiveaccuracyforprognosisofpnetsfromthesurveillanceepidemiologyandendresultdatabase AT gaoshaowei multiplemachinelearningsrevealedsimilarpredictiveaccuracyforprognosisofpnetsfromthesurveillanceepidemiologyandendresultdatabase AT tanwulin multiplemachinelearningsrevealedsimilarpredictiveaccuracyforprognosisofpnetsfromthesurveillanceepidemiologyandendresultdatabase AT qiuzeting multiplemachinelearningsrevealedsimilarpredictiveaccuracyforprognosisofpnetsfromthesurveillanceepidemiologyandendresultdatabase AT zhouhuaqiang multiplemachinelearningsrevealedsimilarpredictiveaccuracyforprognosisofpnetsfromthesurveillanceepidemiologyandendresultdatabase AT zhaoyue multiplemachinelearningsrevealedsimilarpredictiveaccuracyforprognosisofpnetsfromthesurveillanceepidemiologyandendresultdatabase |