Cargando…

Artificial Intelligence Combined With Big Data to Predict Lymph Node Involvement in Prostate Cancer: A Population-Based Study

BACKGROUND: A more accurate preoperative prediction of lymph node involvement (LNI) in prostate cancer (PCa) would improve clinical treatment and follow-up strategies of this disease. We developed a predictive model based on machine learning (ML) combined with big data to achieve this. METHODS: Clin...

Descripción completa

Detalles Bibliográficos
Autores principales: Wei, Liwei, Huang, Yongdi, Chen, Zheng, Lei, Hongyu, Qin, Xiaoping, Cui, Lihong, Zhuo, Yumin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8551611/
https://www.ncbi.nlm.nih.gov/pubmed/34722318
http://dx.doi.org/10.3389/fonc.2021.763381
_version_ 1784591196906061824
author Wei, Liwei
Huang, Yongdi
Chen, Zheng
Lei, Hongyu
Qin, Xiaoping
Cui, Lihong
Zhuo, Yumin
author_facet Wei, Liwei
Huang, Yongdi
Chen, Zheng
Lei, Hongyu
Qin, Xiaoping
Cui, Lihong
Zhuo, Yumin
author_sort Wei, Liwei
collection PubMed
description BACKGROUND: A more accurate preoperative prediction of lymph node involvement (LNI) in prostate cancer (PCa) would improve clinical treatment and follow-up strategies of this disease. We developed a predictive model based on machine learning (ML) combined with big data to achieve this. METHODS: Clinicopathological characteristics of 2,884 PCa patients who underwent extended pelvic lymph node dissection (ePLND) were collected from the U.S. National Cancer Institute’s Surveillance, Epidemiology, and End Results (SEER) database from 2010 to 2015. Eight variables were included to establish an ML model. Model performance was evaluated by the receiver operating characteristic (ROC) curves and calibration plots for predictive accuracy. Decision curve analysis (DCA) and cutoff values were obtained to estimate its clinical utility. RESULTS: Three hundred and forty-four (11.9%) patients were identified with LNI. The five most important factors were the Gleason score, T stage of disease, percentage of positive cores, tumor size, and prostate-specific antigen levels with 158, 137, 128, 113, and 88 points, respectively. The XGBoost (XGB) model showed the best predictive performance and had the highest net benefit when compared with the other algorithms, achieving an area under the curve of 0.883. With a 5%~20% cutoff value, the XGB model performed best in reducing omissions and avoiding overtreatment of patients when dealing with LNI. This model also had a lower false-negative rate and a higher percentage of ePLND was avoided. In addition, DCA showed it has the highest net benefit across the whole range of threshold probabilities. CONCLUSIONS: We established an ML model based on big data for predicting LNI in PCa, and it could lead to a reduction of approximately 50% of ePLND cases. In addition, only ≤3% of patients were misdiagnosed with a cutoff value ranging from 5% to 20%. This promising study warrants further validation by using a larger prospective dataset.
format Online
Article
Text
id pubmed-8551611
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-85516112021-10-29 Artificial Intelligence Combined With Big Data to Predict Lymph Node Involvement in Prostate Cancer: A Population-Based Study Wei, Liwei Huang, Yongdi Chen, Zheng Lei, Hongyu Qin, Xiaoping Cui, Lihong Zhuo, Yumin Front Oncol Oncology BACKGROUND: A more accurate preoperative prediction of lymph node involvement (LNI) in prostate cancer (PCa) would improve clinical treatment and follow-up strategies of this disease. We developed a predictive model based on machine learning (ML) combined with big data to achieve this. METHODS: Clinicopathological characteristics of 2,884 PCa patients who underwent extended pelvic lymph node dissection (ePLND) were collected from the U.S. National Cancer Institute’s Surveillance, Epidemiology, and End Results (SEER) database from 2010 to 2015. Eight variables were included to establish an ML model. Model performance was evaluated by the receiver operating characteristic (ROC) curves and calibration plots for predictive accuracy. Decision curve analysis (DCA) and cutoff values were obtained to estimate its clinical utility. RESULTS: Three hundred and forty-four (11.9%) patients were identified with LNI. The five most important factors were the Gleason score, T stage of disease, percentage of positive cores, tumor size, and prostate-specific antigen levels with 158, 137, 128, 113, and 88 points, respectively. The XGBoost (XGB) model showed the best predictive performance and had the highest net benefit when compared with the other algorithms, achieving an area under the curve of 0.883. With a 5%~20% cutoff value, the XGB model performed best in reducing omissions and avoiding overtreatment of patients when dealing with LNI. This model also had a lower false-negative rate and a higher percentage of ePLND was avoided. In addition, DCA showed it has the highest net benefit across the whole range of threshold probabilities. CONCLUSIONS: We established an ML model based on big data for predicting LNI in PCa, and it could lead to a reduction of approximately 50% of ePLND cases. In addition, only ≤3% of patients were misdiagnosed with a cutoff value ranging from 5% to 20%. This promising study warrants further validation by using a larger prospective dataset. Frontiers Media S.A. 2021-10-14 /pmc/articles/PMC8551611/ /pubmed/34722318 http://dx.doi.org/10.3389/fonc.2021.763381 Text en Copyright © 2021 Wei, Huang, Chen, Lei, Qin, Cui and Zhuo https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Oncology
Wei, Liwei
Huang, Yongdi
Chen, Zheng
Lei, Hongyu
Qin, Xiaoping
Cui, Lihong
Zhuo, Yumin
Artificial Intelligence Combined With Big Data to Predict Lymph Node Involvement in Prostate Cancer: A Population-Based Study
title Artificial Intelligence Combined With Big Data to Predict Lymph Node Involvement in Prostate Cancer: A Population-Based Study
title_full Artificial Intelligence Combined With Big Data to Predict Lymph Node Involvement in Prostate Cancer: A Population-Based Study
title_fullStr Artificial Intelligence Combined With Big Data to Predict Lymph Node Involvement in Prostate Cancer: A Population-Based Study
title_full_unstemmed Artificial Intelligence Combined With Big Data to Predict Lymph Node Involvement in Prostate Cancer: A Population-Based Study
title_short Artificial Intelligence Combined With Big Data to Predict Lymph Node Involvement in Prostate Cancer: A Population-Based Study
title_sort artificial intelligence combined with big data to predict lymph node involvement in prostate cancer: a population-based study
topic Oncology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8551611/
https://www.ncbi.nlm.nih.gov/pubmed/34722318
http://dx.doi.org/10.3389/fonc.2021.763381
work_keys_str_mv AT weiliwei artificialintelligencecombinedwithbigdatatopredictlymphnodeinvolvementinprostatecancerapopulationbasedstudy
AT huangyongdi artificialintelligencecombinedwithbigdatatopredictlymphnodeinvolvementinprostatecancerapopulationbasedstudy
AT chenzheng artificialintelligencecombinedwithbigdatatopredictlymphnodeinvolvementinprostatecancerapopulationbasedstudy
AT leihongyu artificialintelligencecombinedwithbigdatatopredictlymphnodeinvolvementinprostatecancerapopulationbasedstudy
AT qinxiaoping artificialintelligencecombinedwithbigdatatopredictlymphnodeinvolvementinprostatecancerapopulationbasedstudy
AT cuilihong artificialintelligencecombinedwithbigdatatopredictlymphnodeinvolvementinprostatecancerapopulationbasedstudy
AT zhuoyumin artificialintelligencecombinedwithbigdatatopredictlymphnodeinvolvementinprostatecancerapopulationbasedstudy