Cargando…
A novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study
OBJECTIVE: To construct a novel tumor-node-morphology (TNMor) staging system derived from natural language processing (NLP) of pathology reports to predict outcomes of pancreatic ductal adenocarcinoma. METHOD: This retrospective study with 1657 participants was based on a large referral center and T...
Autores principales: | , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Lippincott Williams & Wilkins
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10651292/ https://www.ncbi.nlm.nih.gov/pubmed/37578452 http://dx.doi.org/10.1097/JS9.0000000000000648 |
_version_ | 1785147622038700032 |
---|---|
author | Li, Bo Wang, Beilei Zhuang, Pengjie Cao, Hongwei Wu, Shengyong Tan, Zhendong Gao, Suizhi Li, Penghao Jing, Wei Shao, Zhuo Zheng, Kailian Wu, Lele Gao, Bai Wang, Yang Jiang, Hui Guo, Shiwei He, Liang Yang, Yan Jin, Gang |
author_facet | Li, Bo Wang, Beilei Zhuang, Pengjie Cao, Hongwei Wu, Shengyong Tan, Zhendong Gao, Suizhi Li, Penghao Jing, Wei Shao, Zhuo Zheng, Kailian Wu, Lele Gao, Bai Wang, Yang Jiang, Hui Guo, Shiwei He, Liang Yang, Yan Jin, Gang |
author_sort | Li, Bo |
collection | PubMed |
description | OBJECTIVE: To construct a novel tumor-node-morphology (TNMor) staging system derived from natural language processing (NLP) of pathology reports to predict outcomes of pancreatic ductal adenocarcinoma. METHOD: This retrospective study with 1657 participants was based on a large referral center and The Cancer Genome Atlas Program (TCGA) dataset. In the training cohort, NLP was used to extract and screen prognostic predictors from pathology reports to develop the TNMor system, which was further evaluated with the tumor-node-metastasis (TNM) system in the internal and external validation cohort, respectively. Main outcomes were evaluated by the log-rank test of Kaplan–Meier curves, the concordance index (C-index), and the area under the receiver operating curve (AUC). RESULTS: The precision, recall, and F1 scores of the NLP model were 88.83, 89.89, and 89.21%, respectively. In Kaplan–Meier analysis, survival differences between stages in the TNMor system were more significant than that in the TNM system. In addition, our system provided an improved C-index (internal validation, 0.58 vs. 0.54, P<0.001; external validation, 0.64 vs. 0.63, P<0.001), and higher AUCs for 1, 2, and 3-year survival (internal validation: 0.62 vs. 0.54, P<0.001; 0.64 vs. 0.60, P=0.017; 0.69 vs. 0.62, P=0.001; external validation: 0.69 vs. 0.65, P=0.098; 0.68 vs. 0.64, P=0.154; 0.64 vs. 0.55, P=0.032, respectively). Finally, our system was particularly beneficial for precise stratification of patients receiving adjuvant therapy, with an improved C-index (0.61 vs. 0.57, P<0.001), and higher AUCs for 1-year, 2-year, and 3-year survival (0.64 vs. 0.57, P<0.001; 0.64 vs. 0.58, P<0.001; 0.67 vs. 0.61, P<0.001; respectively) compared with the TNM system. CONCLUSION: These findings suggest that the TNMor system performed better than the TNM system in predicting pancreatic ductal adenocarcinoma prognosis. It is a promising system to screen risk-adjusted strategies for precision medicine. |
format | Online Article Text |
id | pubmed-10651292 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Lippincott Williams & Wilkins |
record_format | MEDLINE/PubMed |
spelling | pubmed-106512922023-11-15 A novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study Li, Bo Wang, Beilei Zhuang, Pengjie Cao, Hongwei Wu, Shengyong Tan, Zhendong Gao, Suizhi Li, Penghao Jing, Wei Shao, Zhuo Zheng, Kailian Wu, Lele Gao, Bai Wang, Yang Jiang, Hui Guo, Shiwei He, Liang Yang, Yan Jin, Gang Int J Surg Original Research OBJECTIVE: To construct a novel tumor-node-morphology (TNMor) staging system derived from natural language processing (NLP) of pathology reports to predict outcomes of pancreatic ductal adenocarcinoma. METHOD: This retrospective study with 1657 participants was based on a large referral center and The Cancer Genome Atlas Program (TCGA) dataset. In the training cohort, NLP was used to extract and screen prognostic predictors from pathology reports to develop the TNMor system, which was further evaluated with the tumor-node-metastasis (TNM) system in the internal and external validation cohort, respectively. Main outcomes were evaluated by the log-rank test of Kaplan–Meier curves, the concordance index (C-index), and the area under the receiver operating curve (AUC). RESULTS: The precision, recall, and F1 scores of the NLP model were 88.83, 89.89, and 89.21%, respectively. In Kaplan–Meier analysis, survival differences between stages in the TNMor system were more significant than that in the TNM system. In addition, our system provided an improved C-index (internal validation, 0.58 vs. 0.54, P<0.001; external validation, 0.64 vs. 0.63, P<0.001), and higher AUCs for 1, 2, and 3-year survival (internal validation: 0.62 vs. 0.54, P<0.001; 0.64 vs. 0.60, P=0.017; 0.69 vs. 0.62, P=0.001; external validation: 0.69 vs. 0.65, P=0.098; 0.68 vs. 0.64, P=0.154; 0.64 vs. 0.55, P=0.032, respectively). Finally, our system was particularly beneficial for precise stratification of patients receiving adjuvant therapy, with an improved C-index (0.61 vs. 0.57, P<0.001), and higher AUCs for 1-year, 2-year, and 3-year survival (0.64 vs. 0.57, P<0.001; 0.64 vs. 0.58, P<0.001; 0.67 vs. 0.61, P<0.001; respectively) compared with the TNM system. CONCLUSION: These findings suggest that the TNMor system performed better than the TNM system in predicting pancreatic ductal adenocarcinoma prognosis. It is a promising system to screen risk-adjusted strategies for precision medicine. Lippincott Williams & Wilkins 2023-08-11 /pmc/articles/PMC10651292/ /pubmed/37578452 http://dx.doi.org/10.1097/JS9.0000000000000648 Text en Copyright © 2023 The Author(s). Published by Wolters Kluwer Health, Inc. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution-Non Commercial-No Derivatives License 4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) (CCBY-NC-ND), where it is permissible to download and share the work provided it is properly cited. The work cannot be changed in any way or used commercially without permission from the journal. http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) |
spellingShingle | Original Research Li, Bo Wang, Beilei Zhuang, Pengjie Cao, Hongwei Wu, Shengyong Tan, Zhendong Gao, Suizhi Li, Penghao Jing, Wei Shao, Zhuo Zheng, Kailian Wu, Lele Gao, Bai Wang, Yang Jiang, Hui Guo, Shiwei He, Liang Yang, Yan Jin, Gang A novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study |
title | A novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study |
title_full | A novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study |
title_fullStr | A novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study |
title_full_unstemmed | A novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study |
title_short | A novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study |
title_sort | novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10651292/ https://www.ncbi.nlm.nih.gov/pubmed/37578452 http://dx.doi.org/10.1097/JS9.0000000000000648 |
work_keys_str_mv | AT libo anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT wangbeilei anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT zhuangpengjie anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT caohongwei anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT wushengyong anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT tanzhendong anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT gaosuizhi anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT lipenghao anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT jingwei anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT shaozhuo anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT zhengkailian anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT wulele anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT gaobai anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT wangyang anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT jianghui anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT guoshiwei anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT heliang anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT yangyan anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT jingang anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT libo novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT wangbeilei novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT zhuangpengjie novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT caohongwei novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT wushengyong novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT tanzhendong novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT gaosuizhi novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT lipenghao novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT jingwei novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT shaozhuo novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT zhengkailian novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT wulele novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT gaobai novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT wangyang novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT jianghui novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT guoshiwei novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT heliang novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT yangyan novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy AT jingang novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy |