Cargando…

A novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study

OBJECTIVE: To construct a novel tumor-node-morphology (TNMor) staging system derived from natural language processing (NLP) of pathology reports to predict outcomes of pancreatic ductal adenocarcinoma. METHOD: This retrospective study with 1657 participants was based on a large referral center and T...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Bo, Wang, Beilei, Zhuang, Pengjie, Cao, Hongwei, Wu, Shengyong, Tan, Zhendong, Gao, Suizhi, Li, Penghao, Jing, Wei, Shao, Zhuo, Zheng, Kailian, Wu, Lele, Gao, Bai, Wang, Yang, Jiang, Hui, Guo, Shiwei, He, Liang, Yang, Yan, Jin, Gang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Lippincott Williams & Wilkins 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10651292/
https://www.ncbi.nlm.nih.gov/pubmed/37578452
http://dx.doi.org/10.1097/JS9.0000000000000648
_version_ 1785147622038700032
author Li, Bo
Wang, Beilei
Zhuang, Pengjie
Cao, Hongwei
Wu, Shengyong
Tan, Zhendong
Gao, Suizhi
Li, Penghao
Jing, Wei
Shao, Zhuo
Zheng, Kailian
Wu, Lele
Gao, Bai
Wang, Yang
Jiang, Hui
Guo, Shiwei
He, Liang
Yang, Yan
Jin, Gang
author_facet Li, Bo
Wang, Beilei
Zhuang, Pengjie
Cao, Hongwei
Wu, Shengyong
Tan, Zhendong
Gao, Suizhi
Li, Penghao
Jing, Wei
Shao, Zhuo
Zheng, Kailian
Wu, Lele
Gao, Bai
Wang, Yang
Jiang, Hui
Guo, Shiwei
He, Liang
Yang, Yan
Jin, Gang
author_sort Li, Bo
collection PubMed
description OBJECTIVE: To construct a novel tumor-node-morphology (TNMor) staging system derived from natural language processing (NLP) of pathology reports to predict outcomes of pancreatic ductal adenocarcinoma. METHOD: This retrospective study with 1657 participants was based on a large referral center and The Cancer Genome Atlas Program (TCGA) dataset. In the training cohort, NLP was used to extract and screen prognostic predictors from pathology reports to develop the TNMor system, which was further evaluated with the tumor-node-metastasis (TNM) system in the internal and external validation cohort, respectively. Main outcomes were evaluated by the log-rank test of Kaplan–Meier curves, the concordance index (C-index), and the area under the receiver operating curve (AUC). RESULTS: The precision, recall, and F1 scores of the NLP model were 88.83, 89.89, and 89.21%, respectively. In Kaplan–Meier analysis, survival differences between stages in the TNMor system were more significant than that in the TNM system. In addition, our system provided an improved C-index (internal validation, 0.58 vs. 0.54, P<0.001; external validation, 0.64 vs. 0.63, P<0.001), and higher AUCs for 1, 2, and 3-year survival (internal validation: 0.62 vs. 0.54, P<0.001; 0.64 vs. 0.60, P=0.017; 0.69 vs. 0.62, P=0.001; external validation: 0.69 vs. 0.65, P=0.098; 0.68 vs. 0.64, P=0.154; 0.64 vs. 0.55, P=0.032, respectively). Finally, our system was particularly beneficial for precise stratification of patients receiving adjuvant therapy, with an improved C-index (0.61 vs. 0.57, P<0.001), and higher AUCs for 1-year, 2-year, and 3-year survival (0.64 vs. 0.57, P<0.001; 0.64 vs. 0.58, P<0.001; 0.67 vs. 0.61, P<0.001; respectively) compared with the TNM system. CONCLUSION: These findings suggest that the TNMor system performed better than the TNM system in predicting pancreatic ductal adenocarcinoma prognosis. It is a promising system to screen risk-adjusted strategies for precision medicine.
format Online
Article
Text
id pubmed-10651292
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Lippincott Williams & Wilkins
record_format MEDLINE/PubMed
spelling pubmed-106512922023-11-15 A novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study Li, Bo Wang, Beilei Zhuang, Pengjie Cao, Hongwei Wu, Shengyong Tan, Zhendong Gao, Suizhi Li, Penghao Jing, Wei Shao, Zhuo Zheng, Kailian Wu, Lele Gao, Bai Wang, Yang Jiang, Hui Guo, Shiwei He, Liang Yang, Yan Jin, Gang Int J Surg Original Research OBJECTIVE: To construct a novel tumor-node-morphology (TNMor) staging system derived from natural language processing (NLP) of pathology reports to predict outcomes of pancreatic ductal adenocarcinoma. METHOD: This retrospective study with 1657 participants was based on a large referral center and The Cancer Genome Atlas Program (TCGA) dataset. In the training cohort, NLP was used to extract and screen prognostic predictors from pathology reports to develop the TNMor system, which was further evaluated with the tumor-node-metastasis (TNM) system in the internal and external validation cohort, respectively. Main outcomes were evaluated by the log-rank test of Kaplan–Meier curves, the concordance index (C-index), and the area under the receiver operating curve (AUC). RESULTS: The precision, recall, and F1 scores of the NLP model were 88.83, 89.89, and 89.21%, respectively. In Kaplan–Meier analysis, survival differences between stages in the TNMor system were more significant than that in the TNM system. In addition, our system provided an improved C-index (internal validation, 0.58 vs. 0.54, P<0.001; external validation, 0.64 vs. 0.63, P<0.001), and higher AUCs for 1, 2, and 3-year survival (internal validation: 0.62 vs. 0.54, P<0.001; 0.64 vs. 0.60, P=0.017; 0.69 vs. 0.62, P=0.001; external validation: 0.69 vs. 0.65, P=0.098; 0.68 vs. 0.64, P=0.154; 0.64 vs. 0.55, P=0.032, respectively). Finally, our system was particularly beneficial for precise stratification of patients receiving adjuvant therapy, with an improved C-index (0.61 vs. 0.57, P<0.001), and higher AUCs for 1-year, 2-year, and 3-year survival (0.64 vs. 0.57, P<0.001; 0.64 vs. 0.58, P<0.001; 0.67 vs. 0.61, P<0.001; respectively) compared with the TNM system. CONCLUSION: These findings suggest that the TNMor system performed better than the TNM system in predicting pancreatic ductal adenocarcinoma prognosis. It is a promising system to screen risk-adjusted strategies for precision medicine. Lippincott Williams & Wilkins 2023-08-11 /pmc/articles/PMC10651292/ /pubmed/37578452 http://dx.doi.org/10.1097/JS9.0000000000000648 Text en Copyright © 2023 The Author(s). Published by Wolters Kluwer Health, Inc. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution-Non Commercial-No Derivatives License 4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) (CCBY-NC-ND), where it is permissible to download and share the work provided it is properly cited. The work cannot be changed in any way or used commercially without permission from the journal. http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/)
spellingShingle Original Research
Li, Bo
Wang, Beilei
Zhuang, Pengjie
Cao, Hongwei
Wu, Shengyong
Tan, Zhendong
Gao, Suizhi
Li, Penghao
Jing, Wei
Shao, Zhuo
Zheng, Kailian
Wu, Lele
Gao, Bai
Wang, Yang
Jiang, Hui
Guo, Shiwei
He, Liang
Yang, Yan
Jin, Gang
A novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study
title A novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study
title_full A novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study
title_fullStr A novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study
title_full_unstemmed A novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study
title_short A novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study
title_sort novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10651292/
https://www.ncbi.nlm.nih.gov/pubmed/37578452
http://dx.doi.org/10.1097/JS9.0000000000000648
work_keys_str_mv AT libo anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT wangbeilei anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT zhuangpengjie anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT caohongwei anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT wushengyong anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT tanzhendong anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT gaosuizhi anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT lipenghao anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT jingwei anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT shaozhuo anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT zhengkailian anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT wulele anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT gaobai anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT wangyang anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT jianghui anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT guoshiwei anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT heliang anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT yangyan anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT jingang anovelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT libo novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT wangbeilei novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT zhuangpengjie novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT caohongwei novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT wushengyong novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT tanzhendong novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT gaosuizhi novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT lipenghao novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT jingwei novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT shaozhuo novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT zhengkailian novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT wulele novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT gaobai novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT wangyang novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT jianghui novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT guoshiwei novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT heliang novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT yangyan novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy
AT jingang novelstagingsystemderivedfromnaturallanguageprocessingofpathologyreportstopredictprognosticoutcomesofpancreaticcanceraretrospectivecohortstudy