Cargando…

A deep learning and natural language processing-based system for automatic identification and surveillance of high-risk patients undergoing upper endoscopy: A multicenter study

BACKGROUND: Timely identification and regular surveillance of patients at high risk are crucial for early diagnosis of upper gastrointestinal cancer. However, traditional manual surveillance method is time-consuming, and current surveillance rate is below 50%. Here, we aimed to develop a surveillanc...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Jia, Hu, Shan, Shi, Conghui, Dong, Zehua, Pan, Jie, Ai, Yaowei, Liu, Jun, Zhou, Wei, Deng, Yunchao, Li, Yanxia, Yuan, Jingping, Zeng, Zhi, Wu, Lianlian, Yu, Honggang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9716327/
https://www.ncbi.nlm.nih.gov/pubmed/36467456
http://dx.doi.org/10.1016/j.eclinm.2022.101704
_version_ 1784842663855390720
author Li, Jia
Hu, Shan
Shi, Conghui
Dong, Zehua
Pan, Jie
Ai, Yaowei
Liu, Jun
Zhou, Wei
Deng, Yunchao
Li, Yanxia
Yuan, Jingping
Zeng, Zhi
Wu, Lianlian
Yu, Honggang
author_facet Li, Jia
Hu, Shan
Shi, Conghui
Dong, Zehua
Pan, Jie
Ai, Yaowei
Liu, Jun
Zhou, Wei
Deng, Yunchao
Li, Yanxia
Yuan, Jingping
Zeng, Zhi
Wu, Lianlian
Yu, Honggang
author_sort Li, Jia
collection PubMed
description BACKGROUND: Timely identification and regular surveillance of patients at high risk are crucial for early diagnosis of upper gastrointestinal cancer. However, traditional manual surveillance method is time-consuming, and current surveillance rate is below 50%. Here, we aimed to develop a surveillance system named ENDOANGEL-AS (automatic surveillance) for automatic identification and surveillance of high-risk patients. METHODS: 7874 patients from Renmin Hospital of Wuhan University between May 1 and July 31, 2021 were used as the training set, 6762 patients between August 1 and October 31, 2021 as the internal test set, and 7570 patients from two other hospitals between August 1 and October 31, 2021 as the external test sets. We first extracted descriptions of abnormalities from endoscopic and pathological reports based on natural language processing techniques to identify individuals. Then patients were classified at nine risk levels according to endoscopic and pathological findings, and a deep learning model was trained to identify demarcation line (DL) in gastric low-grade intraepithelial neoplasia (LGIN) using 1561 white-light still images for risk stratification of gastric LGIN. Finally, patients undergoing upper endoscopy were classified and assigned one of ten surveillance intervals according to guidelines. The performance of ENDOANGEL-AS was evaluated and compared with physicians. FINDINGS: Patient identification module achieved an accuracy of 100% and 99.91% in internal and external test sets, respectively. Risk level classification module achieved an accuracy of 100% and 99.85% in the internal and external test sets, respectively. DL identification module achieved an accuracy of 87.88%. ENDOANGEL-AS on surveillance interval assignment achieved an accuracy of 99.23% and 99.67% in internal and external test sets, respectively. ENDOANGEL-AS had significantly higher accuracy compared with physicians (99.00% vs 38.87%, p < 0.001). The accuracy (63.67%, p < 0.001) of endoscopists with the assistance of ENDOANGEL-AS was significantly improved. INTERPRETATION: We established a surveillance system that can automatically identify patients and assign surveillance intervals with high accuracy and good transferability. FUNDING: This work was partly supported by a grant from the Hubei Province Major Science and Technology Innovation Project (2018-916-000-008) and the Fundamental Research Funds for the Central Universities (2042021kf0084).
format Online
Article
Text
id pubmed-9716327
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-97163272022-12-03 A deep learning and natural language processing-based system for automatic identification and surveillance of high-risk patients undergoing upper endoscopy: A multicenter study Li, Jia Hu, Shan Shi, Conghui Dong, Zehua Pan, Jie Ai, Yaowei Liu, Jun Zhou, Wei Deng, Yunchao Li, Yanxia Yuan, Jingping Zeng, Zhi Wu, Lianlian Yu, Honggang eClinicalMedicine Articles BACKGROUND: Timely identification and regular surveillance of patients at high risk are crucial for early diagnosis of upper gastrointestinal cancer. However, traditional manual surveillance method is time-consuming, and current surveillance rate is below 50%. Here, we aimed to develop a surveillance system named ENDOANGEL-AS (automatic surveillance) for automatic identification and surveillance of high-risk patients. METHODS: 7874 patients from Renmin Hospital of Wuhan University between May 1 and July 31, 2021 were used as the training set, 6762 patients between August 1 and October 31, 2021 as the internal test set, and 7570 patients from two other hospitals between August 1 and October 31, 2021 as the external test sets. We first extracted descriptions of abnormalities from endoscopic and pathological reports based on natural language processing techniques to identify individuals. Then patients were classified at nine risk levels according to endoscopic and pathological findings, and a deep learning model was trained to identify demarcation line (DL) in gastric low-grade intraepithelial neoplasia (LGIN) using 1561 white-light still images for risk stratification of gastric LGIN. Finally, patients undergoing upper endoscopy were classified and assigned one of ten surveillance intervals according to guidelines. The performance of ENDOANGEL-AS was evaluated and compared with physicians. FINDINGS: Patient identification module achieved an accuracy of 100% and 99.91% in internal and external test sets, respectively. Risk level classification module achieved an accuracy of 100% and 99.85% in the internal and external test sets, respectively. DL identification module achieved an accuracy of 87.88%. ENDOANGEL-AS on surveillance interval assignment achieved an accuracy of 99.23% and 99.67% in internal and external test sets, respectively. ENDOANGEL-AS had significantly higher accuracy compared with physicians (99.00% vs 38.87%, p < 0.001). The accuracy (63.67%, p < 0.001) of endoscopists with the assistance of ENDOANGEL-AS was significantly improved. INTERPRETATION: We established a surveillance system that can automatically identify patients and assign surveillance intervals with high accuracy and good transferability. FUNDING: This work was partly supported by a grant from the Hubei Province Major Science and Technology Innovation Project (2018-916-000-008) and the Fundamental Research Funds for the Central Universities (2042021kf0084). Elsevier 2022-10-31 /pmc/articles/PMC9716327/ /pubmed/36467456 http://dx.doi.org/10.1016/j.eclinm.2022.101704 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Articles
Li, Jia
Hu, Shan
Shi, Conghui
Dong, Zehua
Pan, Jie
Ai, Yaowei
Liu, Jun
Zhou, Wei
Deng, Yunchao
Li, Yanxia
Yuan, Jingping
Zeng, Zhi
Wu, Lianlian
Yu, Honggang
A deep learning and natural language processing-based system for automatic identification and surveillance of high-risk patients undergoing upper endoscopy: A multicenter study
title A deep learning and natural language processing-based system for automatic identification and surveillance of high-risk patients undergoing upper endoscopy: A multicenter study
title_full A deep learning and natural language processing-based system for automatic identification and surveillance of high-risk patients undergoing upper endoscopy: A multicenter study
title_fullStr A deep learning and natural language processing-based system for automatic identification and surveillance of high-risk patients undergoing upper endoscopy: A multicenter study
title_full_unstemmed A deep learning and natural language processing-based system for automatic identification and surveillance of high-risk patients undergoing upper endoscopy: A multicenter study
title_short A deep learning and natural language processing-based system for automatic identification and surveillance of high-risk patients undergoing upper endoscopy: A multicenter study
title_sort deep learning and natural language processing-based system for automatic identification and surveillance of high-risk patients undergoing upper endoscopy: a multicenter study
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9716327/
https://www.ncbi.nlm.nih.gov/pubmed/36467456
http://dx.doi.org/10.1016/j.eclinm.2022.101704
work_keys_str_mv AT lijia adeeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT hushan adeeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT shiconghui adeeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT dongzehua adeeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT panjie adeeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT aiyaowei adeeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT liujun adeeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT zhouwei adeeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT dengyunchao adeeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT liyanxia adeeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT yuanjingping adeeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT zengzhi adeeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT wulianlian adeeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT yuhonggang adeeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT lijia deeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT hushan deeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT shiconghui deeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT dongzehua deeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT panjie deeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT aiyaowei deeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT liujun deeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT zhouwei deeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT dengyunchao deeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT liyanxia deeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT yuanjingping deeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT zengzhi deeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT wulianlian deeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy
AT yuhonggang deeplearningandnaturallanguageprocessingbasedsystemforautomaticidentificationandsurveillanceofhighriskpatientsundergoingupperendoscopyamulticenterstudy