Cargando…

Natural Language Processing for Assessing Quality Indicators in Free-Text Colonoscopy and Pathology Reports: Development and Usability Study

BACKGROUND: Manual data extraction of colonoscopy quality indicators is time and labor intensive. Natural language processing (NLP), a computer-based linguistics technique, can automate the extraction of important clinical information, such as adverse events, from unstructured free-text reports. NLP...

Descripción completa

Detalles Bibliográficos
Autores principales: Bae, Jung Ho, Han, Hyun Wook, Yang, Sun Young, Song, Gyuseon, Sa, Soonok, Chung, Goh Eun, Seo, Ji Yeon, Jin, Eun Hyo, Kim, Heecheon, An, DongUk
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9055472/
https://www.ncbi.nlm.nih.gov/pubmed/35436226
http://dx.doi.org/10.2196/35257
_version_ 1784697419900911616
author Bae, Jung Ho
Han, Hyun Wook
Yang, Sun Young
Song, Gyuseon
Sa, Soonok
Chung, Goh Eun
Seo, Ji Yeon
Jin, Eun Hyo
Kim, Heecheon
An, DongUk
author_facet Bae, Jung Ho
Han, Hyun Wook
Yang, Sun Young
Song, Gyuseon
Sa, Soonok
Chung, Goh Eun
Seo, Ji Yeon
Jin, Eun Hyo
Kim, Heecheon
An, DongUk
author_sort Bae, Jung Ho
collection PubMed
description BACKGROUND: Manual data extraction of colonoscopy quality indicators is time and labor intensive. Natural language processing (NLP), a computer-based linguistics technique, can automate the extraction of important clinical information, such as adverse events, from unstructured free-text reports. NLP information extraction can facilitate the optimization of clinical work by helping to improve quality control and patient management. OBJECTIVE: We developed an NLP pipeline to analyze free-text colonoscopy and pathology reports and evaluated its ability to automatically assess adenoma detection rate (ADR), sessile serrated lesion detection rate (SDR), and postcolonoscopy surveillance intervals. METHODS: The NLP tool for extracting colonoscopy quality indicators was developed using a data set of 2000 screening colonoscopy reports from a single health care system, with an associated 1425 pathology reports. The NLP system was then tested on a data set of 1000 colonoscopy reports and its performance was compared with that of 5 human annotators. Additionally, data from 54,562 colonoscopies performed between 2010 and 2019 were analyzed using the NLP pipeline. RESULTS: The NLP pipeline achieved an overall accuracy of 0.99-1.00 for identifying polyp subtypes, 0.99-1.00 for identifying the anatomical location of polyps, and 0.98 for counting the number of neoplastic polyps. The NLP pipeline achieved performance similar to clinical experts for assessing ADR, SDR, and surveillance intervals. NLP analysis of a 10-year colonoscopy data set identified great individual variance in colonoscopy quality indicators among 25 endoscopists. CONCLUSIONS: The NLP pipeline could accurately extract information from colonoscopy and pathology reports and demonstrated clinical efficacy for assessing ADR, SDR, and surveillance intervals in these reports. Implementation of the system enabled automated analysis and feedback on quality indicators, which could motivate endoscopists to improve the quality of their performance and improve clinical decision-making in colorectal cancer screening programs.
format Online
Article
Text
id pubmed-9055472
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-90554722022-05-01 Natural Language Processing for Assessing Quality Indicators in Free-Text Colonoscopy and Pathology Reports: Development and Usability Study Bae, Jung Ho Han, Hyun Wook Yang, Sun Young Song, Gyuseon Sa, Soonok Chung, Goh Eun Seo, Ji Yeon Jin, Eun Hyo Kim, Heecheon An, DongUk JMIR Med Inform Original Paper BACKGROUND: Manual data extraction of colonoscopy quality indicators is time and labor intensive. Natural language processing (NLP), a computer-based linguistics technique, can automate the extraction of important clinical information, such as adverse events, from unstructured free-text reports. NLP information extraction can facilitate the optimization of clinical work by helping to improve quality control and patient management. OBJECTIVE: We developed an NLP pipeline to analyze free-text colonoscopy and pathology reports and evaluated its ability to automatically assess adenoma detection rate (ADR), sessile serrated lesion detection rate (SDR), and postcolonoscopy surveillance intervals. METHODS: The NLP tool for extracting colonoscopy quality indicators was developed using a data set of 2000 screening colonoscopy reports from a single health care system, with an associated 1425 pathology reports. The NLP system was then tested on a data set of 1000 colonoscopy reports and its performance was compared with that of 5 human annotators. Additionally, data from 54,562 colonoscopies performed between 2010 and 2019 were analyzed using the NLP pipeline. RESULTS: The NLP pipeline achieved an overall accuracy of 0.99-1.00 for identifying polyp subtypes, 0.99-1.00 for identifying the anatomical location of polyps, and 0.98 for counting the number of neoplastic polyps. The NLP pipeline achieved performance similar to clinical experts for assessing ADR, SDR, and surveillance intervals. NLP analysis of a 10-year colonoscopy data set identified great individual variance in colonoscopy quality indicators among 25 endoscopists. CONCLUSIONS: The NLP pipeline could accurately extract information from colonoscopy and pathology reports and demonstrated clinical efficacy for assessing ADR, SDR, and surveillance intervals in these reports. Implementation of the system enabled automated analysis and feedback on quality indicators, which could motivate endoscopists to improve the quality of their performance and improve clinical decision-making in colorectal cancer screening programs. JMIR Publications 2022-04-15 /pmc/articles/PMC9055472/ /pubmed/35436226 http://dx.doi.org/10.2196/35257 Text en ©Jung Ho Bae, Hyun Wook Han, Sun Young Yang, Gyuseon Song, Soonok Sa, Goh Eun Chung, Ji Yeon Seo, Eun Hyo Jin, Heecheon Kim, DongUk An. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 15.04.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Bae, Jung Ho
Han, Hyun Wook
Yang, Sun Young
Song, Gyuseon
Sa, Soonok
Chung, Goh Eun
Seo, Ji Yeon
Jin, Eun Hyo
Kim, Heecheon
An, DongUk
Natural Language Processing for Assessing Quality Indicators in Free-Text Colonoscopy and Pathology Reports: Development and Usability Study
title Natural Language Processing for Assessing Quality Indicators in Free-Text Colonoscopy and Pathology Reports: Development and Usability Study
title_full Natural Language Processing for Assessing Quality Indicators in Free-Text Colonoscopy and Pathology Reports: Development and Usability Study
title_fullStr Natural Language Processing for Assessing Quality Indicators in Free-Text Colonoscopy and Pathology Reports: Development and Usability Study
title_full_unstemmed Natural Language Processing for Assessing Quality Indicators in Free-Text Colonoscopy and Pathology Reports: Development and Usability Study
title_short Natural Language Processing for Assessing Quality Indicators in Free-Text Colonoscopy and Pathology Reports: Development and Usability Study
title_sort natural language processing for assessing quality indicators in free-text colonoscopy and pathology reports: development and usability study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9055472/
https://www.ncbi.nlm.nih.gov/pubmed/35436226
http://dx.doi.org/10.2196/35257
work_keys_str_mv AT baejungho naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy
AT hanhyunwook naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy
AT yangsunyoung naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy
AT songgyuseon naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy
AT sasoonok naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy
AT chunggoheun naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy
AT seojiyeon naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy
AT jineunhyo naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy
AT kimheecheon naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy
AT andonguk naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy