Cargando…
Natural Language Processing for Assessing Quality Indicators in Free-Text Colonoscopy and Pathology Reports: Development and Usability Study
BACKGROUND: Manual data extraction of colonoscopy quality indicators is time and labor intensive. Natural language processing (NLP), a computer-based linguistics technique, can automate the extraction of important clinical information, such as adverse events, from unstructured free-text reports. NLP...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9055472/ https://www.ncbi.nlm.nih.gov/pubmed/35436226 http://dx.doi.org/10.2196/35257 |
_version_ | 1784697419900911616 |
---|---|
author | Bae, Jung Ho Han, Hyun Wook Yang, Sun Young Song, Gyuseon Sa, Soonok Chung, Goh Eun Seo, Ji Yeon Jin, Eun Hyo Kim, Heecheon An, DongUk |
author_facet | Bae, Jung Ho Han, Hyun Wook Yang, Sun Young Song, Gyuseon Sa, Soonok Chung, Goh Eun Seo, Ji Yeon Jin, Eun Hyo Kim, Heecheon An, DongUk |
author_sort | Bae, Jung Ho |
collection | PubMed |
description | BACKGROUND: Manual data extraction of colonoscopy quality indicators is time and labor intensive. Natural language processing (NLP), a computer-based linguistics technique, can automate the extraction of important clinical information, such as adverse events, from unstructured free-text reports. NLP information extraction can facilitate the optimization of clinical work by helping to improve quality control and patient management. OBJECTIVE: We developed an NLP pipeline to analyze free-text colonoscopy and pathology reports and evaluated its ability to automatically assess adenoma detection rate (ADR), sessile serrated lesion detection rate (SDR), and postcolonoscopy surveillance intervals. METHODS: The NLP tool for extracting colonoscopy quality indicators was developed using a data set of 2000 screening colonoscopy reports from a single health care system, with an associated 1425 pathology reports. The NLP system was then tested on a data set of 1000 colonoscopy reports and its performance was compared with that of 5 human annotators. Additionally, data from 54,562 colonoscopies performed between 2010 and 2019 were analyzed using the NLP pipeline. RESULTS: The NLP pipeline achieved an overall accuracy of 0.99-1.00 for identifying polyp subtypes, 0.99-1.00 for identifying the anatomical location of polyps, and 0.98 for counting the number of neoplastic polyps. The NLP pipeline achieved performance similar to clinical experts for assessing ADR, SDR, and surveillance intervals. NLP analysis of a 10-year colonoscopy data set identified great individual variance in colonoscopy quality indicators among 25 endoscopists. CONCLUSIONS: The NLP pipeline could accurately extract information from colonoscopy and pathology reports and demonstrated clinical efficacy for assessing ADR, SDR, and surveillance intervals in these reports. Implementation of the system enabled automated analysis and feedback on quality indicators, which could motivate endoscopists to improve the quality of their performance and improve clinical decision-making in colorectal cancer screening programs. |
format | Online Article Text |
id | pubmed-9055472 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-90554722022-05-01 Natural Language Processing for Assessing Quality Indicators in Free-Text Colonoscopy and Pathology Reports: Development and Usability Study Bae, Jung Ho Han, Hyun Wook Yang, Sun Young Song, Gyuseon Sa, Soonok Chung, Goh Eun Seo, Ji Yeon Jin, Eun Hyo Kim, Heecheon An, DongUk JMIR Med Inform Original Paper BACKGROUND: Manual data extraction of colonoscopy quality indicators is time and labor intensive. Natural language processing (NLP), a computer-based linguistics technique, can automate the extraction of important clinical information, such as adverse events, from unstructured free-text reports. NLP information extraction can facilitate the optimization of clinical work by helping to improve quality control and patient management. OBJECTIVE: We developed an NLP pipeline to analyze free-text colonoscopy and pathology reports and evaluated its ability to automatically assess adenoma detection rate (ADR), sessile serrated lesion detection rate (SDR), and postcolonoscopy surveillance intervals. METHODS: The NLP tool for extracting colonoscopy quality indicators was developed using a data set of 2000 screening colonoscopy reports from a single health care system, with an associated 1425 pathology reports. The NLP system was then tested on a data set of 1000 colonoscopy reports and its performance was compared with that of 5 human annotators. Additionally, data from 54,562 colonoscopies performed between 2010 and 2019 were analyzed using the NLP pipeline. RESULTS: The NLP pipeline achieved an overall accuracy of 0.99-1.00 for identifying polyp subtypes, 0.99-1.00 for identifying the anatomical location of polyps, and 0.98 for counting the number of neoplastic polyps. The NLP pipeline achieved performance similar to clinical experts for assessing ADR, SDR, and surveillance intervals. NLP analysis of a 10-year colonoscopy data set identified great individual variance in colonoscopy quality indicators among 25 endoscopists. CONCLUSIONS: The NLP pipeline could accurately extract information from colonoscopy and pathology reports and demonstrated clinical efficacy for assessing ADR, SDR, and surveillance intervals in these reports. Implementation of the system enabled automated analysis and feedback on quality indicators, which could motivate endoscopists to improve the quality of their performance and improve clinical decision-making in colorectal cancer screening programs. JMIR Publications 2022-04-15 /pmc/articles/PMC9055472/ /pubmed/35436226 http://dx.doi.org/10.2196/35257 Text en ©Jung Ho Bae, Hyun Wook Han, Sun Young Yang, Gyuseon Song, Soonok Sa, Goh Eun Chung, Ji Yeon Seo, Eun Hyo Jin, Heecheon Kim, DongUk An. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 15.04.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Bae, Jung Ho Han, Hyun Wook Yang, Sun Young Song, Gyuseon Sa, Soonok Chung, Goh Eun Seo, Ji Yeon Jin, Eun Hyo Kim, Heecheon An, DongUk Natural Language Processing for Assessing Quality Indicators in Free-Text Colonoscopy and Pathology Reports: Development and Usability Study |
title | Natural Language Processing for Assessing Quality Indicators in Free-Text Colonoscopy and Pathology Reports: Development and Usability Study |
title_full | Natural Language Processing for Assessing Quality Indicators in Free-Text Colonoscopy and Pathology Reports: Development and Usability Study |
title_fullStr | Natural Language Processing for Assessing Quality Indicators in Free-Text Colonoscopy and Pathology Reports: Development and Usability Study |
title_full_unstemmed | Natural Language Processing for Assessing Quality Indicators in Free-Text Colonoscopy and Pathology Reports: Development and Usability Study |
title_short | Natural Language Processing for Assessing Quality Indicators in Free-Text Colonoscopy and Pathology Reports: Development and Usability Study |
title_sort | natural language processing for assessing quality indicators in free-text colonoscopy and pathology reports: development and usability study |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9055472/ https://www.ncbi.nlm.nih.gov/pubmed/35436226 http://dx.doi.org/10.2196/35257 |
work_keys_str_mv | AT baejungho naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy AT hanhyunwook naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy AT yangsunyoung naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy AT songgyuseon naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy AT sasoonok naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy AT chunggoheun naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy AT seojiyeon naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy AT jineunhyo naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy AT kimheecheon naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy AT andonguk naturallanguageprocessingforassessingqualityindicatorsinfreetextcolonoscopyandpathologyreportsdevelopmentandusabilitystudy |