Cargando…

A Semiautomated Chart Review for Assessing the Development of Radiation Pneumonitis Using Natural Language Processing: Diagnostic Accuracy and Feasibility Study

BACKGROUND: Health research frequently requires manual chart reviews to identify patients in a study-specific cohort and examine their clinical outcomes. Manual chart review is a labor-intensive process that requires significant time investment for clinical researchers. OBJECTIVE: This study aims to...

Descripción completa

Detalles Bibliográficos
Autores principales: McKenzie, Jordan, Rajapakshe, Rasika, Shen, Hua, Rajapakshe, Shan, Lin, Angela
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8663661/
https://www.ncbi.nlm.nih.gov/pubmed/34766919
http://dx.doi.org/10.2196/29241
_version_ 1784613690284179456
author McKenzie, Jordan
Rajapakshe, Rasika
Shen, Hua
Rajapakshe, Shan
Lin, Angela
author_facet McKenzie, Jordan
Rajapakshe, Rasika
Shen, Hua
Rajapakshe, Shan
Lin, Angela
author_sort McKenzie, Jordan
collection PubMed
description BACKGROUND: Health research frequently requires manual chart reviews to identify patients in a study-specific cohort and examine their clinical outcomes. Manual chart review is a labor-intensive process that requires significant time investment for clinical researchers. OBJECTIVE: This study aims to evaluate the feasibility and accuracy of an assisted chart review program, using an in-house rule-based text-extraction program written in Python, to identify patients who developed radiation pneumonitis (RP) after receiving curative radiotherapy. METHODS: A retrospective manual chart review was completed for patients who received curative radiotherapy for stage 2-3 lung cancer from January 1, 2013 to December 31, 2015, at British Columbia Cancer, Kelowna Centre. In the manual chart review, RP diagnosis and grading were recorded using the Common Terminology Criteria for Adverse Events version 5.0. From the charts of 50 sample patients, a total of 1413 clinical documents were obtained for review from the electronic medical record system. The text-extraction program was built using the Natural Language Toolkit Python platform (and regular expressions, also known as RegEx). Python version 3.7.2 was used to run the text-extraction program. The output of the text-extraction program was a list of the full sentences containing the key terms, document IDs, and dates from which these sentences were extracted. The results from the manual review were used as the gold standard in this study, with which the results of the text-extraction program were compared. RESULTS: Fifty percent (25/50) of the sample patients developed grade ≥1 RP; the natural language processing program was able to ascertain 92% (23/25) of these patients (sensitivity 0.92, 95% CI 0.74-0.99; specificity 0.36, 95% CI 0.18-0.57). Furthermore, the text-extraction program was able to correctly identify all 9 patients with grade ≥2 RP, which are patients with clinically significant symptoms (sensitivity 1.0, 95% CI 0.66-1.0; specificity 0.27, 95% CI 0.14-0.43). The program was useful for distinguishing patients with RP from those without RP. The text-extraction program in this study avoided unnecessary manual review of 22% (11/50) of the sample patients, as these patients were identified as grade 0 RP and would not require further manual review in subsequent studies. CONCLUSIONS: This feasibility study showed that the text-extraction program was able to assist with the identification of patients who developed RP after curative radiotherapy. The program streamlines the manual chart review further by identifying the key sentences of interest. This work has the potential to improve future clinical research, as the text-extraction program shows promise in performing chart review in a more time-efficient manner, compared with the traditional labor-intensive manual chart review.
format Online
Article
Text
id pubmed-8663661
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-86636612021-12-30 A Semiautomated Chart Review for Assessing the Development of Radiation Pneumonitis Using Natural Language Processing: Diagnostic Accuracy and Feasibility Study McKenzie, Jordan Rajapakshe, Rasika Shen, Hua Rajapakshe, Shan Lin, Angela JMIR Med Inform Original Paper BACKGROUND: Health research frequently requires manual chart reviews to identify patients in a study-specific cohort and examine their clinical outcomes. Manual chart review is a labor-intensive process that requires significant time investment for clinical researchers. OBJECTIVE: This study aims to evaluate the feasibility and accuracy of an assisted chart review program, using an in-house rule-based text-extraction program written in Python, to identify patients who developed radiation pneumonitis (RP) after receiving curative radiotherapy. METHODS: A retrospective manual chart review was completed for patients who received curative radiotherapy for stage 2-3 lung cancer from January 1, 2013 to December 31, 2015, at British Columbia Cancer, Kelowna Centre. In the manual chart review, RP diagnosis and grading were recorded using the Common Terminology Criteria for Adverse Events version 5.0. From the charts of 50 sample patients, a total of 1413 clinical documents were obtained for review from the electronic medical record system. The text-extraction program was built using the Natural Language Toolkit Python platform (and regular expressions, also known as RegEx). Python version 3.7.2 was used to run the text-extraction program. The output of the text-extraction program was a list of the full sentences containing the key terms, document IDs, and dates from which these sentences were extracted. The results from the manual review were used as the gold standard in this study, with which the results of the text-extraction program were compared. RESULTS: Fifty percent (25/50) of the sample patients developed grade ≥1 RP; the natural language processing program was able to ascertain 92% (23/25) of these patients (sensitivity 0.92, 95% CI 0.74-0.99; specificity 0.36, 95% CI 0.18-0.57). Furthermore, the text-extraction program was able to correctly identify all 9 patients with grade ≥2 RP, which are patients with clinically significant symptoms (sensitivity 1.0, 95% CI 0.66-1.0; specificity 0.27, 95% CI 0.14-0.43). The program was useful for distinguishing patients with RP from those without RP. The text-extraction program in this study avoided unnecessary manual review of 22% (11/50) of the sample patients, as these patients were identified as grade 0 RP and would not require further manual review in subsequent studies. CONCLUSIONS: This feasibility study showed that the text-extraction program was able to assist with the identification of patients who developed RP after curative radiotherapy. The program streamlines the manual chart review further by identifying the key sentences of interest. This work has the potential to improve future clinical research, as the text-extraction program shows promise in performing chart review in a more time-efficient manner, compared with the traditional labor-intensive manual chart review. JMIR Publications 2021-11-12 /pmc/articles/PMC8663661/ /pubmed/34766919 http://dx.doi.org/10.2196/29241 Text en ©Jordan McKenzie, Rasika Rajapakshe, Hua Shen, Shan Rajapakshe, Angela Lin. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 12.11.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
McKenzie, Jordan
Rajapakshe, Rasika
Shen, Hua
Rajapakshe, Shan
Lin, Angela
A Semiautomated Chart Review for Assessing the Development of Radiation Pneumonitis Using Natural Language Processing: Diagnostic Accuracy and Feasibility Study
title A Semiautomated Chart Review for Assessing the Development of Radiation Pneumonitis Using Natural Language Processing: Diagnostic Accuracy and Feasibility Study
title_full A Semiautomated Chart Review for Assessing the Development of Radiation Pneumonitis Using Natural Language Processing: Diagnostic Accuracy and Feasibility Study
title_fullStr A Semiautomated Chart Review for Assessing the Development of Radiation Pneumonitis Using Natural Language Processing: Diagnostic Accuracy and Feasibility Study
title_full_unstemmed A Semiautomated Chart Review for Assessing the Development of Radiation Pneumonitis Using Natural Language Processing: Diagnostic Accuracy and Feasibility Study
title_short A Semiautomated Chart Review for Assessing the Development of Radiation Pneumonitis Using Natural Language Processing: Diagnostic Accuracy and Feasibility Study
title_sort semiautomated chart review for assessing the development of radiation pneumonitis using natural language processing: diagnostic accuracy and feasibility study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8663661/
https://www.ncbi.nlm.nih.gov/pubmed/34766919
http://dx.doi.org/10.2196/29241
work_keys_str_mv AT mckenziejordan asemiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy
AT rajapaksherasika asemiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy
AT shenhua asemiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy
AT rajapaksheshan asemiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy
AT linangela asemiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy
AT mckenziejordan semiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy
AT rajapaksherasika semiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy
AT shenhua semiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy
AT rajapaksheshan semiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy
AT linangela semiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy