Cargando…
A Semiautomated Chart Review for Assessing the Development of Radiation Pneumonitis Using Natural Language Processing: Diagnostic Accuracy and Feasibility Study
BACKGROUND: Health research frequently requires manual chart reviews to identify patients in a study-specific cohort and examine their clinical outcomes. Manual chart review is a labor-intensive process that requires significant time investment for clinical researchers. OBJECTIVE: This study aims to...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8663661/ https://www.ncbi.nlm.nih.gov/pubmed/34766919 http://dx.doi.org/10.2196/29241 |
_version_ | 1784613690284179456 |
---|---|
author | McKenzie, Jordan Rajapakshe, Rasika Shen, Hua Rajapakshe, Shan Lin, Angela |
author_facet | McKenzie, Jordan Rajapakshe, Rasika Shen, Hua Rajapakshe, Shan Lin, Angela |
author_sort | McKenzie, Jordan |
collection | PubMed |
description | BACKGROUND: Health research frequently requires manual chart reviews to identify patients in a study-specific cohort and examine their clinical outcomes. Manual chart review is a labor-intensive process that requires significant time investment for clinical researchers. OBJECTIVE: This study aims to evaluate the feasibility and accuracy of an assisted chart review program, using an in-house rule-based text-extraction program written in Python, to identify patients who developed radiation pneumonitis (RP) after receiving curative radiotherapy. METHODS: A retrospective manual chart review was completed for patients who received curative radiotherapy for stage 2-3 lung cancer from January 1, 2013 to December 31, 2015, at British Columbia Cancer, Kelowna Centre. In the manual chart review, RP diagnosis and grading were recorded using the Common Terminology Criteria for Adverse Events version 5.0. From the charts of 50 sample patients, a total of 1413 clinical documents were obtained for review from the electronic medical record system. The text-extraction program was built using the Natural Language Toolkit Python platform (and regular expressions, also known as RegEx). Python version 3.7.2 was used to run the text-extraction program. The output of the text-extraction program was a list of the full sentences containing the key terms, document IDs, and dates from which these sentences were extracted. The results from the manual review were used as the gold standard in this study, with which the results of the text-extraction program were compared. RESULTS: Fifty percent (25/50) of the sample patients developed grade ≥1 RP; the natural language processing program was able to ascertain 92% (23/25) of these patients (sensitivity 0.92, 95% CI 0.74-0.99; specificity 0.36, 95% CI 0.18-0.57). Furthermore, the text-extraction program was able to correctly identify all 9 patients with grade ≥2 RP, which are patients with clinically significant symptoms (sensitivity 1.0, 95% CI 0.66-1.0; specificity 0.27, 95% CI 0.14-0.43). The program was useful for distinguishing patients with RP from those without RP. The text-extraction program in this study avoided unnecessary manual review of 22% (11/50) of the sample patients, as these patients were identified as grade 0 RP and would not require further manual review in subsequent studies. CONCLUSIONS: This feasibility study showed that the text-extraction program was able to assist with the identification of patients who developed RP after curative radiotherapy. The program streamlines the manual chart review further by identifying the key sentences of interest. This work has the potential to improve future clinical research, as the text-extraction program shows promise in performing chart review in a more time-efficient manner, compared with the traditional labor-intensive manual chart review. |
format | Online Article Text |
id | pubmed-8663661 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-86636612021-12-30 A Semiautomated Chart Review for Assessing the Development of Radiation Pneumonitis Using Natural Language Processing: Diagnostic Accuracy and Feasibility Study McKenzie, Jordan Rajapakshe, Rasika Shen, Hua Rajapakshe, Shan Lin, Angela JMIR Med Inform Original Paper BACKGROUND: Health research frequently requires manual chart reviews to identify patients in a study-specific cohort and examine their clinical outcomes. Manual chart review is a labor-intensive process that requires significant time investment for clinical researchers. OBJECTIVE: This study aims to evaluate the feasibility and accuracy of an assisted chart review program, using an in-house rule-based text-extraction program written in Python, to identify patients who developed radiation pneumonitis (RP) after receiving curative radiotherapy. METHODS: A retrospective manual chart review was completed for patients who received curative radiotherapy for stage 2-3 lung cancer from January 1, 2013 to December 31, 2015, at British Columbia Cancer, Kelowna Centre. In the manual chart review, RP diagnosis and grading were recorded using the Common Terminology Criteria for Adverse Events version 5.0. From the charts of 50 sample patients, a total of 1413 clinical documents were obtained for review from the electronic medical record system. The text-extraction program was built using the Natural Language Toolkit Python platform (and regular expressions, also known as RegEx). Python version 3.7.2 was used to run the text-extraction program. The output of the text-extraction program was a list of the full sentences containing the key terms, document IDs, and dates from which these sentences were extracted. The results from the manual review were used as the gold standard in this study, with which the results of the text-extraction program were compared. RESULTS: Fifty percent (25/50) of the sample patients developed grade ≥1 RP; the natural language processing program was able to ascertain 92% (23/25) of these patients (sensitivity 0.92, 95% CI 0.74-0.99; specificity 0.36, 95% CI 0.18-0.57). Furthermore, the text-extraction program was able to correctly identify all 9 patients with grade ≥2 RP, which are patients with clinically significant symptoms (sensitivity 1.0, 95% CI 0.66-1.0; specificity 0.27, 95% CI 0.14-0.43). The program was useful for distinguishing patients with RP from those without RP. The text-extraction program in this study avoided unnecessary manual review of 22% (11/50) of the sample patients, as these patients were identified as grade 0 RP and would not require further manual review in subsequent studies. CONCLUSIONS: This feasibility study showed that the text-extraction program was able to assist with the identification of patients who developed RP after curative radiotherapy. The program streamlines the manual chart review further by identifying the key sentences of interest. This work has the potential to improve future clinical research, as the text-extraction program shows promise in performing chart review in a more time-efficient manner, compared with the traditional labor-intensive manual chart review. JMIR Publications 2021-11-12 /pmc/articles/PMC8663661/ /pubmed/34766919 http://dx.doi.org/10.2196/29241 Text en ©Jordan McKenzie, Rasika Rajapakshe, Hua Shen, Shan Rajapakshe, Angela Lin. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 12.11.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper McKenzie, Jordan Rajapakshe, Rasika Shen, Hua Rajapakshe, Shan Lin, Angela A Semiautomated Chart Review for Assessing the Development of Radiation Pneumonitis Using Natural Language Processing: Diagnostic Accuracy and Feasibility Study |
title | A Semiautomated Chart Review for Assessing the Development of Radiation Pneumonitis Using Natural Language Processing: Diagnostic Accuracy and Feasibility Study |
title_full | A Semiautomated Chart Review for Assessing the Development of Radiation Pneumonitis Using Natural Language Processing: Diagnostic Accuracy and Feasibility Study |
title_fullStr | A Semiautomated Chart Review for Assessing the Development of Radiation Pneumonitis Using Natural Language Processing: Diagnostic Accuracy and Feasibility Study |
title_full_unstemmed | A Semiautomated Chart Review for Assessing the Development of Radiation Pneumonitis Using Natural Language Processing: Diagnostic Accuracy and Feasibility Study |
title_short | A Semiautomated Chart Review for Assessing the Development of Radiation Pneumonitis Using Natural Language Processing: Diagnostic Accuracy and Feasibility Study |
title_sort | semiautomated chart review for assessing the development of radiation pneumonitis using natural language processing: diagnostic accuracy and feasibility study |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8663661/ https://www.ncbi.nlm.nih.gov/pubmed/34766919 http://dx.doi.org/10.2196/29241 |
work_keys_str_mv | AT mckenziejordan asemiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy AT rajapaksherasika asemiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy AT shenhua asemiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy AT rajapaksheshan asemiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy AT linangela asemiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy AT mckenziejordan semiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy AT rajapaksherasika semiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy AT shenhua semiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy AT rajapaksheshan semiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy AT linangela semiautomatedchartreviewforassessingthedevelopmentofradiationpneumonitisusingnaturallanguageprocessingdiagnosticaccuracyandfeasibilitystudy |