Cargando…

Comparison of Natural Language Processing and Manual Coding for the Identification of Cross-Sectional Imaging Reports Suspicious for Lung Cancer

PURPOSE: To compare the accuracy and reliability of a natural language processing (NLP) algorithm with manual coding by radiologists, and the combination of the two methods, for the identification of patients whose computed tomography (CT) reports raised the concern for lung cancer. METHODS: An NLP...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wadia, Roxanne, Akgun, Kathleen, Brandt, Cynthia, Fenton, Brenda T., Levin, Woody, Marple, Andrew H., Garla, Vijay, Rose, Michal G., Taddei, Tamar, Taylor, Caroline
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	American Society of Clinical Oncology 2018
Materias:	Original Reports
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6873962/ https://www.ncbi.nlm.nih.gov/pubmed/30652545 http://dx.doi.org/10.1200/CCI.17.00069

_version_	1783472756876115968
author	Wadia, Roxanne Akgun, Kathleen Brandt, Cynthia Fenton, Brenda T. Levin, Woody Marple, Andrew H. Garla, Vijay Rose, Michal G. Taddei, Tamar Taylor, Caroline
author_facet	Wadia, Roxanne Akgun, Kathleen Brandt, Cynthia Fenton, Brenda T. Levin, Woody Marple, Andrew H. Garla, Vijay Rose, Michal G. Taddei, Tamar Taylor, Caroline
author_sort	Wadia, Roxanne
collection	PubMed
description	PURPOSE: To compare the accuracy and reliability of a natural language processing (NLP) algorithm with manual coding by radiologists, and the combination of the two methods, for the identification of patients whose computed tomography (CT) reports raised the concern for lung cancer. METHODS: An NLP algorithm was developed using Clinical Text Analysis and Knowledge Extraction System (cTAKES) with the Yale cTAKES Extensions and trained to differentiate between language indicating benign lesions and lesions concerning for lung cancer. A random sample of 450 chest CT reports performed at Veterans Affairs Connecticut Healthcare System between January 2014 and July 2015 was selected. A reference standard was created by the manual review of reports to determine if the text stated that follow-up was needed for concern for cancer. The NLP algorithm was applied to all reports and compared with case identification using the manual coding by the radiologists. RESULTS: A total of 450 reports representing 428 patients were analyzed. NLP had higher sensitivity and lower specificity than manual coding (77.3% v 51.5% and 72.5% v 82.5%, respectively). NLP and manual coding had similar positive predictive values (88.4% v 88.9%), and NLP had a higher negative predictive value than manual coding (54% v 38.5%). When NLP and manual coding were combined, sensitivity increased to 92.3%, with a decrease in specificity to 62.85%. Combined NLP and manual coding had a positive predictive value of 87.0% and a negative predictive value of 75.2%. CONCLUSION: Our NLP algorithm was more sensitive than manual coding of CT chest reports for the identification of patients who required follow-up for suspicion of lung cancer. The combination of NLP and manual coding is a sensitive way to identify patients who need further workup for lung cancer.
format	Online Article Text
id	pubmed-6873962
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	American Society of Clinical Oncology
record_format	MEDLINE/PubMed
spelling	pubmed-68739622019-12-03 Comparison of Natural Language Processing and Manual Coding for the Identification of Cross-Sectional Imaging Reports Suspicious for Lung Cancer Wadia, Roxanne Akgun, Kathleen Brandt, Cynthia Fenton, Brenda T. Levin, Woody Marple, Andrew H. Garla, Vijay Rose, Michal G. Taddei, Tamar Taylor, Caroline JCO Clin Cancer Inform Original Reports PURPOSE: To compare the accuracy and reliability of a natural language processing (NLP) algorithm with manual coding by radiologists, and the combination of the two methods, for the identification of patients whose computed tomography (CT) reports raised the concern for lung cancer. METHODS: An NLP algorithm was developed using Clinical Text Analysis and Knowledge Extraction System (cTAKES) with the Yale cTAKES Extensions and trained to differentiate between language indicating benign lesions and lesions concerning for lung cancer. A random sample of 450 chest CT reports performed at Veterans Affairs Connecticut Healthcare System between January 2014 and July 2015 was selected. A reference standard was created by the manual review of reports to determine if the text stated that follow-up was needed for concern for cancer. The NLP algorithm was applied to all reports and compared with case identification using the manual coding by the radiologists. RESULTS: A total of 450 reports representing 428 patients were analyzed. NLP had higher sensitivity and lower specificity than manual coding (77.3% v 51.5% and 72.5% v 82.5%, respectively). NLP and manual coding had similar positive predictive values (88.4% v 88.9%), and NLP had a higher negative predictive value than manual coding (54% v 38.5%). When NLP and manual coding were combined, sensitivity increased to 92.3%, with a decrease in specificity to 62.85%. Combined NLP and manual coding had a positive predictive value of 87.0% and a negative predictive value of 75.2%. CONCLUSION: Our NLP algorithm was more sensitive than manual coding of CT chest reports for the identification of patients who required follow-up for suspicion of lung cancer. The combination of NLP and manual coding is a sensitive way to identify patients who need further workup for lung cancer. American Society of Clinical Oncology 2018-02-20 /pmc/articles/PMC6873962/ /pubmed/30652545 http://dx.doi.org/10.1200/CCI.17.00069 Text en © 2018 by American Society of Clinical Oncology https://creativecommons.org/licenses/by/4.0/ Licensed under the Creative Commons Attribution 4.0 License: https://creativecommons.org/licenses/by/4.0/
spellingShingle	Original Reports Wadia, Roxanne Akgun, Kathleen Brandt, Cynthia Fenton, Brenda T. Levin, Woody Marple, Andrew H. Garla, Vijay Rose, Michal G. Taddei, Tamar Taylor, Caroline Comparison of Natural Language Processing and Manual Coding for the Identification of Cross-Sectional Imaging Reports Suspicious for Lung Cancer
title	Comparison of Natural Language Processing and Manual Coding for the Identification of Cross-Sectional Imaging Reports Suspicious for Lung Cancer
title_full	Comparison of Natural Language Processing and Manual Coding for the Identification of Cross-Sectional Imaging Reports Suspicious for Lung Cancer
title_fullStr	Comparison of Natural Language Processing and Manual Coding for the Identification of Cross-Sectional Imaging Reports Suspicious for Lung Cancer
title_full_unstemmed	Comparison of Natural Language Processing and Manual Coding for the Identification of Cross-Sectional Imaging Reports Suspicious for Lung Cancer
title_short	Comparison of Natural Language Processing and Manual Coding for the Identification of Cross-Sectional Imaging Reports Suspicious for Lung Cancer
title_sort	comparison of natural language processing and manual coding for the identification of cross-sectional imaging reports suspicious for lung cancer
topic	Original Reports
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6873962/ https://www.ncbi.nlm.nih.gov/pubmed/30652545 http://dx.doi.org/10.1200/CCI.17.00069
work_keys_str_mv	AT wadiaroxanne comparisonofnaturallanguageprocessingandmanualcodingfortheidentificationofcrosssectionalimagingreportssuspiciousforlungcancer AT akgunkathleen comparisonofnaturallanguageprocessingandmanualcodingfortheidentificationofcrosssectionalimagingreportssuspiciousforlungcancer AT brandtcynthia comparisonofnaturallanguageprocessingandmanualcodingfortheidentificationofcrosssectionalimagingreportssuspiciousforlungcancer AT fentonbrendat comparisonofnaturallanguageprocessingandmanualcodingfortheidentificationofcrosssectionalimagingreportssuspiciousforlungcancer AT levinwoody comparisonofnaturallanguageprocessingandmanualcodingfortheidentificationofcrosssectionalimagingreportssuspiciousforlungcancer AT marpleandrewh comparisonofnaturallanguageprocessingandmanualcodingfortheidentificationofcrosssectionalimagingreportssuspiciousforlungcancer AT garlavijay comparisonofnaturallanguageprocessingandmanualcodingfortheidentificationofcrosssectionalimagingreportssuspiciousforlungcancer AT rosemichalg comparisonofnaturallanguageprocessingandmanualcodingfortheidentificationofcrosssectionalimagingreportssuspiciousforlungcancer AT taddeitamar comparisonofnaturallanguageprocessingandmanualcodingfortheidentificationofcrosssectionalimagingreportssuspiciousforlungcancer AT taylorcaroline comparisonofnaturallanguageprocessingandmanualcodingfortheidentificationofcrosssectionalimagingreportssuspiciousforlungcancer

Comparison of Natural Language Processing and Manual Coding for the Identification of Cross-Sectional Imaging Reports Suspicious for Lung Cancer

Ejemplares similares