Cargando…

Natural Language Processing for the Identification of Incidental Lung Nodules in Computed Tomography Reports: A Quality Control Tool

PURPOSE: To evaluate the diagnostic performance of a natural language processing (NLP) model in detecting incidental lung nodules (ILNs) in unstructured chest computed tomography (CT) reports. METHODS: All unstructured consecutive reports of chest CT scans performed at a tertiary hospital between 20...

Descripción completa

Detalles Bibliográficos
Autores principales: Basilio, Rodrigo, Carvalho, Alysson Roncally, Rodrigues, Rosana, Conrado, Marco, Accorsi, Sephania, Forghani, Reza, Machuca, Tiago, Zanon, Matheus, Altmayer, Stephan, Hochhegger, Bruno
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Wolters Kluwer Health 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10581645/
https://www.ncbi.nlm.nih.gov/pubmed/37769221
http://dx.doi.org/10.1200/GO.23.00191
_version_ 1785122180351131648
author Basilio, Rodrigo
Carvalho, Alysson Roncally
Rodrigues, Rosana
Conrado, Marco
Accorsi, Sephania
Forghani, Reza
Machuca, Tiago
Zanon, Matheus
Altmayer, Stephan
Hochhegger, Bruno
author_facet Basilio, Rodrigo
Carvalho, Alysson Roncally
Rodrigues, Rosana
Conrado, Marco
Accorsi, Sephania
Forghani, Reza
Machuca, Tiago
Zanon, Matheus
Altmayer, Stephan
Hochhegger, Bruno
author_sort Basilio, Rodrigo
collection PubMed
description PURPOSE: To evaluate the diagnostic performance of a natural language processing (NLP) model in detecting incidental lung nodules (ILNs) in unstructured chest computed tomography (CT) reports. METHODS: All unstructured consecutive reports of chest CT scans performed at a tertiary hospital between 2020 and 2021 were retrospectively reviewed (n = 21,542) to train the NLP tool. Internal validation was performed using reference readings by two radiologists of both CT scans and reports, using a different external cohort of 300 chest CT scans. Second, external validation was performed in a cohort of all random unstructured chest CT reports from 57 different hospitals conducted in May 2022. A review by the same thoracic radiologists was used as the gold standard. The sensitivity, specificity, and accuracy were calculated. RESULTS: Of 21,542 CT reports, 484 mentioned at least one ILN (mean age, 71 ± 17.6 [standard deviation] years; women, 52%) and were included in the training set. In the internal validation (n = 300), the NLP tool detected ILN with a sensitivity of 100.0% (95% CI, 97.6 to 100.0), a specificity of 95.9% (95% CI, 91.3 to 98.5), and an accuracy of 98.0% (95% CI, 95.7 to 99.3). In the external validation (n = 977), the NLP tool yielded a sensitivity of 98.4% (95% CI, 94.5 to 99.8), a specificity of 98.6% (95% CI, 97.5 to 99.3), and an accuracy of 98.6% (95% CI, 97.6 to 99.2). Twelve months after the initial reports, 8 (8.60%) patients had a final diagnosis of lung cancer, among which 2 (2.15%) would have been lost to follow-up without the NLP tool. CONCLUSION: NLP can be used to identify ILNs in unstructured reports with high accuracy, allowing a timely recall of patients and a potential diagnosis of early-stage lung cancer that might have been lost to follow-up.
format Online
Article
Text
id pubmed-10581645
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Wolters Kluwer Health
record_format MEDLINE/PubMed
spelling pubmed-105816452023-10-18 Natural Language Processing for the Identification of Incidental Lung Nodules in Computed Tomography Reports: A Quality Control Tool Basilio, Rodrigo Carvalho, Alysson Roncally Rodrigues, Rosana Conrado, Marco Accorsi, Sephania Forghani, Reza Machuca, Tiago Zanon, Matheus Altmayer, Stephan Hochhegger, Bruno JCO Glob Oncol ORIGINAL REPORTS PURPOSE: To evaluate the diagnostic performance of a natural language processing (NLP) model in detecting incidental lung nodules (ILNs) in unstructured chest computed tomography (CT) reports. METHODS: All unstructured consecutive reports of chest CT scans performed at a tertiary hospital between 2020 and 2021 were retrospectively reviewed (n = 21,542) to train the NLP tool. Internal validation was performed using reference readings by two radiologists of both CT scans and reports, using a different external cohort of 300 chest CT scans. Second, external validation was performed in a cohort of all random unstructured chest CT reports from 57 different hospitals conducted in May 2022. A review by the same thoracic radiologists was used as the gold standard. The sensitivity, specificity, and accuracy were calculated. RESULTS: Of 21,542 CT reports, 484 mentioned at least one ILN (mean age, 71 ± 17.6 [standard deviation] years; women, 52%) and were included in the training set. In the internal validation (n = 300), the NLP tool detected ILN with a sensitivity of 100.0% (95% CI, 97.6 to 100.0), a specificity of 95.9% (95% CI, 91.3 to 98.5), and an accuracy of 98.0% (95% CI, 95.7 to 99.3). In the external validation (n = 977), the NLP tool yielded a sensitivity of 98.4% (95% CI, 94.5 to 99.8), a specificity of 98.6% (95% CI, 97.5 to 99.3), and an accuracy of 98.6% (95% CI, 97.6 to 99.2). Twelve months after the initial reports, 8 (8.60%) patients had a final diagnosis of lung cancer, among which 2 (2.15%) would have been lost to follow-up without the NLP tool. CONCLUSION: NLP can be used to identify ILNs in unstructured reports with high accuracy, allowing a timely recall of patients and a potential diagnosis of early-stage lung cancer that might have been lost to follow-up. Wolters Kluwer Health 2023-09-28 /pmc/articles/PMC10581645/ /pubmed/37769221 http://dx.doi.org/10.1200/GO.23.00191 Text en © 2023 by American Society of Clinical Oncology https://creativecommons.org/licenses/by-nc-nd/4.0/Creative Commons Attribution Non-Commercial No Derivatives 4.0 License: http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/)
spellingShingle ORIGINAL REPORTS
Basilio, Rodrigo
Carvalho, Alysson Roncally
Rodrigues, Rosana
Conrado, Marco
Accorsi, Sephania
Forghani, Reza
Machuca, Tiago
Zanon, Matheus
Altmayer, Stephan
Hochhegger, Bruno
Natural Language Processing for the Identification of Incidental Lung Nodules in Computed Tomography Reports: A Quality Control Tool
title Natural Language Processing for the Identification of Incidental Lung Nodules in Computed Tomography Reports: A Quality Control Tool
title_full Natural Language Processing for the Identification of Incidental Lung Nodules in Computed Tomography Reports: A Quality Control Tool
title_fullStr Natural Language Processing for the Identification of Incidental Lung Nodules in Computed Tomography Reports: A Quality Control Tool
title_full_unstemmed Natural Language Processing for the Identification of Incidental Lung Nodules in Computed Tomography Reports: A Quality Control Tool
title_short Natural Language Processing for the Identification of Incidental Lung Nodules in Computed Tomography Reports: A Quality Control Tool
title_sort natural language processing for the identification of incidental lung nodules in computed tomography reports: a quality control tool
topic ORIGINAL REPORTS
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10581645/
https://www.ncbi.nlm.nih.gov/pubmed/37769221
http://dx.doi.org/10.1200/GO.23.00191
work_keys_str_mv AT basiliorodrigo naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool
AT carvalhoalyssonroncally naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool
AT rodriguesrosana naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool
AT conradomarco naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool
AT accorsisephania naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool
AT forghanireza naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool
AT machucatiago naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool
AT zanonmatheus naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool
AT altmayerstephan naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool
AT hochheggerbruno naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool