Cargando…
Natural Language Processing for the Identification of Incidental Lung Nodules in Computed Tomography Reports: A Quality Control Tool
PURPOSE: To evaluate the diagnostic performance of a natural language processing (NLP) model in detecting incidental lung nodules (ILNs) in unstructured chest computed tomography (CT) reports. METHODS: All unstructured consecutive reports of chest CT scans performed at a tertiary hospital between 20...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Wolters Kluwer Health
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10581645/ https://www.ncbi.nlm.nih.gov/pubmed/37769221 http://dx.doi.org/10.1200/GO.23.00191 |
_version_ | 1785122180351131648 |
---|---|
author | Basilio, Rodrigo Carvalho, Alysson Roncally Rodrigues, Rosana Conrado, Marco Accorsi, Sephania Forghani, Reza Machuca, Tiago Zanon, Matheus Altmayer, Stephan Hochhegger, Bruno |
author_facet | Basilio, Rodrigo Carvalho, Alysson Roncally Rodrigues, Rosana Conrado, Marco Accorsi, Sephania Forghani, Reza Machuca, Tiago Zanon, Matheus Altmayer, Stephan Hochhegger, Bruno |
author_sort | Basilio, Rodrigo |
collection | PubMed |
description | PURPOSE: To evaluate the diagnostic performance of a natural language processing (NLP) model in detecting incidental lung nodules (ILNs) in unstructured chest computed tomography (CT) reports. METHODS: All unstructured consecutive reports of chest CT scans performed at a tertiary hospital between 2020 and 2021 were retrospectively reviewed (n = 21,542) to train the NLP tool. Internal validation was performed using reference readings by two radiologists of both CT scans and reports, using a different external cohort of 300 chest CT scans. Second, external validation was performed in a cohort of all random unstructured chest CT reports from 57 different hospitals conducted in May 2022. A review by the same thoracic radiologists was used as the gold standard. The sensitivity, specificity, and accuracy were calculated. RESULTS: Of 21,542 CT reports, 484 mentioned at least one ILN (mean age, 71 ± 17.6 [standard deviation] years; women, 52%) and were included in the training set. In the internal validation (n = 300), the NLP tool detected ILN with a sensitivity of 100.0% (95% CI, 97.6 to 100.0), a specificity of 95.9% (95% CI, 91.3 to 98.5), and an accuracy of 98.0% (95% CI, 95.7 to 99.3). In the external validation (n = 977), the NLP tool yielded a sensitivity of 98.4% (95% CI, 94.5 to 99.8), a specificity of 98.6% (95% CI, 97.5 to 99.3), and an accuracy of 98.6% (95% CI, 97.6 to 99.2). Twelve months after the initial reports, 8 (8.60%) patients had a final diagnosis of lung cancer, among which 2 (2.15%) would have been lost to follow-up without the NLP tool. CONCLUSION: NLP can be used to identify ILNs in unstructured reports with high accuracy, allowing a timely recall of patients and a potential diagnosis of early-stage lung cancer that might have been lost to follow-up. |
format | Online Article Text |
id | pubmed-10581645 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Wolters Kluwer Health |
record_format | MEDLINE/PubMed |
spelling | pubmed-105816452023-10-18 Natural Language Processing for the Identification of Incidental Lung Nodules in Computed Tomography Reports: A Quality Control Tool Basilio, Rodrigo Carvalho, Alysson Roncally Rodrigues, Rosana Conrado, Marco Accorsi, Sephania Forghani, Reza Machuca, Tiago Zanon, Matheus Altmayer, Stephan Hochhegger, Bruno JCO Glob Oncol ORIGINAL REPORTS PURPOSE: To evaluate the diagnostic performance of a natural language processing (NLP) model in detecting incidental lung nodules (ILNs) in unstructured chest computed tomography (CT) reports. METHODS: All unstructured consecutive reports of chest CT scans performed at a tertiary hospital between 2020 and 2021 were retrospectively reviewed (n = 21,542) to train the NLP tool. Internal validation was performed using reference readings by two radiologists of both CT scans and reports, using a different external cohort of 300 chest CT scans. Second, external validation was performed in a cohort of all random unstructured chest CT reports from 57 different hospitals conducted in May 2022. A review by the same thoracic radiologists was used as the gold standard. The sensitivity, specificity, and accuracy were calculated. RESULTS: Of 21,542 CT reports, 484 mentioned at least one ILN (mean age, 71 ± 17.6 [standard deviation] years; women, 52%) and were included in the training set. In the internal validation (n = 300), the NLP tool detected ILN with a sensitivity of 100.0% (95% CI, 97.6 to 100.0), a specificity of 95.9% (95% CI, 91.3 to 98.5), and an accuracy of 98.0% (95% CI, 95.7 to 99.3). In the external validation (n = 977), the NLP tool yielded a sensitivity of 98.4% (95% CI, 94.5 to 99.8), a specificity of 98.6% (95% CI, 97.5 to 99.3), and an accuracy of 98.6% (95% CI, 97.6 to 99.2). Twelve months after the initial reports, 8 (8.60%) patients had a final diagnosis of lung cancer, among which 2 (2.15%) would have been lost to follow-up without the NLP tool. CONCLUSION: NLP can be used to identify ILNs in unstructured reports with high accuracy, allowing a timely recall of patients and a potential diagnosis of early-stage lung cancer that might have been lost to follow-up. Wolters Kluwer Health 2023-09-28 /pmc/articles/PMC10581645/ /pubmed/37769221 http://dx.doi.org/10.1200/GO.23.00191 Text en © 2023 by American Society of Clinical Oncology https://creativecommons.org/licenses/by-nc-nd/4.0/Creative Commons Attribution Non-Commercial No Derivatives 4.0 License: http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) |
spellingShingle | ORIGINAL REPORTS Basilio, Rodrigo Carvalho, Alysson Roncally Rodrigues, Rosana Conrado, Marco Accorsi, Sephania Forghani, Reza Machuca, Tiago Zanon, Matheus Altmayer, Stephan Hochhegger, Bruno Natural Language Processing for the Identification of Incidental Lung Nodules in Computed Tomography Reports: A Quality Control Tool |
title | Natural Language Processing for the Identification of Incidental Lung Nodules in Computed Tomography Reports: A Quality Control Tool |
title_full | Natural Language Processing for the Identification of Incidental Lung Nodules in Computed Tomography Reports: A Quality Control Tool |
title_fullStr | Natural Language Processing for the Identification of Incidental Lung Nodules in Computed Tomography Reports: A Quality Control Tool |
title_full_unstemmed | Natural Language Processing for the Identification of Incidental Lung Nodules in Computed Tomography Reports: A Quality Control Tool |
title_short | Natural Language Processing for the Identification of Incidental Lung Nodules in Computed Tomography Reports: A Quality Control Tool |
title_sort | natural language processing for the identification of incidental lung nodules in computed tomography reports: a quality control tool |
topic | ORIGINAL REPORTS |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10581645/ https://www.ncbi.nlm.nih.gov/pubmed/37769221 http://dx.doi.org/10.1200/GO.23.00191 |
work_keys_str_mv | AT basiliorodrigo naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool AT carvalhoalyssonroncally naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool AT rodriguesrosana naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool AT conradomarco naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool AT accorsisephania naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool AT forghanireza naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool AT machucatiago naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool AT zanonmatheus naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool AT altmayerstephan naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool AT hochheggerbruno naturallanguageprocessingfortheidentificationofincidentallungnodulesincomputedtomographyreportsaqualitycontroltool |