Cargando…

How Natural Language Processing Can Aid With Pulmonary Oncology Tumor Node Metastasis Staging From Free-Text Radiology Reports: Algorithm Development and Validation

BACKGROUND: Natural language processing (NLP) is thought to be a promising solution to extract and store concepts from free text in a structured manner for data mining purposes. This is also true for radiology reports, which still consist mostly of free text. Accurate and complete reports are very i...

Descripción completa

Detalles Bibliográficos
Autores principales: Puts, Sander, Nobel, Martijn, Zegers, Catharina, Bermejo, Iñigo, Robben, Simon, Dekker, Andre
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10131747/
https://www.ncbi.nlm.nih.gov/pubmed/36947118
http://dx.doi.org/10.2196/38125
_version_ 1785031244178784256
author Puts, Sander
Nobel, Martijn
Zegers, Catharina
Bermejo, Iñigo
Robben, Simon
Dekker, Andre
author_facet Puts, Sander
Nobel, Martijn
Zegers, Catharina
Bermejo, Iñigo
Robben, Simon
Dekker, Andre
author_sort Puts, Sander
collection PubMed
description BACKGROUND: Natural language processing (NLP) is thought to be a promising solution to extract and store concepts from free text in a structured manner for data mining purposes. This is also true for radiology reports, which still consist mostly of free text. Accurate and complete reports are very important for clinical decision support, for instance, in oncological staging. As such, NLP can be a tool to structure the content of the radiology report, thereby increasing the report’s value. OBJECTIVE: This study describes the implementation and validation of an N-stage classifier for pulmonary oncology. It is based on free-text radiological chest computed tomography reports according to the tumor, node, and metastasis (TNM) classification, which has been added to the already existing T-stage classifier to create a combined TN-stage classifier. METHODS: SpaCy, PyContextNLP, and regular expressions were used for proper information extraction, after additional rules were set to accurately extract N-stage. RESULTS: The overall TN-stage classifier accuracy scores were 0.84 and 0.85, respectively, for the training (N=95) and validation (N=97) sets. This is comparable to the outcomes of the T-stage classifier (0.87-0.92). CONCLUSIONS: This study shows that NLP has potential in classifying pulmonary oncology from free-text radiological reports according to the TNM classification system as both the T- and N-stages can be extracted with high accuracy.
format Online
Article
Text
id pubmed-10131747
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-101317472023-04-27 How Natural Language Processing Can Aid With Pulmonary Oncology Tumor Node Metastasis Staging From Free-Text Radiology Reports: Algorithm Development and Validation Puts, Sander Nobel, Martijn Zegers, Catharina Bermejo, Iñigo Robben, Simon Dekker, Andre JMIR Form Res Original Paper BACKGROUND: Natural language processing (NLP) is thought to be a promising solution to extract and store concepts from free text in a structured manner for data mining purposes. This is also true for radiology reports, which still consist mostly of free text. Accurate and complete reports are very important for clinical decision support, for instance, in oncological staging. As such, NLP can be a tool to structure the content of the radiology report, thereby increasing the report’s value. OBJECTIVE: This study describes the implementation and validation of an N-stage classifier for pulmonary oncology. It is based on free-text radiological chest computed tomography reports according to the tumor, node, and metastasis (TNM) classification, which has been added to the already existing T-stage classifier to create a combined TN-stage classifier. METHODS: SpaCy, PyContextNLP, and regular expressions were used for proper information extraction, after additional rules were set to accurately extract N-stage. RESULTS: The overall TN-stage classifier accuracy scores were 0.84 and 0.85, respectively, for the training (N=95) and validation (N=97) sets. This is comparable to the outcomes of the T-stage classifier (0.87-0.92). CONCLUSIONS: This study shows that NLP has potential in classifying pulmonary oncology from free-text radiological reports according to the TNM classification system as both the T- and N-stages can be extracted with high accuracy. JMIR Publications 2023-03-22 /pmc/articles/PMC10131747/ /pubmed/36947118 http://dx.doi.org/10.2196/38125 Text en ©Sander Puts, Martijn Nobel, Catharina Zegers, Iñigo Bermejo, Simon Robben, Andre Dekker. Originally published in JMIR Formative Research (https://formative.jmir.org), 22.03.2023. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Formative Research, is properly cited. The complete bibliographic information, a link to the original publication on https://formative.jmir.org, as well as this copyright and license information must be included.
spellingShingle Original Paper
Puts, Sander
Nobel, Martijn
Zegers, Catharina
Bermejo, Iñigo
Robben, Simon
Dekker, Andre
How Natural Language Processing Can Aid With Pulmonary Oncology Tumor Node Metastasis Staging From Free-Text Radiology Reports: Algorithm Development and Validation
title How Natural Language Processing Can Aid With Pulmonary Oncology Tumor Node Metastasis Staging From Free-Text Radiology Reports: Algorithm Development and Validation
title_full How Natural Language Processing Can Aid With Pulmonary Oncology Tumor Node Metastasis Staging From Free-Text Radiology Reports: Algorithm Development and Validation
title_fullStr How Natural Language Processing Can Aid With Pulmonary Oncology Tumor Node Metastasis Staging From Free-Text Radiology Reports: Algorithm Development and Validation
title_full_unstemmed How Natural Language Processing Can Aid With Pulmonary Oncology Tumor Node Metastasis Staging From Free-Text Radiology Reports: Algorithm Development and Validation
title_short How Natural Language Processing Can Aid With Pulmonary Oncology Tumor Node Metastasis Staging From Free-Text Radiology Reports: Algorithm Development and Validation
title_sort how natural language processing can aid with pulmonary oncology tumor node metastasis staging from free-text radiology reports: algorithm development and validation
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10131747/
https://www.ncbi.nlm.nih.gov/pubmed/36947118
http://dx.doi.org/10.2196/38125
work_keys_str_mv AT putssander hownaturallanguageprocessingcanaidwithpulmonaryoncologytumornodemetastasisstagingfromfreetextradiologyreportsalgorithmdevelopmentandvalidation
AT nobelmartijn hownaturallanguageprocessingcanaidwithpulmonaryoncologytumornodemetastasisstagingfromfreetextradiologyreportsalgorithmdevelopmentandvalidation
AT zegerscatharina hownaturallanguageprocessingcanaidwithpulmonaryoncologytumornodemetastasisstagingfromfreetextradiologyreportsalgorithmdevelopmentandvalidation
AT bermejoinigo hownaturallanguageprocessingcanaidwithpulmonaryoncologytumornodemetastasisstagingfromfreetextradiologyreportsalgorithmdevelopmentandvalidation
AT robbensimon hownaturallanguageprocessingcanaidwithpulmonaryoncologytumornodemetastasisstagingfromfreetextradiologyreportsalgorithmdevelopmentandvalidation
AT dekkerandre hownaturallanguageprocessingcanaidwithpulmonaryoncologytumornodemetastasisstagingfromfreetextradiologyreportsalgorithmdevelopmentandvalidation