Cargando…

Applying Natural Language Processing to Single-Report Prediction of Metastatic Disease Response Using the OR-RADS Lexicon

SIMPLE SUMMARY: Lack of standardization among radiologists in writing radiological reports impacts the ability to interpret cancer response to treatment at a large-scale. This is an issue since large-scale data collection is necessary to generate Real World Evidence (RWE) towards understanding the e...

Descripción completa

Detalles Bibliográficos
Autores principales: Elbatarny, Lydia, Do, Richard K. G., Gangai, Natalie, Ahmed, Firas, Chhabra, Shalini, Simpson, Amber L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10605614/
https://www.ncbi.nlm.nih.gov/pubmed/37894276
http://dx.doi.org/10.3390/cancers15204909
_version_ 1785127116145164288
author Elbatarny, Lydia
Do, Richard K. G.
Gangai, Natalie
Ahmed, Firas
Chhabra, Shalini
Simpson, Amber L.
author_facet Elbatarny, Lydia
Do, Richard K. G.
Gangai, Natalie
Ahmed, Firas
Chhabra, Shalini
Simpson, Amber L.
author_sort Elbatarny, Lydia
collection PubMed
description SIMPLE SUMMARY: Lack of standardization among radiologists in writing radiological reports impacts the ability to interpret cancer response to treatment at a large-scale. This is an issue since large-scale data collection is necessary to generate Real World Evidence (RWE) towards understanding the effectiveness of cancer treatments and developing personalized patient treatment decisions. This study aims to examine the utility of applying natural language processing (NLP) for large-scale interpretation of disease response using the standardized oncologic response categories known as the OR-RADS to facilitate RWE collection. This study demonstrates the feasibility of applying NLP to predict disease response in cancer patients, exceeding human performance, thus encouraging use of the standardized OR-RADS categories among radiologists and researchers to improve large-scale response prediction accuracy. ABSTRACT: Generating Real World Evidence (RWE) on disease responses from radiological reports is important for understanding cancer treatment effectiveness and developing personalized treatment. A lack of standardization in reporting among radiologists impacts the feasibility of large-scale interpretation of disease response. This study examines the utility of applying natural language processing (NLP) to the large-scale interpretation of disease responses using a standardized oncologic response lexicon (OR-RADS) to facilitate RWE collection. Radiologists annotated 3503 retrospectively collected clinical impressions from radiological reports across several cancer types with one of seven OR-RADS categories. A Bidirectional Encoder Representations from Transformers (BERT) model was trained on this dataset with an 80–20% train/test split to perform multiclass and single-class classification tasks using the OR-RADS. Radiologists also performed the classification to compare human and model performance. The model achieved accuracies from 95 to 99% across all classification tasks, performing better in single-class tasks compared to the multiclass task and producing minimal misclassifications, which pertained mostly to overpredicting the equivocal and mixed OR-RADS labels. Human accuracy ranged from 74 to 93% across all classification tasks, performing better on single-class tasks. This study demonstrates the feasibility of the BERT NLP model in predicting disease response in cancer patients, exceeding human performance, and encourages the use of the standardized OR-RADS lexicon to improve large-scale prediction accuracy.
format Online
Article
Text
id pubmed-10605614
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106056142023-10-28 Applying Natural Language Processing to Single-Report Prediction of Metastatic Disease Response Using the OR-RADS Lexicon Elbatarny, Lydia Do, Richard K. G. Gangai, Natalie Ahmed, Firas Chhabra, Shalini Simpson, Amber L. Cancers (Basel) Article SIMPLE SUMMARY: Lack of standardization among radiologists in writing radiological reports impacts the ability to interpret cancer response to treatment at a large-scale. This is an issue since large-scale data collection is necessary to generate Real World Evidence (RWE) towards understanding the effectiveness of cancer treatments and developing personalized patient treatment decisions. This study aims to examine the utility of applying natural language processing (NLP) for large-scale interpretation of disease response using the standardized oncologic response categories known as the OR-RADS to facilitate RWE collection. This study demonstrates the feasibility of applying NLP to predict disease response in cancer patients, exceeding human performance, thus encouraging use of the standardized OR-RADS categories among radiologists and researchers to improve large-scale response prediction accuracy. ABSTRACT: Generating Real World Evidence (RWE) on disease responses from radiological reports is important for understanding cancer treatment effectiveness and developing personalized treatment. A lack of standardization in reporting among radiologists impacts the feasibility of large-scale interpretation of disease response. This study examines the utility of applying natural language processing (NLP) to the large-scale interpretation of disease responses using a standardized oncologic response lexicon (OR-RADS) to facilitate RWE collection. Radiologists annotated 3503 retrospectively collected clinical impressions from radiological reports across several cancer types with one of seven OR-RADS categories. A Bidirectional Encoder Representations from Transformers (BERT) model was trained on this dataset with an 80–20% train/test split to perform multiclass and single-class classification tasks using the OR-RADS. Radiologists also performed the classification to compare human and model performance. The model achieved accuracies from 95 to 99% across all classification tasks, performing better in single-class tasks compared to the multiclass task and producing minimal misclassifications, which pertained mostly to overpredicting the equivocal and mixed OR-RADS labels. Human accuracy ranged from 74 to 93% across all classification tasks, performing better on single-class tasks. This study demonstrates the feasibility of the BERT NLP model in predicting disease response in cancer patients, exceeding human performance, and encourages the use of the standardized OR-RADS lexicon to improve large-scale prediction accuracy. MDPI 2023-10-10 /pmc/articles/PMC10605614/ /pubmed/37894276 http://dx.doi.org/10.3390/cancers15204909 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Elbatarny, Lydia
Do, Richard K. G.
Gangai, Natalie
Ahmed, Firas
Chhabra, Shalini
Simpson, Amber L.
Applying Natural Language Processing to Single-Report Prediction of Metastatic Disease Response Using the OR-RADS Lexicon
title Applying Natural Language Processing to Single-Report Prediction of Metastatic Disease Response Using the OR-RADS Lexicon
title_full Applying Natural Language Processing to Single-Report Prediction of Metastatic Disease Response Using the OR-RADS Lexicon
title_fullStr Applying Natural Language Processing to Single-Report Prediction of Metastatic Disease Response Using the OR-RADS Lexicon
title_full_unstemmed Applying Natural Language Processing to Single-Report Prediction of Metastatic Disease Response Using the OR-RADS Lexicon
title_short Applying Natural Language Processing to Single-Report Prediction of Metastatic Disease Response Using the OR-RADS Lexicon
title_sort applying natural language processing to single-report prediction of metastatic disease response using the or-rads lexicon
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10605614/
https://www.ncbi.nlm.nih.gov/pubmed/37894276
http://dx.doi.org/10.3390/cancers15204909
work_keys_str_mv AT elbatarnylydia applyingnaturallanguageprocessingtosinglereportpredictionofmetastaticdiseaseresponseusingtheorradslexicon
AT dorichardkg applyingnaturallanguageprocessingtosinglereportpredictionofmetastaticdiseaseresponseusingtheorradslexicon
AT gangainatalie applyingnaturallanguageprocessingtosinglereportpredictionofmetastaticdiseaseresponseusingtheorradslexicon
AT ahmedfiras applyingnaturallanguageprocessingtosinglereportpredictionofmetastaticdiseaseresponseusingtheorradslexicon
AT chhabrashalini applyingnaturallanguageprocessingtosinglereportpredictionofmetastaticdiseaseresponseusingtheorradslexicon
AT simpsonamberl applyingnaturallanguageprocessingtosinglereportpredictionofmetastaticdiseaseresponseusingtheorradslexicon