Cargando…
Applying Natural Language Processing to Single-Report Prediction of Metastatic Disease Response Using the OR-RADS Lexicon
SIMPLE SUMMARY: Lack of standardization among radiologists in writing radiological reports impacts the ability to interpret cancer response to treatment at a large-scale. This is an issue since large-scale data collection is necessary to generate Real World Evidence (RWE) towards understanding the e...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10605614/ https://www.ncbi.nlm.nih.gov/pubmed/37894276 http://dx.doi.org/10.3390/cancers15204909 |
_version_ | 1785127116145164288 |
---|---|
author | Elbatarny, Lydia Do, Richard K. G. Gangai, Natalie Ahmed, Firas Chhabra, Shalini Simpson, Amber L. |
author_facet | Elbatarny, Lydia Do, Richard K. G. Gangai, Natalie Ahmed, Firas Chhabra, Shalini Simpson, Amber L. |
author_sort | Elbatarny, Lydia |
collection | PubMed |
description | SIMPLE SUMMARY: Lack of standardization among radiologists in writing radiological reports impacts the ability to interpret cancer response to treatment at a large-scale. This is an issue since large-scale data collection is necessary to generate Real World Evidence (RWE) towards understanding the effectiveness of cancer treatments and developing personalized patient treatment decisions. This study aims to examine the utility of applying natural language processing (NLP) for large-scale interpretation of disease response using the standardized oncologic response categories known as the OR-RADS to facilitate RWE collection. This study demonstrates the feasibility of applying NLP to predict disease response in cancer patients, exceeding human performance, thus encouraging use of the standardized OR-RADS categories among radiologists and researchers to improve large-scale response prediction accuracy. ABSTRACT: Generating Real World Evidence (RWE) on disease responses from radiological reports is important for understanding cancer treatment effectiveness and developing personalized treatment. A lack of standardization in reporting among radiologists impacts the feasibility of large-scale interpretation of disease response. This study examines the utility of applying natural language processing (NLP) to the large-scale interpretation of disease responses using a standardized oncologic response lexicon (OR-RADS) to facilitate RWE collection. Radiologists annotated 3503 retrospectively collected clinical impressions from radiological reports across several cancer types with one of seven OR-RADS categories. A Bidirectional Encoder Representations from Transformers (BERT) model was trained on this dataset with an 80–20% train/test split to perform multiclass and single-class classification tasks using the OR-RADS. Radiologists also performed the classification to compare human and model performance. The model achieved accuracies from 95 to 99% across all classification tasks, performing better in single-class tasks compared to the multiclass task and producing minimal misclassifications, which pertained mostly to overpredicting the equivocal and mixed OR-RADS labels. Human accuracy ranged from 74 to 93% across all classification tasks, performing better on single-class tasks. This study demonstrates the feasibility of the BERT NLP model in predicting disease response in cancer patients, exceeding human performance, and encourages the use of the standardized OR-RADS lexicon to improve large-scale prediction accuracy. |
format | Online Article Text |
id | pubmed-10605614 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-106056142023-10-28 Applying Natural Language Processing to Single-Report Prediction of Metastatic Disease Response Using the OR-RADS Lexicon Elbatarny, Lydia Do, Richard K. G. Gangai, Natalie Ahmed, Firas Chhabra, Shalini Simpson, Amber L. Cancers (Basel) Article SIMPLE SUMMARY: Lack of standardization among radiologists in writing radiological reports impacts the ability to interpret cancer response to treatment at a large-scale. This is an issue since large-scale data collection is necessary to generate Real World Evidence (RWE) towards understanding the effectiveness of cancer treatments and developing personalized patient treatment decisions. This study aims to examine the utility of applying natural language processing (NLP) for large-scale interpretation of disease response using the standardized oncologic response categories known as the OR-RADS to facilitate RWE collection. This study demonstrates the feasibility of applying NLP to predict disease response in cancer patients, exceeding human performance, thus encouraging use of the standardized OR-RADS categories among radiologists and researchers to improve large-scale response prediction accuracy. ABSTRACT: Generating Real World Evidence (RWE) on disease responses from radiological reports is important for understanding cancer treatment effectiveness and developing personalized treatment. A lack of standardization in reporting among radiologists impacts the feasibility of large-scale interpretation of disease response. This study examines the utility of applying natural language processing (NLP) to the large-scale interpretation of disease responses using a standardized oncologic response lexicon (OR-RADS) to facilitate RWE collection. Radiologists annotated 3503 retrospectively collected clinical impressions from radiological reports across several cancer types with one of seven OR-RADS categories. A Bidirectional Encoder Representations from Transformers (BERT) model was trained on this dataset with an 80–20% train/test split to perform multiclass and single-class classification tasks using the OR-RADS. Radiologists also performed the classification to compare human and model performance. The model achieved accuracies from 95 to 99% across all classification tasks, performing better in single-class tasks compared to the multiclass task and producing minimal misclassifications, which pertained mostly to overpredicting the equivocal and mixed OR-RADS labels. Human accuracy ranged from 74 to 93% across all classification tasks, performing better on single-class tasks. This study demonstrates the feasibility of the BERT NLP model in predicting disease response in cancer patients, exceeding human performance, and encourages the use of the standardized OR-RADS lexicon to improve large-scale prediction accuracy. MDPI 2023-10-10 /pmc/articles/PMC10605614/ /pubmed/37894276 http://dx.doi.org/10.3390/cancers15204909 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Elbatarny, Lydia Do, Richard K. G. Gangai, Natalie Ahmed, Firas Chhabra, Shalini Simpson, Amber L. Applying Natural Language Processing to Single-Report Prediction of Metastatic Disease Response Using the OR-RADS Lexicon |
title | Applying Natural Language Processing to Single-Report Prediction of Metastatic Disease Response Using the OR-RADS Lexicon |
title_full | Applying Natural Language Processing to Single-Report Prediction of Metastatic Disease Response Using the OR-RADS Lexicon |
title_fullStr | Applying Natural Language Processing to Single-Report Prediction of Metastatic Disease Response Using the OR-RADS Lexicon |
title_full_unstemmed | Applying Natural Language Processing to Single-Report Prediction of Metastatic Disease Response Using the OR-RADS Lexicon |
title_short | Applying Natural Language Processing to Single-Report Prediction of Metastatic Disease Response Using the OR-RADS Lexicon |
title_sort | applying natural language processing to single-report prediction of metastatic disease response using the or-rads lexicon |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10605614/ https://www.ncbi.nlm.nih.gov/pubmed/37894276 http://dx.doi.org/10.3390/cancers15204909 |
work_keys_str_mv | AT elbatarnylydia applyingnaturallanguageprocessingtosinglereportpredictionofmetastaticdiseaseresponseusingtheorradslexicon AT dorichardkg applyingnaturallanguageprocessingtosinglereportpredictionofmetastaticdiseaseresponseusingtheorradslexicon AT gangainatalie applyingnaturallanguageprocessingtosinglereportpredictionofmetastaticdiseaseresponseusingtheorradslexicon AT ahmedfiras applyingnaturallanguageprocessingtosinglereportpredictionofmetastaticdiseaseresponseusingtheorradslexicon AT chhabrashalini applyingnaturallanguageprocessingtosinglereportpredictionofmetastaticdiseaseresponseusingtheorradslexicon AT simpsonamberl applyingnaturallanguageprocessingtosinglereportpredictionofmetastaticdiseaseresponseusingtheorradslexicon |