Cargando…
Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models
Understanding the diagnostic goal of medical reports is valuable information for understanding patient flows. This work focuses on extracting the reason for taking an MRI scan of Multiple Sclerosis (MS) patients using the attached free-form reports: Diagnosis, Progression or Monitoring. We investiga...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10093295/ https://www.ncbi.nlm.nih.gov/pubmed/37046469 http://dx.doi.org/10.3390/diagnostics13071251 |
_version_ | 1785023551440420864 |
---|---|
author | Rietberg, Max Tigo Nguyen, Van Bach Geerdink, Jeroen Vijlbrief, Onno Seifert, Christin |
author_facet | Rietberg, Max Tigo Nguyen, Van Bach Geerdink, Jeroen Vijlbrief, Onno Seifert, Christin |
author_sort | Rietberg, Max Tigo |
collection | PubMed |
description | Understanding the diagnostic goal of medical reports is valuable information for understanding patient flows. This work focuses on extracting the reason for taking an MRI scan of Multiple Sclerosis (MS) patients using the attached free-form reports: Diagnosis, Progression or Monitoring. We investigate the performance of domain-dependent and general state-of-the-art language models and their alignment with domain expertise. To this end, eXplainable Artificial Intelligence (XAI) techniques are used to acquire insight into the inner workings of the model, which are verified on their trustworthiness. The verified XAI explanations are then compared with explanations from a domain expert, to indirectly determine the reliability of the model. BERTje, a Dutch Bidirectional Encoder Representations from Transformers (BERT) model, outperforms RobBERT and MedRoBERTa.nl in both accuracy and reliability. The latter model (MedRoBERTa.nl) is a domain-specific model, while BERTje is a generic model, showing that domain-specific models are not always superior. Our validation of BERTje in a small prospective study shows promising results for the potential uptake of the model in a practical setting. |
format | Online Article Text |
id | pubmed-10093295 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-100932952023-04-13 Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models Rietberg, Max Tigo Nguyen, Van Bach Geerdink, Jeroen Vijlbrief, Onno Seifert, Christin Diagnostics (Basel) Article Understanding the diagnostic goal of medical reports is valuable information for understanding patient flows. This work focuses on extracting the reason for taking an MRI scan of Multiple Sclerosis (MS) patients using the attached free-form reports: Diagnosis, Progression or Monitoring. We investigate the performance of domain-dependent and general state-of-the-art language models and their alignment with domain expertise. To this end, eXplainable Artificial Intelligence (XAI) techniques are used to acquire insight into the inner workings of the model, which are verified on their trustworthiness. The verified XAI explanations are then compared with explanations from a domain expert, to indirectly determine the reliability of the model. BERTje, a Dutch Bidirectional Encoder Representations from Transformers (BERT) model, outperforms RobBERT and MedRoBERTa.nl in both accuracy and reliability. The latter model (MedRoBERTa.nl) is a domain-specific model, while BERTje is a generic model, showing that domain-specific models are not always superior. Our validation of BERTje in a small prospective study shows promising results for the potential uptake of the model in a practical setting. MDPI 2023-03-27 /pmc/articles/PMC10093295/ /pubmed/37046469 http://dx.doi.org/10.3390/diagnostics13071251 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Rietberg, Max Tigo Nguyen, Van Bach Geerdink, Jeroen Vijlbrief, Onno Seifert, Christin Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models |
title | Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models |
title_full | Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models |
title_fullStr | Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models |
title_full_unstemmed | Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models |
title_short | Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models |
title_sort | accurate and reliable classification of unstructured reports on their diagnostic goal using bert models |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10093295/ https://www.ncbi.nlm.nih.gov/pubmed/37046469 http://dx.doi.org/10.3390/diagnostics13071251 |
work_keys_str_mv | AT rietbergmaxtigo accurateandreliableclassificationofunstructuredreportsontheirdiagnosticgoalusingbertmodels AT nguyenvanbach accurateandreliableclassificationofunstructuredreportsontheirdiagnosticgoalusingbertmodels AT geerdinkjeroen accurateandreliableclassificationofunstructuredreportsontheirdiagnosticgoalusingbertmodels AT vijlbriefonno accurateandreliableclassificationofunstructuredreportsontheirdiagnosticgoalusingbertmodels AT seifertchristin accurateandreliableclassificationofunstructuredreportsontheirdiagnosticgoalusingbertmodels |