Cargando…

Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models

Understanding the diagnostic goal of medical reports is valuable information for understanding patient flows. This work focuses on extracting the reason for taking an MRI scan of Multiple Sclerosis (MS) patients using the attached free-form reports: Diagnosis, Progression or Monitoring. We investiga...

Descripción completa

Detalles Bibliográficos
Autores principales: Rietberg, Max Tigo, Nguyen, Van Bach, Geerdink, Jeroen, Vijlbrief, Onno, Seifert, Christin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10093295/
https://www.ncbi.nlm.nih.gov/pubmed/37046469
http://dx.doi.org/10.3390/diagnostics13071251
_version_ 1785023551440420864
author Rietberg, Max Tigo
Nguyen, Van Bach
Geerdink, Jeroen
Vijlbrief, Onno
Seifert, Christin
author_facet Rietberg, Max Tigo
Nguyen, Van Bach
Geerdink, Jeroen
Vijlbrief, Onno
Seifert, Christin
author_sort Rietberg, Max Tigo
collection PubMed
description Understanding the diagnostic goal of medical reports is valuable information for understanding patient flows. This work focuses on extracting the reason for taking an MRI scan of Multiple Sclerosis (MS) patients using the attached free-form reports: Diagnosis, Progression or Monitoring. We investigate the performance of domain-dependent and general state-of-the-art language models and their alignment with domain expertise. To this end, eXplainable Artificial Intelligence (XAI) techniques are used to acquire insight into the inner workings of the model, which are verified on their trustworthiness. The verified XAI explanations are then compared with explanations from a domain expert, to indirectly determine the reliability of the model. BERTje, a Dutch Bidirectional Encoder Representations from Transformers (BERT) model, outperforms RobBERT and MedRoBERTa.nl in both accuracy and reliability. The latter model (MedRoBERTa.nl) is a domain-specific model, while BERTje is a generic model, showing that domain-specific models are not always superior. Our validation of BERTje in a small prospective study shows promising results for the potential uptake of the model in a practical setting.
format Online
Article
Text
id pubmed-10093295
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100932952023-04-13 Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models Rietberg, Max Tigo Nguyen, Van Bach Geerdink, Jeroen Vijlbrief, Onno Seifert, Christin Diagnostics (Basel) Article Understanding the diagnostic goal of medical reports is valuable information for understanding patient flows. This work focuses on extracting the reason for taking an MRI scan of Multiple Sclerosis (MS) patients using the attached free-form reports: Diagnosis, Progression or Monitoring. We investigate the performance of domain-dependent and general state-of-the-art language models and their alignment with domain expertise. To this end, eXplainable Artificial Intelligence (XAI) techniques are used to acquire insight into the inner workings of the model, which are verified on their trustworthiness. The verified XAI explanations are then compared with explanations from a domain expert, to indirectly determine the reliability of the model. BERTje, a Dutch Bidirectional Encoder Representations from Transformers (BERT) model, outperforms RobBERT and MedRoBERTa.nl in both accuracy and reliability. The latter model (MedRoBERTa.nl) is a domain-specific model, while BERTje is a generic model, showing that domain-specific models are not always superior. Our validation of BERTje in a small prospective study shows promising results for the potential uptake of the model in a practical setting. MDPI 2023-03-27 /pmc/articles/PMC10093295/ /pubmed/37046469 http://dx.doi.org/10.3390/diagnostics13071251 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Rietberg, Max Tigo
Nguyen, Van Bach
Geerdink, Jeroen
Vijlbrief, Onno
Seifert, Christin
Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models
title Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models
title_full Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models
title_fullStr Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models
title_full_unstemmed Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models
title_short Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models
title_sort accurate and reliable classification of unstructured reports on their diagnostic goal using bert models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10093295/
https://www.ncbi.nlm.nih.gov/pubmed/37046469
http://dx.doi.org/10.3390/diagnostics13071251
work_keys_str_mv AT rietbergmaxtigo accurateandreliableclassificationofunstructuredreportsontheirdiagnosticgoalusingbertmodels
AT nguyenvanbach accurateandreliableclassificationofunstructuredreportsontheirdiagnosticgoalusingbertmodels
AT geerdinkjeroen accurateandreliableclassificationofunstructuredreportsontheirdiagnosticgoalusingbertmodels
AT vijlbriefonno accurateandreliableclassificationofunstructuredreportsontheirdiagnosticgoalusingbertmodels
AT seifertchristin accurateandreliableclassificationofunstructuredreportsontheirdiagnosticgoalusingbertmodels