Cargando…

Event-Based Clinical Finding Extraction from Radiology Reports with Pre-trained Language Model

Radiology reports contain a diverse and rich set of clinical abnormalities documented by radiologists during their interpretation of the images. Comprehensive semantic representations of radiological findings would enable a wide range of secondary use applications to support diagnosis, triage, outco...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lau, Wilson, Lybarger, Kevin, Gunn, Martin L., Yetisgen, Meliha
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer International Publishing 2022
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9576130/ https://www.ncbi.nlm.nih.gov/pubmed/36253581 http://dx.doi.org/10.1007/s10278-022-00717-5

_version_	1784811464476852224
author	Lau, Wilson Lybarger, Kevin Gunn, Martin L. Yetisgen, Meliha
author_facet	Lau, Wilson Lybarger, Kevin Gunn, Martin L. Yetisgen, Meliha
author_sort	Lau, Wilson
collection	PubMed
description	Radiology reports contain a diverse and rich set of clinical abnormalities documented by radiologists during their interpretation of the images. Comprehensive semantic representations of radiological findings would enable a wide range of secondary use applications to support diagnosis, triage, outcomes prediction, and clinical research. In this paper, we present a new corpus of radiology reports annotated with clinical findings. Our annotation schema captures detailed representations of pathologic findings that are observable on imaging (“lesions”) and other types of clinical problems (“medical problems”). The schema used an event-based representation to capture fine-grained details, including assertion, anatomy, characteristics, size, and count. Our gold standard corpus contained a total of 500 annotated computed tomography (CT) reports. We extracted triggers and argument entities using two state-of-the-art deep learning architectures, including BERT. We then predicted the linkages between trigger and argument entities (referred to as argument roles) using a BERT-based relation extraction model. We achieved the best extraction performance using a BERT model pre-trained on 3 million radiology reports from our institution: 90.9–93.4% F1 for finding triggers and 72.0–85.6% F1 for argument roles. To assess model generalizability, we used an external validation set randomly sampled from the MIMIC Chest X-ray (MIMIC-CXR) database. The extraction performance on this validation set was 95.6% for finding triggers and 79.1–89.7% for argument roles, demonstrating that the model generalized well to the cross-institutional data with a different imaging modality. We extracted the finding events from all the radiology reports in the MIMIC-CXR database and provided the extractions to the research community.
format	Online Article Text
id	pubmed-9576130
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Springer International Publishing
record_format	MEDLINE/PubMed
spelling	pubmed-95761302022-10-18 Event-Based Clinical Finding Extraction from Radiology Reports with Pre-trained Language Model Lau, Wilson Lybarger, Kevin Gunn, Martin L. Yetisgen, Meliha J Digit Imaging Original Paper Radiology reports contain a diverse and rich set of clinical abnormalities documented by radiologists during their interpretation of the images. Comprehensive semantic representations of radiological findings would enable a wide range of secondary use applications to support diagnosis, triage, outcomes prediction, and clinical research. In this paper, we present a new corpus of radiology reports annotated with clinical findings. Our annotation schema captures detailed representations of pathologic findings that are observable on imaging (“lesions”) and other types of clinical problems (“medical problems”). The schema used an event-based representation to capture fine-grained details, including assertion, anatomy, characteristics, size, and count. Our gold standard corpus contained a total of 500 annotated computed tomography (CT) reports. We extracted triggers and argument entities using two state-of-the-art deep learning architectures, including BERT. We then predicted the linkages between trigger and argument entities (referred to as argument roles) using a BERT-based relation extraction model. We achieved the best extraction performance using a BERT model pre-trained on 3 million radiology reports from our institution: 90.9–93.4% F1 for finding triggers and 72.0–85.6% F1 for argument roles. To assess model generalizability, we used an external validation set randomly sampled from the MIMIC Chest X-ray (MIMIC-CXR) database. The extraction performance on this validation set was 95.6% for finding triggers and 79.1–89.7% for argument roles, demonstrating that the model generalized well to the cross-institutional data with a different imaging modality. We extracted the finding events from all the radiology reports in the MIMIC-CXR database and provided the extractions to the research community. Springer International Publishing 2022-10-17 2023-02 /pmc/articles/PMC9576130/ /pubmed/36253581 http://dx.doi.org/10.1007/s10278-022-00717-5 Text en © The Author(s) under exclusive licence to Society for Imaging Informatics in Medicine 2022, Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
spellingShingle	Original Paper Lau, Wilson Lybarger, Kevin Gunn, Martin L. Yetisgen, Meliha Event-Based Clinical Finding Extraction from Radiology Reports with Pre-trained Language Model
title	Event-Based Clinical Finding Extraction from Radiology Reports with Pre-trained Language Model
title_full	Event-Based Clinical Finding Extraction from Radiology Reports with Pre-trained Language Model
title_fullStr	Event-Based Clinical Finding Extraction from Radiology Reports with Pre-trained Language Model
title_full_unstemmed	Event-Based Clinical Finding Extraction from Radiology Reports with Pre-trained Language Model
title_short	Event-Based Clinical Finding Extraction from Radiology Reports with Pre-trained Language Model
title_sort	event-based clinical finding extraction from radiology reports with pre-trained language model
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9576130/ https://www.ncbi.nlm.nih.gov/pubmed/36253581 http://dx.doi.org/10.1007/s10278-022-00717-5
work_keys_str_mv	AT lauwilson eventbasedclinicalfindingextractionfromradiologyreportswithpretrainedlanguagemodel AT lybargerkevin eventbasedclinicalfindingextractionfromradiologyreportswithpretrainedlanguagemodel AT gunnmartinl eventbasedclinicalfindingextractionfromradiologyreportswithpretrainedlanguagemodel AT yetisgenmeliha eventbasedclinicalfindingextractionfromradiologyreportswithpretrainedlanguagemodel

Event-Based Clinical Finding Extraction from Radiology Reports with Pre-trained Language Model

Ejemplares similares