Cargando…

Weakly supervised spatial relation extraction from radiology reports

OBJECTIVE: Weak supervision holds significant promise to improve clinical natural language processing by leveraging domain resources and expertise instead of large manually annotated datasets alone. Here, our objective is to evaluate a weak supervision approach to extract spatial information from ra...

Descripción completa

Detalles Bibliográficos
Autores principales: Datta, Surabhi, Roberts, Kirk
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10122604/
https://www.ncbi.nlm.nih.gov/pubmed/37096148
http://dx.doi.org/10.1093/jamiaopen/ooad027
_version_ 1785029527285530624
author Datta, Surabhi
Roberts, Kirk
author_facet Datta, Surabhi
Roberts, Kirk
author_sort Datta, Surabhi
collection PubMed
description OBJECTIVE: Weak supervision holds significant promise to improve clinical natural language processing by leveraging domain resources and expertise instead of large manually annotated datasets alone. Here, our objective is to evaluate a weak supervision approach to extract spatial information from radiology reports. MATERIALS AND METHODS: Our weak supervision approach is based on data programming that uses rules (or labeling functions) relying on domain-specific dictionaries and radiology language characteristics to generate weak labels. The labels correspond to different spatial relations that are critical to understanding radiology reports. These weak labels are then used to fine-tune a pretrained Bidirectional Encoder Representations from Transformers (BERT) model. RESULTS: Our weakly supervised BERT model provided satisfactory results in extracting spatial relations without manual annotations for training (spatial trigger F1: 72.89, relation F1: 52.47). When this model is further fine-tuned on manual annotations (relation F1: 68.76), performance surpasses the fully supervised state-of-the-art. DISCUSSION: To our knowledge, this is the first work to automatically create detailed weak labels corresponding to radiological information of clinical significance. Our data programming approach is (1) adaptable as the labeling functions can be updated with relatively little manual effort to incorporate more variations in radiology language reporting formats and (2) generalizable as these functions can be applied across multiple radiology subdomains in most cases. CONCLUSIONS: We demonstrate a weakly supervision model performs sufficiently well in identifying a variety of relations from radiology text without manual annotations, while exceeding state-of-the-art results when annotated data are available.
format Online
Article
Text
id pubmed-10122604
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101226042023-04-23 Weakly supervised spatial relation extraction from radiology reports Datta, Surabhi Roberts, Kirk JAMIA Open Research and Applications OBJECTIVE: Weak supervision holds significant promise to improve clinical natural language processing by leveraging domain resources and expertise instead of large manually annotated datasets alone. Here, our objective is to evaluate a weak supervision approach to extract spatial information from radiology reports. MATERIALS AND METHODS: Our weak supervision approach is based on data programming that uses rules (or labeling functions) relying on domain-specific dictionaries and radiology language characteristics to generate weak labels. The labels correspond to different spatial relations that are critical to understanding radiology reports. These weak labels are then used to fine-tune a pretrained Bidirectional Encoder Representations from Transformers (BERT) model. RESULTS: Our weakly supervised BERT model provided satisfactory results in extracting spatial relations without manual annotations for training (spatial trigger F1: 72.89, relation F1: 52.47). When this model is further fine-tuned on manual annotations (relation F1: 68.76), performance surpasses the fully supervised state-of-the-art. DISCUSSION: To our knowledge, this is the first work to automatically create detailed weak labels corresponding to radiological information of clinical significance. Our data programming approach is (1) adaptable as the labeling functions can be updated with relatively little manual effort to incorporate more variations in radiology language reporting formats and (2) generalizable as these functions can be applied across multiple radiology subdomains in most cases. CONCLUSIONS: We demonstrate a weakly supervision model performs sufficiently well in identifying a variety of relations from radiology text without manual annotations, while exceeding state-of-the-art results when annotated data are available. Oxford University Press 2023-04-22 /pmc/articles/PMC10122604/ /pubmed/37096148 http://dx.doi.org/10.1093/jamiaopen/ooad027 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research and Applications
Datta, Surabhi
Roberts, Kirk
Weakly supervised spatial relation extraction from radiology reports
title Weakly supervised spatial relation extraction from radiology reports
title_full Weakly supervised spatial relation extraction from radiology reports
title_fullStr Weakly supervised spatial relation extraction from radiology reports
title_full_unstemmed Weakly supervised spatial relation extraction from radiology reports
title_short Weakly supervised spatial relation extraction from radiology reports
title_sort weakly supervised spatial relation extraction from radiology reports
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10122604/
https://www.ncbi.nlm.nih.gov/pubmed/37096148
http://dx.doi.org/10.1093/jamiaopen/ooad027
work_keys_str_mv AT dattasurabhi weaklysupervisedspatialrelationextractionfromradiologyreports
AT robertskirk weaklysupervisedspatialrelationextractionfromradiologyreports