Cargando…

Acute stroke CDS: automatic retrieval of thrombolysis contraindications from unstructured clinical letters

INTRODUCTION: Thrombolysis treatment for acute ischaemic stroke can lead to better outcomes if administered early enough. However, contraindications exist which put the patient at greater risk of a bleed (e.g. recent major surgery, anticoagulant medication). Therefore, clinicians must check a patien...

Descripción completa

Detalles Bibliográficos
Autores principales: Cutforth, Murray, Watson, Hannah, Brown, Cameron, Wang, Chaoyang, Thomson, Stuart, Fell, Dickon, Dilys, Vismantas, Scrimgeour, Morag, Schrempf, Patrick, Lesh, James, Muir, Keith, Weir, Alexander, O’Neil, Alison Q
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10305776/
https://www.ncbi.nlm.nih.gov/pubmed/37388253
http://dx.doi.org/10.3389/fdgth.2023.1186516
_version_ 1785065810754011136
author Cutforth, Murray
Watson, Hannah
Brown, Cameron
Wang, Chaoyang
Thomson, Stuart
Fell, Dickon
Dilys, Vismantas
Scrimgeour, Morag
Schrempf, Patrick
Lesh, James
Muir, Keith
Weir, Alexander
O’Neil, Alison Q
author_facet Cutforth, Murray
Watson, Hannah
Brown, Cameron
Wang, Chaoyang
Thomson, Stuart
Fell, Dickon
Dilys, Vismantas
Scrimgeour, Morag
Schrempf, Patrick
Lesh, James
Muir, Keith
Weir, Alexander
O’Neil, Alison Q
author_sort Cutforth, Murray
collection PubMed
description INTRODUCTION: Thrombolysis treatment for acute ischaemic stroke can lead to better outcomes if administered early enough. However, contraindications exist which put the patient at greater risk of a bleed (e.g. recent major surgery, anticoagulant medication). Therefore, clinicians must check a patient's past medical history before proceeding with treatment. In this work we present a machine learning approach for accurate automatic detection of this information in unstructured text documents such as discharge letters or referral letters, to support the clinician in making a decision about whether to administer thrombolysis. METHODS: We consulted local and national guidelines for thrombolysis eligibility, identifying 86 entities which are relevant to the thrombolysis decision. A total of 8,067 documents from 2,912 patients were manually annotated with these entities by medical students and clinicians. Using this data, we trained and validated several transformer-based named entity recognition (NER) models, focusing on transformer models which have been pre-trained on a biomedical corpus as these have shown most promise in the biomedical NER literature. RESULTS: Our best model was a PubMedBERT-based approach, which obtained a lenient micro/macro F1 score of 0.829/0.723. Ensembling 5 variants of this model gave a significant boost to precision, obtaining micro/macro F1 of 0.846/0.734 which approaches the human annotator performance of 0.847/0.839. We further propose numeric definitions for the concepts of name regularity (similarity of all spans which refer to an entity) and context regularity (similarity of all context surrounding mentions of an entity), using these to analyse the types of errors made by the system and finding that the name regularity of an entity is a stronger predictor of model performance than raw training set frequency. DISCUSSION: Overall, this work shows the potential of machine learning to provide clinical decision support (CDS) for the time-critical decision of thrombolysis administration in ischaemic stroke by quickly surfacing relevant information, leading to prompt treatment and hence to better patient outcomes.
format Online
Article
Text
id pubmed-10305776
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-103057762023-06-29 Acute stroke CDS: automatic retrieval of thrombolysis contraindications from unstructured clinical letters Cutforth, Murray Watson, Hannah Brown, Cameron Wang, Chaoyang Thomson, Stuart Fell, Dickon Dilys, Vismantas Scrimgeour, Morag Schrempf, Patrick Lesh, James Muir, Keith Weir, Alexander O’Neil, Alison Q Front Digit Health Digital Health INTRODUCTION: Thrombolysis treatment for acute ischaemic stroke can lead to better outcomes if administered early enough. However, contraindications exist which put the patient at greater risk of a bleed (e.g. recent major surgery, anticoagulant medication). Therefore, clinicians must check a patient's past medical history before proceeding with treatment. In this work we present a machine learning approach for accurate automatic detection of this information in unstructured text documents such as discharge letters or referral letters, to support the clinician in making a decision about whether to administer thrombolysis. METHODS: We consulted local and national guidelines for thrombolysis eligibility, identifying 86 entities which are relevant to the thrombolysis decision. A total of 8,067 documents from 2,912 patients were manually annotated with these entities by medical students and clinicians. Using this data, we trained and validated several transformer-based named entity recognition (NER) models, focusing on transformer models which have been pre-trained on a biomedical corpus as these have shown most promise in the biomedical NER literature. RESULTS: Our best model was a PubMedBERT-based approach, which obtained a lenient micro/macro F1 score of 0.829/0.723. Ensembling 5 variants of this model gave a significant boost to precision, obtaining micro/macro F1 of 0.846/0.734 which approaches the human annotator performance of 0.847/0.839. We further propose numeric definitions for the concepts of name regularity (similarity of all spans which refer to an entity) and context regularity (similarity of all context surrounding mentions of an entity), using these to analyse the types of errors made by the system and finding that the name regularity of an entity is a stronger predictor of model performance than raw training set frequency. DISCUSSION: Overall, this work shows the potential of machine learning to provide clinical decision support (CDS) for the time-critical decision of thrombolysis administration in ischaemic stroke by quickly surfacing relevant information, leading to prompt treatment and hence to better patient outcomes. Frontiers Media S.A. 2023-06-14 /pmc/articles/PMC10305776/ /pubmed/37388253 http://dx.doi.org/10.3389/fdgth.2023.1186516 Text en © 2023 Cutforth, Watson, Brown, Wang, Thomson, Fell, Dilys, Scrimgeour, Schrempf, Lesh, Muir, Weir and O'Neil. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY) (https://creativecommons.org/licenses/by/4.0/) . The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Digital Health
Cutforth, Murray
Watson, Hannah
Brown, Cameron
Wang, Chaoyang
Thomson, Stuart
Fell, Dickon
Dilys, Vismantas
Scrimgeour, Morag
Schrempf, Patrick
Lesh, James
Muir, Keith
Weir, Alexander
O’Neil, Alison Q
Acute stroke CDS: automatic retrieval of thrombolysis contraindications from unstructured clinical letters
title Acute stroke CDS: automatic retrieval of thrombolysis contraindications from unstructured clinical letters
title_full Acute stroke CDS: automatic retrieval of thrombolysis contraindications from unstructured clinical letters
title_fullStr Acute stroke CDS: automatic retrieval of thrombolysis contraindications from unstructured clinical letters
title_full_unstemmed Acute stroke CDS: automatic retrieval of thrombolysis contraindications from unstructured clinical letters
title_short Acute stroke CDS: automatic retrieval of thrombolysis contraindications from unstructured clinical letters
title_sort acute stroke cds: automatic retrieval of thrombolysis contraindications from unstructured clinical letters
topic Digital Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10305776/
https://www.ncbi.nlm.nih.gov/pubmed/37388253
http://dx.doi.org/10.3389/fdgth.2023.1186516
work_keys_str_mv AT cutforthmurray acutestrokecdsautomaticretrievalofthrombolysiscontraindicationsfromunstructuredclinicalletters
AT watsonhannah acutestrokecdsautomaticretrievalofthrombolysiscontraindicationsfromunstructuredclinicalletters
AT browncameron acutestrokecdsautomaticretrievalofthrombolysiscontraindicationsfromunstructuredclinicalletters
AT wangchaoyang acutestrokecdsautomaticretrievalofthrombolysiscontraindicationsfromunstructuredclinicalletters
AT thomsonstuart acutestrokecdsautomaticretrievalofthrombolysiscontraindicationsfromunstructuredclinicalletters
AT felldickon acutestrokecdsautomaticretrievalofthrombolysiscontraindicationsfromunstructuredclinicalletters
AT dilysvismantas acutestrokecdsautomaticretrievalofthrombolysiscontraindicationsfromunstructuredclinicalletters
AT scrimgeourmorag acutestrokecdsautomaticretrievalofthrombolysiscontraindicationsfromunstructuredclinicalletters
AT schrempfpatrick acutestrokecdsautomaticretrievalofthrombolysiscontraindicationsfromunstructuredclinicalletters
AT leshjames acutestrokecdsautomaticretrievalofthrombolysiscontraindicationsfromunstructuredclinicalletters
AT muirkeith acutestrokecdsautomaticretrievalofthrombolysiscontraindicationsfromunstructuredclinicalletters
AT weiralexander acutestrokecdsautomaticretrievalofthrombolysiscontraindicationsfromunstructuredclinicalletters
AT oneilalisonq acutestrokecdsautomaticretrievalofthrombolysiscontraindicationsfromunstructuredclinicalletters