Finding relevant free-text radiology reports at scale with IBM Watson Content Analytics: a feasibility study in the UK NHS

Bibliographic Details
Main authors: Piotrkowicz, Alicja; Johnson, Owen; Hall, Geoff
Format: Online Article Text
Language: English
Published: BioMed Central 2019
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6849164/
https://www.ncbi.nlm.nih.gov/pubmed/31711538
http://dx.doi.org/10.1186/s13326-019-0213-5
author Piotrkowicz, Alicja
Johnson, Owen
Hall, Geoff
collection PubMed
description BACKGROUND: Significant amounts of health data are stored as free-text within clinical reports, letters, discharge summaries and notes. Busy clinicians have limited time to read such large amounts of free-text and are at risk of information overload and, consequently, of missing information vital to patient care. Automatically identifying relevant information at the point of care has the potential to reduce these risks but represents a considerable research challenge. One software solution that has been proposed in industry is the IBM Watson analytics suite, which includes rule-based analytics capable of processing large document collections at scale. RESULTS: In this paper we present an overview of IBM Watson Content Analytics and a feasibility study using Content Analytics with a large-scale corpus of clinical free-text reports within a UK National Health Service (NHS) context. We created dictionaries and rules for identifying positive incidence of hydronephrosis and brain metastasis from 5.6 million radiology reports and achieved 94% precision and 95% recall for hydronephrosis, and 89% precision and 94% recall for brain metastasis, on a sample of manually annotated reports. With minor changes for US English, we applied the same rule set to an open access corpus of 0.5 million radiology reports from a US hospital and achieved 93% precision and 94% recall for hydronephrosis, and 84% precision and 88% recall for brain metastasis. CONCLUSIONS: We were able to implement IBM Watson within a UK NHS context and demonstrate effective results that could provide clinicians with an automatic safety net that highlights clinically important information within free-text documents. Our results suggest that currently available technologies such as IBM Watson Content Analytics already have the potential to address information overload and improve clinical safety, and that solutions developed in one hospital and country may be transportable to different hospitals and countries.
Our study was limited to exploring technical aspects of the feasibility of one industry solution, and we recognise that healthcare text analytics research is a fast-moving field. That said, we believe our study suggests that text analytics is sufficiently advanced to be implemented within industry solutions that can improve clinical safety.
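The description above refers to dictionaries and rules for flagging positive findings, evaluated by precision and recall against manually annotated reports. The following is a minimal Python sketch of that general idea only. It is not IBM Watson Content Analytics and does not reproduce the authors' rule set: the term list, the negation cues, and the function names `is_positive` and `precision_recall` are all hypothetical and chosen purely for illustration.

```python
import re

# Hypothetical dictionary of surface forms for the target finding (assumption).
FINDING_TERMS = ["hydronephrosis"]

# Hypothetical negation cues (assumption); a real rule set would be far larger.
NEGATION_CUES = ["no", "without", "no evidence of", "negative for"]


def is_positive(report: str) -> bool:
    """Return True if some sentence mentions a finding term that is not
    preceded, within the same sentence, by a negation cue."""
    for sentence in re.split(r"[.\n]+", report.lower()):
        for term in FINDING_TERMS:
            idx = sentence.find(term)
            if idx == -1:
                continue
            prefix = sentence[:idx]
            negated = any(
                re.search(r"\b" + re.escape(cue) + r"\b", prefix)
                for cue in NEGATION_CUES
            )
            if not negated:
                return True
    return False


def precision_recall(predicted, gold):
    """Compute (precision, recall) from parallel lists of booleans."""
    tp = sum(1 for p, g in zip(predicted, gold) if p and g)
    fp = sum(1 for p, g in zip(predicted, gold) if p and not g)
    fn = sum(1 for p, g in zip(predicted, gold) if not p and g)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall
```

For example, `is_positive("There is moderate left hydronephrosis.")` is true, while `is_positive("No evidence of hydronephrosis.")` is false. A sketch this simple treats any negation cue earlier in the same sentence as negating the finding, so it will miss positives in sentences that negate something unrelated; handling such cases is the kind of work a fuller rule set would need to do.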
format Online
Article
Text
id pubmed-6849164
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-6849164 2019-11-15 Finding relevant free-text radiology reports at scale with IBM Watson Content Analytics: a feasibility study in the UK NHS Piotrkowicz, Alicja Johnson, Owen Hall, Geoff J Biomed Semantics Research BioMed Central 2019-11-12 /pmc/articles/PMC6849164/ /pubmed/31711538 http://dx.doi.org/10.1186/s13326-019-0213-5 Text en © The Author(s). 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title Finding relevant free-text radiology reports at scale with IBM Watson Content Analytics: a feasibility study in the UK NHS
topic Research