Cargando…

Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning

Unstructured and semi-structured radiology reports represent an underutilized trove of information for machine learning (ML)-based clinical informatics applications, including abnormality tracking systems, research cohort identification, point-of-care summarization, semi-automated report writing, an...

Descripción completa

Detalles Bibliográficos
Autores principales: Steinkamp, Jackson M., Chambers, Charles, Lalevic, Darco, Zafar, Hanna M., Cook, Tessa S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6646440/
https://www.ncbi.nlm.nih.gov/pubmed/31218554
http://dx.doi.org/10.1007/s10278-019-00234-y
_version_ 1783437561415335936
author Steinkamp, Jackson M.
Chambers, Charles
Lalevic, Darco
Zafar, Hanna M.
Cook, Tessa S.
author_facet Steinkamp, Jackson M.
Chambers, Charles
Lalevic, Darco
Zafar, Hanna M.
Cook, Tessa S.
author_sort Steinkamp, Jackson M.
collection PubMed
description Unstructured and semi-structured radiology reports represent an underutilized trove of information for machine learning (ML)-based clinical informatics applications, including abnormality tracking systems, research cohort identification, point-of-care summarization, semi-automated report writing, and as a source of weak data labels for training image processing systems. Clinical ML systems must be interpretable to ensure user trust. To create interpretable models applicable to all of these tasks, we can build general-purpose systems which extract all relevant human-level assertions or “facts” documented in reports; identifying these facts is an information extraction (IE) task. Previous IE work in radiology has focused on a limited set of information, and extracts isolated entities (i.e., single words such as “lesion” or “cyst”) rather than complete facts, which require the linking of multiple entities and modifiers. Here, we develop a prototype system to extract all useful information in abdominopelvic radiology reports (findings, recommendations, clinical history, procedures, imaging indications and limitations, etc.), in the form of complete, contextualized facts. We construct an information schema to capture the bulk of information in reports, develop real-time ML models to extract this information, and demonstrate the feasibility and performance of the system.
format Online
Article
Text
id pubmed-6646440
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-66464402019-08-06 Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning Steinkamp, Jackson M. Chambers, Charles Lalevic, Darco Zafar, Hanna M. Cook, Tessa S. J Digit Imaging Article Unstructured and semi-structured radiology reports represent an underutilized trove of information for machine learning (ML)-based clinical informatics applications, including abnormality tracking systems, research cohort identification, point-of-care summarization, semi-automated report writing, and as a source of weak data labels for training image processing systems. Clinical ML systems must be interpretable to ensure user trust. To create interpretable models applicable to all of these tasks, we can build general-purpose systems which extract all relevant human-level assertions or “facts” documented in reports; identifying these facts is an information extraction (IE) task. Previous IE work in radiology has focused on a limited set of information, and extracts isolated entities (i.e., single words such as “lesion” or “cyst”) rather than complete facts, which require the linking of multiple entities and modifiers. Here, we develop a prototype system to extract all useful information in abdominopelvic radiology reports (findings, recommendations, clinical history, procedures, imaging indications and limitations, etc.), in the form of complete, contextualized facts. We construct an information schema to capture the bulk of information in reports, develop real-time ML models to extract this information, and demonstrate the feasibility and performance of the system. Springer International Publishing 2019-06-19 2019-08 /pmc/articles/PMC6646440/ /pubmed/31218554 http://dx.doi.org/10.1007/s10278-019-00234-y Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Article
Steinkamp, Jackson M.
Chambers, Charles
Lalevic, Darco
Zafar, Hanna M.
Cook, Tessa S.
Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning
title Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning
title_full Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning
title_fullStr Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning
title_full_unstemmed Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning
title_short Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning
title_sort toward complete structured information extraction from radiology reports using machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6646440/
https://www.ncbi.nlm.nih.gov/pubmed/31218554
http://dx.doi.org/10.1007/s10278-019-00234-y
work_keys_str_mv AT steinkampjacksonm towardcompletestructuredinformationextractionfromradiologyreportsusingmachinelearning
AT chamberscharles towardcompletestructuredinformationextractionfromradiologyreportsusingmachinelearning
AT lalevicdarco towardcompletestructuredinformationextractionfromradiologyreportsusingmachinelearning
AT zafarhannam towardcompletestructuredinformationextractionfromradiologyreportsusingmachinelearning
AT cooktessas towardcompletestructuredinformationextractionfromradiologyreportsusingmachinelearning