Cargando…
Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning
Unstructured and semi-structured radiology reports represent an underutilized trove of information for machine learning (ML)-based clinical informatics applications, including abnormality tracking systems, research cohort identification, point-of-care summarization, semi-automated report writing, an...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6646440/ https://www.ncbi.nlm.nih.gov/pubmed/31218554 http://dx.doi.org/10.1007/s10278-019-00234-y |
_version_ | 1783437561415335936 |
---|---|
author | Steinkamp, Jackson M. Chambers, Charles Lalevic, Darco Zafar, Hanna M. Cook, Tessa S. |
author_facet | Steinkamp, Jackson M. Chambers, Charles Lalevic, Darco Zafar, Hanna M. Cook, Tessa S. |
author_sort | Steinkamp, Jackson M. |
collection | PubMed |
description | Unstructured and semi-structured radiology reports represent an underutilized trove of information for machine learning (ML)-based clinical informatics applications, including abnormality tracking systems, research cohort identification, point-of-care summarization, semi-automated report writing, and as a source of weak data labels for training image processing systems. Clinical ML systems must be interpretable to ensure user trust. To create interpretable models applicable to all of these tasks, we can build general-purpose systems which extract all relevant human-level assertions or “facts” documented in reports; identifying these facts is an information extraction (IE) task. Previous IE work in radiology has focused on a limited set of information, and extracts isolated entities (i.e., single words such as “lesion” or “cyst”) rather than complete facts, which require the linking of multiple entities and modifiers. Here, we develop a prototype system to extract all useful information in abdominopelvic radiology reports (findings, recommendations, clinical history, procedures, imaging indications and limitations, etc.), in the form of complete, contextualized facts. We construct an information schema to capture the bulk of information in reports, develop real-time ML models to extract this information, and demonstrate the feasibility and performance of the system. |
format | Online Article Text |
id | pubmed-6646440 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-66464402019-08-06 Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning Steinkamp, Jackson M. Chambers, Charles Lalevic, Darco Zafar, Hanna M. Cook, Tessa S. J Digit Imaging Article Unstructured and semi-structured radiology reports represent an underutilized trove of information for machine learning (ML)-based clinical informatics applications, including abnormality tracking systems, research cohort identification, point-of-care summarization, semi-automated report writing, and as a source of weak data labels for training image processing systems. Clinical ML systems must be interpretable to ensure user trust. To create interpretable models applicable to all of these tasks, we can build general-purpose systems which extract all relevant human-level assertions or “facts” documented in reports; identifying these facts is an information extraction (IE) task. Previous IE work in radiology has focused on a limited set of information, and extracts isolated entities (i.e., single words such as “lesion” or “cyst”) rather than complete facts, which require the linking of multiple entities and modifiers. Here, we develop a prototype system to extract all useful information in abdominopelvic radiology reports (findings, recommendations, clinical history, procedures, imaging indications and limitations, etc.), in the form of complete, contextualized facts. We construct an information schema to capture the bulk of information in reports, develop real-time ML models to extract this information, and demonstrate the feasibility and performance of the system. Springer International Publishing 2019-06-19 2019-08 /pmc/articles/PMC6646440/ /pubmed/31218554 http://dx.doi.org/10.1007/s10278-019-00234-y Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Article Steinkamp, Jackson M. Chambers, Charles Lalevic, Darco Zafar, Hanna M. Cook, Tessa S. Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning |
title | Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning |
title_full | Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning |
title_fullStr | Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning |
title_full_unstemmed | Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning |
title_short | Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning |
title_sort | toward complete structured information extraction from radiology reports using machine learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6646440/ https://www.ncbi.nlm.nih.gov/pubmed/31218554 http://dx.doi.org/10.1007/s10278-019-00234-y |
work_keys_str_mv | AT steinkampjacksonm towardcompletestructuredinformationextractionfromradiologyreportsusingmachinelearning AT chamberscharles towardcompletestructuredinformationextractionfromradiologyreportsusingmachinelearning AT lalevicdarco towardcompletestructuredinformationextractionfromradiologyreportsusingmachinelearning AT zafarhannam towardcompletestructuredinformationextractionfromradiologyreportsusingmachinelearning AT cooktessas towardcompletestructuredinformationextractionfromradiologyreportsusingmachinelearning |