Automatic Diagnosis of Spinal Disorders on Radiographic Images: Leveraging Existing Unstructured Datasets With Natural Language Processing
STUDY DESIGN: Retrospective study. OBJECTIVES: Huge amounts of images and medical reports are being generated in radiology departments. While these datasets can potentially be employed to train artificial intelligence tools to detect findings on radiological images, the unstructured nature of the re...
Main Authors: | Galbusera, Fabio; Cina, Andrea; Bassani, Tito; Panico, Matteo; Sconfienza, Luca Maria |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | SAGE Publications 2021 |
Subjects: | Original Articles |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10416592/ https://www.ncbi.nlm.nih.gov/pubmed/34219477 http://dx.doi.org/10.1177/21925682211026910 |
_version_ | 1785087816266416128 |
---|---|
author | Galbusera, Fabio Cina, Andrea Bassani, Tito Panico, Matteo Sconfienza, Luca Maria |
author_sort | Galbusera, Fabio |
collection | PubMed |
description | STUDY DESIGN: Retrospective study. OBJECTIVES: Huge amounts of images and medical reports are being generated in radiology departments. While these datasets can potentially be employed to train artificial intelligence tools to detect findings on radiological images, the unstructured nature of the reports limits the accessibility of information. In this study, we tested if natural language processing (NLP) can be useful to generate training data for deep learning models analyzing planar radiographs of the lumbar spine. METHODS: NLP classifiers based on the Bidirectional Encoder Representations from Transformers (BERT) model able to extract structured information from radiological reports were developed and used to generate annotations for a large set of radiographic images of the lumbar spine (N = 10 287). Deep learning (ResNet-18) models aimed at detecting radiological findings directly from the images were then trained and tested on a set of 204 human-annotated images. RESULTS: The NLP models had accuracies between 0.88 and 0.98 and specificities between 0.84 and 0.99; 7 out of 12 radiological findings had sensitivity >0.90. The ResNet-18 models showed performances dependent on the specific radiological findings with sensitivities and specificities between 0.53 and 0.93. CONCLUSIONS: NLP generates valuable data to train deep learning models able to detect radiological findings in spine images. Despite the noisy nature of reports and NLP predictions, this approach effectively mitigates the difficulties associated with the manual annotation of large quantities of data and opens the way to the era of big data for artificial intelligence in musculoskeletal radiology. |
format | Online Article Text |
id | pubmed-10416592 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-10416592 2023-08-12 Automatic Diagnosis of Spinal Disorders on Radiographic Images: Leveraging Existing Unstructured Datasets With Natural Language Processing Galbusera, Fabio Cina, Andrea Bassani, Tito Panico, Matteo Sconfienza, Luca Maria Global Spine J Original Articles SAGE Publications 2021-07-05 2023-06 /pmc/articles/PMC10416592/ /pubmed/34219477 http://dx.doi.org/10.1177/21925682211026910 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by-nc-nd/4.0/ This article is distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 License (https://creativecommons.org/licenses/by-nc-nd/4.0/) which permits non-commercial use, reproduction and distribution of the work as published without adaptation or alteration, without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
title | Automatic Diagnosis of Spinal Disorders on Radiographic Images: Leveraging Existing Unstructured Datasets With Natural Language Processing |
topic | Original Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10416592/ https://www.ncbi.nlm.nih.gov/pubmed/34219477 http://dx.doi.org/10.1177/21925682211026910 |
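
The description reports accuracy, sensitivity, and specificity for each radiological finding. As a reading aid, the following is a minimal sketch of how these three metrics are conventionally computed for a single binary finding from reference labels and model predictions. It is not the authors' code: the function name `binary_metrics`, the `BinaryMetrics` container, and the example labels are hypothetical and do not come from the study.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class BinaryMetrics:
    """Standard metrics for one binary finding (1 = present, 0 = absent)."""
    sensitivity: float
    specificity: float
    accuracy: float


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> BinaryMetrics:
    """Compute sensitivity, specificity, and accuracy from 0/1 labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    # Count the four cells of the confusion matrix.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative reference label")

    return BinaryMetrics(
        sensitivity=tp / (tp + fn),        # share of true positives detected
        specificity=tn / (tn + fp),        # share of true negatives correctly rejected
        accuracy=(tp + tn) / len(y_true),  # share of all cases classified correctly
    )


# Hypothetical labels for ten images; these are NOT data from the study.
reference = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 1, 0, 0, 0, 0, 0, 1, 0]
metrics = binary_metrics(reference, predicted)
print(metrics)
```

With several findings per image, as in the record above, the same function would be applied once per finding, which is why the reported values are given as ranges across findings.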