Cargando…

Automatic Diagnosis of Spinal Disorders on Radiographic Images: Leveraging Existing Unstructured Datasets With Natural Language Processing

STUDY DESIGN: Retrospective study. OBJECTIVES: Huge amounts of images and medical reports are being generated in radiology departments. While these datasets can potentially be employed to train artificial intelligence tools to detect findings on radiological images, the unstructured nature of the re...

Descripción completa

Detalles Bibliográficos
Autores principales: Galbusera, Fabio, Cina, Andrea, Bassani, Tito, Panico, Matteo, Sconfienza, Luca Maria
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10416592/
https://www.ncbi.nlm.nih.gov/pubmed/34219477
http://dx.doi.org/10.1177/21925682211026910
_version_ 1785087816266416128
author Galbusera, Fabio
Cina, Andrea
Bassani, Tito
Panico, Matteo
Sconfienza, Luca Maria
author_facet Galbusera, Fabio
Cina, Andrea
Bassani, Tito
Panico, Matteo
Sconfienza, Luca Maria
author_sort Galbusera, Fabio
collection PubMed
description STUDY DESIGN: Retrospective study. OBJECTIVES: Huge amounts of images and medical reports are being generated in radiology departments. While these datasets can potentially be employed to train artificial intelligence tools to detect findings on radiological images, the unstructured nature of the reports limits the accessibility of information. In this study, we tested if natural language processing (NLP) can be useful to generate training data for deep learning models analyzing planar radiographs of the lumbar spine. METHODS: NLP classifiers based on the Bidirectional Encoder Representations from Transformers (BERT) model able to extract structured information from radiological reports were developed and used to generate annotations for a large set of radiographic images of the lumbar spine (N = 10 287). Deep learning (ResNet-18) models aimed at detecting radiological findings directly from the images were then trained and tested on a set of 204 human-annotated images. RESULTS: The NLP models had accuracies between 0.88 and 0.98 and specificities between 0.84 and 0.99; 7 out of 12 radiological findings had sensitivity >0.90. The ResNet-18 models showed performances dependent on the specific radiological findings with sensitivities and specificities between 0.53 and 0.93. CONCLUSIONS: NLP generates valuable data to train deep learning models able to detect radiological findings in spine images. Despite the noisy nature of reports and NLP predictions, this approach effectively mitigates the difficulties associated with the manual annotation of large quantities of data and opens the way to the era of big data for artificial intelligence in musculoskeletal radiology.
format Online
Article
Text
id pubmed-10416592
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-104165922023-08-12 Automatic Diagnosis of Spinal Disorders on Radiographic Images: Leveraging Existing Unstructured Datasets With Natural Language Processing Galbusera, Fabio Cina, Andrea Bassani, Tito Panico, Matteo Sconfienza, Luca Maria Global Spine J Original Articles STUDY DESIGN: Retrospective study. OBJECTIVES: Huge amounts of images and medical reports are being generated in radiology departments. While these datasets can potentially be employed to train artificial intelligence tools to detect findings on radiological images, the unstructured nature of the reports limits the accessibility of information. In this study, we tested if natural language processing (NLP) can be useful to generate training data for deep learning models analyzing planar radiographs of the lumbar spine. METHODS: NLP classifiers based on the Bidirectional Encoder Representations from Transformers (BERT) model able to extract structured information from radiological reports were developed and used to generate annotations for a large set of radiographic images of the lumbar spine (N = 10 287). Deep learning (ResNet-18) models aimed at detecting radiological findings directly from the images were then trained and tested on a set of 204 human-annotated images. RESULTS: The NLP models had accuracies between 0.88 and 0.98 and specificities between 0.84 and 0.99; 7 out of 12 radiological findings had sensitivity >0.90. The ResNet-18 models showed performances dependent on the specific radiological findings with sensitivities and specificities between 0.53 and 0.93. CONCLUSIONS: NLP generates valuable data to train deep learning models able to detect radiological findings in spine images. Despite the noisy nature of reports and NLP predictions, this approach effectively mitigates the difficulties associated with the manual annotation of large quantities of data and opens the way to the era of big data for artificial intelligence in musculoskeletal radiology. SAGE Publications 2021-07-05 2023-06 /pmc/articles/PMC10416592/ /pubmed/34219477 http://dx.doi.org/10.1177/21925682211026910 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by-nc-nd/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 License (https://creativecommons.org/licenses/by-nc-nd/4.0/) which permits non-commercial use, reproduction and distribution of the work as published without adaptation or alteration, without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Articles
Galbusera, Fabio
Cina, Andrea
Bassani, Tito
Panico, Matteo
Sconfienza, Luca Maria
Automatic Diagnosis of Spinal Disorders on Radiographic Images: Leveraging Existing Unstructured Datasets With Natural Language Processing
title Automatic Diagnosis of Spinal Disorders on Radiographic Images: Leveraging Existing Unstructured Datasets With Natural Language Processing
title_full Automatic Diagnosis of Spinal Disorders on Radiographic Images: Leveraging Existing Unstructured Datasets With Natural Language Processing
title_fullStr Automatic Diagnosis of Spinal Disorders on Radiographic Images: Leveraging Existing Unstructured Datasets With Natural Language Processing
title_full_unstemmed Automatic Diagnosis of Spinal Disorders on Radiographic Images: Leveraging Existing Unstructured Datasets With Natural Language Processing
title_short Automatic Diagnosis of Spinal Disorders on Radiographic Images: Leveraging Existing Unstructured Datasets With Natural Language Processing
title_sort automatic diagnosis of spinal disorders on radiographic images: leveraging existing unstructured datasets with natural language processing
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10416592/
https://www.ncbi.nlm.nih.gov/pubmed/34219477
http://dx.doi.org/10.1177/21925682211026910
work_keys_str_mv AT galbuserafabio automaticdiagnosisofspinaldisordersonradiographicimagesleveragingexistingunstructureddatasetswithnaturallanguageprocessing
AT cinaandrea automaticdiagnosisofspinaldisordersonradiographicimagesleveragingexistingunstructureddatasetswithnaturallanguageprocessing
AT bassanitito automaticdiagnosisofspinaldisordersonradiographicimagesleveragingexistingunstructureddatasetswithnaturallanguageprocessing
AT panicomatteo automaticdiagnosisofspinaldisordersonradiographicimagesleveragingexistingunstructureddatasetswithnaturallanguageprocessing
AT sconfienzalucamaria automaticdiagnosisofspinaldisordersonradiographicimagesleveragingexistingunstructureddatasetswithnaturallanguageprocessing