Cargando…

Automatic detection and voxel‐wise mapping of lumbar spine Modic changes with deep learning

BACKGROUND: Modic changes (MCs) are the most prevalent classification system for describing magnetic resonance imaging (MRI) signal intensity changes in the vertebrae. However, there is a growing need for novel quantitative and standardized methods of characterizing these anomalies, particularly for...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Kenneth T., Tibrewala, Radhika, Hess, Madeline, Bharadwaj, Upasana U., Inamdar, Gaurav, Link, Thomas M., Chin, Cynthia T., Pedoia, Valentina, Majumdar, Sharmila
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons, Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9238279/
https://www.ncbi.nlm.nih.gov/pubmed/35783915
http://dx.doi.org/10.1002/jsp2.1204
Descripción
Sumario:BACKGROUND: Modic changes (MCs) are the most prevalent classification system for describing magnetic resonance imaging (MRI) signal intensity changes in the vertebrae. However, there is a growing need for novel quantitative and standardized methods of characterizing these anomalies, particularly for lesions of transitional or mixed nature, due to the lack of conclusive evidence of their associations with low back pain. This retrospective imaging study aims to develop an interpretable deep learning‐based detection tool for voxel‐wise mapping of MCs. METHODS: Seventy‐five lumbar spine MRI exams that presented with acute‐to‐chronic low back pain, radiculopathy, and other symptoms of the lumbar spine were enrolled. The pipeline consists of two deep convolutional neural networks to generate an interpretable voxel‐wise Modic map. First, an autoencoder was trained to segment vertebral bodies from T(1)‐weighted sagittal lumbar spine images. Next, two radiologists segmented and labeled MCs from a combined T(1)‐ and T(2)‐weighted assessment to serve as ground truth for training a second autoencoder that performs segmentation of MCs. The voxels in the detected regions were then categorized to the appropriate Modic type using a rule‐based signal intensity algorithm. Post hoc, three radiologists independently graded a second dataset with the aid of the model predictions in an artificial (AI)‐assisted experiment. RESULTS: The model successfully identified the presence of changes in 85.7% of samples in the unseen test set with a sensitivity of 0.71 (±0.072), specificity of 0.95 (±0.022), and Cohen's kappa score of 0.63. In the AI‐assisted experiment, the agreement between the junior radiologist and the senior neuroradiologist significantly improved from Cohen's kappa score of 0.52 to 0.58 (p < 0.05). CONCLUSIONS: This deep learning‐based approach demonstrates substantial agreement with radiologists and may serve as a tool to improve inter‐rater reliability in the assessment of MCs.